This course gives an introduction to data science. The emphasis is on strategies for the design of experiments, considering both workflow paradigms and aspects of reproducibility and traceability of solutions. Furthermore, knowledge about the lifecycle of data, from acquisition through processing and analysis to the long-term provision and reuse, is covered. Students are also introduced to the complex legal and ethical aspects of working with data.
The following topics are covered in the lectures:
- Introduction to Data Science
- Data and the data lifecycle
- Conceptual Experiment design
- Workflow paradigms
- Data management, reproducibilty and traceability
- Experiment error analysis and statistical testing
- Advanced experiment design
In addtion, two exercises will be done.
The effort breakdown is:
7 2-hour lectures: 14hExercise 1: 15hExercise 2 (incl presentation): 25hExam preparation: 20hExam: 1hSUM: 75h
Syllabus
(all in FH HS2, Thu, 2-4pm c.t.)
BLOCK 14.10.: Introduction to data science - data science process -Hanbury11.10.: Data and the data lifecycle, ethical and legal aspects -Hanbury
BLOCK 2[18.10.: Optional: Machine Learning Primer -Knees]25.10.: Conceptual Experiment Design: Planning and Execution of Experiments, hypotheses -Knees
Exercise 1: Design an experimental workflow for a given dataset
22.11.: Workflow paradigms and Scientific Workflow Environments; iPython, Jupyter Notebook, WEKA, Graphical Experimentation Workflow; -Schindler, KneesBLOCK 329.11.: Facilitating reproducibility and traceability; Basics data management planning and data stewardship; - Rauber6.12.: Experiment Error Analysis and Statistical Testing 1 -Knees13.12.: Experiment Error Analysis and Statistical Testing 2 -KneesExercise 2: Reproduce experimental results from a paper
17.1.: Presentations of Exercise 224.1.: Written Exam
<ul>
<li>Ex1: 1..100 points. Minimum 35.</li>
</ul>
<ul>
<li>Ex2: 1..100 points. Minimum 35.</li>
</ul>
<ul>
<li>Exam: 1..100 points. Minimum 35.</li>
</ul>
<ul>
<li>Final Grade=0.20*Ex1+0.35*Ex2+0.45*Exam. Minimum 50.</li>
</ul>