Building a data set
 General principles
 Data linkage
 Security and storage
 Version control
 Naming conventions
 Descriptive statistics
 Assumptions testing (parametric etc.)
 Handling Missing Data
 Understanding distributions, their application and consequences
 Data transformation