This course is designed to introduce statistical research methodology--the procedures by which statisticians go about approaching and analyzing data--to early undergraduates. Students will learn basic concepts of statistical learning--inference vs. prediction, supervised vs. unsupervised learning, regression vs. classification, etc.--and will reinforce this knowledge by applying, e.g., linear regression, random forest, principal components analysis, and/or hierarchical clustering and more to datasets provided by the instructor. Students will also practice disseminating the results of their analyses via oral presentations and posters. Analyses will primarily be carried out using the R
programming language, but with attention paid to how one would perform similar analyses using Python. Previous knowledge of R
is not required for this course. Space is very limited; there will be an application process. The course is currently open to sophomore statistics students only.
Fall 2019, Tu-Th 1:30 - 2:50 PM, Wean 4625
Week | Day | Topic |
---|---|---|
1 | Tu | pre-course assessment + R + statistical learning |
Th | R: vectors + lab | |
2 | Tu | R: dplyr + lab |
Th | R: ggplot + lab | |
3 | Tu | exploratory data analysis + lab |
Th | K-means + hierarchical clustering + lab | |
4 | Tu | PCA + lab |
Th | model assessment + bias-variance tradeoff + lab | |
5 | Tu | GLM + linear regression + lab |
Th | logistic regression + lab | |
6 | Tu | best subset selection + lab |
Th | penalized regression + lab | |
7 | Tu | ML + trees + lab |
Th | random forest + lab | |
8 | Tu | reserved for project dataset work |
Th | cancelled: mid-semester break (first DA draft due) | |
9 | Tu | boosting + lab |
Th | KNN + lab | |
10 | Tu | SVM + lab |
Th | naive Bayes + lab | |
11 | Tu | discussion of first DA draft + initial revision |
Th | kernels: density estimation and regression + lab | |
12 | Tu | TBD |
Th | reserved for project dataset work | |
Su | second AD draft due | |
13 | Tu | poster discussion + hackathon preliminaries |
Th | hackathon presentations | |
14 | Tu | cancelled: Thanksgiving |
Th | cancelled: Thanksgiving | |
15 | Tu | reserved for group poster work |
Th | post-course assessment + retrospective survey + poster work | |
Fr | final DA + final poster due | |
F | We | departmental poster presentation |