Prof Armando Teixeira-Pinto Sydney School of Public Health, University of Sydney
Linear Models or Regression methods for epidemiology (or equivalent unit)
Categorical Data and Generalised Linear Models
8-12 hours total study time per week
Two major assignments worth 40% each (equivalent to 2 x 2000 words) and two short assignments worth 10% each.
James G, Witten D, Hastie T, Tibshirani R. An Introduction to Statistical Learning with Applications in R. Springer, 2003. (freely available online: http://www-bcf.usc.edu/~gareth/ISL/ISLR%20Seventh%20Printing.pdf) For details, including ISBN, see the BCA Textbook and Software Guide
R and RStudio
The topics covered include : Linear Regression and K -Nearest Neighbors; Classification (logistic regression, linear discriminant analysis); Resampling Methods (Cross-Validation, Bootstrap); Model Selection and Regularization (subset selection, shrinkage methods, dimension reduction methods); Beyond Linearity (fractional polynomials, basis functions, splines, generalized additive models); Tree-Based Methods (decision trees, bagging, random forests, boosting).
Course notes, assignment material and interaction facilities available online
*co-requisite, may be taken before or concurrently