Statistical learning techniques commonly used to analyse high-dimensional (or multivariate) data. Penalised regression, classification trees, clustering, dimension-reduction, bagging, stacking, boosting, random forests and ensemble learning.
Statistical learning techniques commonly used to analyse high-dimensional (or multivariate) data. Principal component analysis, clustering, dimensionality reduction, classification methods, tree-based methods, penalised regression.
The aim of this paper is to introduce students to many of the statistical learning techniques that are now used to analyse high-dimensional data. Students will learn the underlying rationale for each method and gain practice in using it on real data in R.
|Paper title||Statistical Learning|
|Teaching period||Semester 2 (On campus)|
|Domestic Tuition Fees (NZD)||$1,206.91|
|International Tuition Fees||Tuition Fees for international students are elsewhere on this website.|
- STAT 401 or (STAT 260 and STAT 270 and STAT 310), or equivalent (contact department for further information)
- STAT 312
Dr Matthew Parry email@example.com
- Teaching staff
- Paper Structure
- Principal component analysis.
- Exploratory factor analysis.
- Clustering methods.
- Dimensionality reduction.
- Classification methods.
- Tree-based methods.
- Penalised regression.
- Multiple testing.
Textbooks are not required for this paper.
- Graduate Attributes Emphasised
Interdisciplinary perspective, Lifelong learning, Scholarship, Communication, Critical thinking, Research.
View more information about Otago's graduate attributes.
- Learning Outcomes
Students who successfully complete the paper will be able to:
- Describe the issues that arise when analysing high-dimensional data.
- Use a range of statistical learning techniques to analyse real, high-dimensional data using R.
- Determine which is the most appropriate technique for a given research objective.
- Interpret the results of the analysis, including any assumptions or limitations.
- Write a clear and succinct report on an analysis of high-dimensional data for a collaborator or potential client.
- Give an oral presentation of their results.