Accessibility Skip to Global Navigation Skip to Local Navigation Skip to Content Skip to Search Skip to Site Map Menu

STAT312 Modelling High Dimensional Data

Due to COVID-19 restrictions, a selection of on-campus papers will be made available via distance and online learning for eligible students.
Find out which papers are available and how to apply on our COVID-19 website

An introduction to the statistical learning techniques commonly used to analyse high-dimensional (or multivariate) data. Penalised regression, classification trees, clustering, dimension-reduction, bagging, stacking, boosting, random forests and ensemble learning.

The aim of this paper is to introduce students to many of the statistical learning techniques that are now used to analyse high-dimensional data. Students will learn the underlying rationale for each method and gain practice.

Paper title Modelling High Dimensional Data
Paper code STAT312
Subject Statistics
EFTS 0.15
Points 18 points
Teaching period Semester 2 (On campus)
Domestic Tuition Fees (NZD) $929.55
International Tuition Fees Tuition Fees for international students are elsewhere on this website.

^ Top of page

One of (ECON 210 or FINC 203 or STAT 210 or STAT 241) and STAT 260
STAT 242, STAT 342
Schedule C
Arts and Music, Science

Matthew Parry

Teaching staff

Dr Matthew Parry

Paper Structure

Main topics

  • Penalised regression
  • Classification trees
  • Clustering
  • Dimension-reduction
  • Bagging, stacking and boosting
  • Random forests
  • Ensemble learning

Textbooks are not required for this paper

Graduate Attributes Emphasised
Lifelong learning, Communication, Critical thinking, Research.
View more information about Otago's graduate attributes.
Learning Outcomes

On completion of this paper, students will be able to:

  1. Describe the issues that arise when analysing high-dimensional data.
  2. Use a range of statistical learning techniques to analyse real, high-dimensional data using R
  3. Determine which is the most appropriate technique for a give research objective
  4. Interpret the results of the analysis, including any assumptions or limitations.
  5. Write a clear and succinct report on an analysis of high-dimensional data collaborator or potential client.

^ Top of page


Semester 2

Teaching method
This paper is taught On Campus
Learning management system


Stream Days Times Weeks
A1 Wednesday 08:00-08:50 28-34, 36-41
Thursday 08:00-08:50 28-34, 36-41
Friday 08:00-08:50 28-34, 36-41


Stream Days Times Weeks
Attend one stream from
A1 Friday 13:00-13:50 28-34, 36-41
A2 Thursday 11:00-11:50 28-34, 36-41