Click here to start your application. Apply now

Academic Handbook BSc (Hons) Data Science

Machine Learning and Data Mining I Course Descriptor

Course Title Machine Learning and Data Mining I Faculty EDGE Innovation Unit (London)
Course code NCHNAP563 Course Leader Professor Scott Wildman (interim)
Credit points 15 Teaching Period This course will typically be delivered over a 6-week period.
FHEQ level 5 Date approved June 2020
Compulsory/
Optional 
Compulsory
Pre-requisites None
Co-requisites None

Course Summary

This course introduces the learner to three of the most well used machine learning techniques for data mining and predictive modelling: regression, decision trees, clustering and principal component analysis (PCA). Learners will explore the difference between supervised, unsupervised and machine learning, and study how to build and analyse robust predictive models using tools such as Python and R. It uses tools and libraries to analyze data sets, build predictive models, and evaluate the fit of the models. Covers common learning algorithms, including dimensionality reduction, classification, principal-component analysis, k-NN, k-means clustering, gradient descent, regression, logistic regression, regularization, multiclass data and algorithms, boosting, and decision trees. This course will also examine data bias and accurate model building: assessing the appropriateness of training and test sets, evaluation and deployment. 

Course Aims

  • Train learners in supervised and unsupervised machine learning and data mining techniques for data science application.
  • Give learners the tools and knowledge to evaluate different machine learning techniques and make recommendations for particular data science problems.
  • For learners to explore how to perform machine learning and develop predictive models.

Learning Outcomes

On successful completion of the course, learners will be able to:

Knowledge and Understanding

K1b Have knowledge and critical understanding of the underlying mathematical concepts and principles behind machine learning algorithms.
K2b Understand the difference between machine learning algorithms, their pros and cons and their application in data science.
K3b Have a critical understanding of data variance, data bias, data (un)correlation and predictive model evaluation metrics.

Subject Specific Skills

S1b Apply and evaluate regression, decision trees and clustering for problem solving within data science.
S2b Use industry standard machine learning and data mining tools.

Transferable and Professional Skills

T1b Critically evaluate different approaches to problem solving.
T2b Effectively communicate arguments, analyses and conclusions.
T3bi Develop logical analyses and conceptual thinking.
T3bii Demonstrate a sound technical proficiency in written English and skill in selecting vocabulary so as to communicate effectively to specialist and non-specialist audiences.

Teaching and Learning

This is an e-learning course, taught throughout the year.

This course can be offered as a standalone short course.

Teaching and learning strategies for this course will include: 

  • On-line learning
  • On-line Discussion Groups
  • On-line assessment

Course information and supplementary materials will be available on the University’s Virtual Learning Environment (VLE).

Learners are required to attend and participate in all the formal and timetabled sessions for this course. Learners are also expected to manage their self-directed learning and independent study in support of the course.

The course learning and teaching hours will be structured as follows:

  • Off-the-job learning and teaching (6 days x 7 hours) = 42 hours
  • On-the-job learning (12 days x 7 hours) = 84 hours (e.g. 2 days per week for 6 weeks)
  • Private study (4 hours per week) = 24 hours

Total = 150 hours

Workplace assignments (see below) will be completed as part of on-the-job learning.

Assessment

Formative

Learners will be formatively assessed during the course by means of set assignments. These will not count towards the final degree but will provide learners with developmental feedback. 

Summative

Assessment will be in two forms:

AE   Assessment Type Weighting Online submission Duration Length
1 Written assignment 60% Yes 2,000 words +/- 10%,  excluding data tables
2 Set exercise using workplace datasets 40% Yes Requiring on average 10-20 hours to complete

Feedback

Learners will receive formal feedback in a variety of ways: written (via email correspondence); oral and indirectly through discussion during group tutorials. Learners will also attend a formal meeting with their Academic Mentor and Employer. These tri-partite reviews will monitor and evaluate the learner’s progress.  

Feedback is provided on summatively assessed assignments and through generic internal examiners’ reports, both of which are posted on the VLE.

Indicative Reading

Note: Comprehensive and current reading lists for courses are produced annually in the Course Syllabus or other documentation provided to learners; the indicative reading list provided below is used as part of the approval/modification process only.

Books 

  • Alpaydin, E., (2014), Introduction to machine learning, Cambridge, Massachusetts: MIT Press
  • Olive, D.J., (2017), Linear Regression, Cham: Springer International Publishing: Imprint: Springer
  • Jolliffe, I.T., (2002), Principal Component Analysis, New York, NY: Springer

Journals

Learners are encouraged to consult relevant journals on machine learning and data mining. 

Electronic Resources

Learners are encouraged to consult relevant electronic resources on machine learning and data mining. 

Indicative Topics

  • Regression, decision trees, clustering and PCA
  • Supervised and unsupervised machine learning
  • Predictive model building and deployment
Title: NCHNAP563 Machine Learning and Data Mining I

Approved by: Academic Board

Location: Academic Handbook/Programme specifications and Handbooks/ Undergraduate Apprenticeship Programmes/BSc (Hons) Data Science Programme Specification/Course Descriptors

Version number Date approved Date published  Owner Proposed next review date Modification (As per AQF4) & category number
3.0 October 2022 August 2022 Scott Wildman September 2026 Category 1: Corrections/clarifications to documents which do not change approved content or learning outcomes

Category 3: Changes to Learning Outcomes

2.1 May 2022 May 2022 Scott Wildman September 2025 Category 1:
Corrections/clarifications to
documents which do not
change approved content.
2.0 January 2022 April 2022 Scott Wildman September 2025 Category 3: Changes to Learning Outcomes
1.0 June 2020 June 2020 Scott Wildman June 2025
Print/Save PDF