Lectures


Practicals


Homework


Notebooks (jupyter)


Datasets


Exam contents

  • NEW: solutions of the first exam
  • Practical exercises (4v)
    - Prediction: Decision Trees, kNN, Evaluation (Confusion Matrices and Residue-based Errors)
    - Description: Agglomerative Clustering, Evaluation (silhouette and purity)
    - Bivariate Exploration: Correlation and Information Gain
  • True or False (1v)

FAQ

  • Exercise 6-7 (homework): Which variables should be considered for clustering the observations? And to assess purity?
    Answer: Consider the input variables (y1 and y2) to learn the clustering solution; and the output variable z to provide the reference groups (ground truth) for assessing the purity of the clustering solution.