Traditional ML

Clustering, PCA, and Anomaly Detection

Prepare for unsupervised ML interview questions where labels are weak or delayed.

Recommended on day 1890 minutesIntermediate

Learning objectives

  • Describe clustering choices and what distance metric assumptions imply
  • Explain PCA as compression, denoising, and visualization rather than just a formula
  • Choose metrics for anomaly detection when positives are rare

Interview prompts

  • How do you evaluate an anomaly detector with few labels?
  • What does PCA throw away and why does that matter?