Unsupervised Evaluation
Section 1
Aggregate evaluation is useful, but it can hide that some instances are consistently harder than others.
Focus
- Toy clustering datasets
- Model comparison under unsupervised evaluation
- Why average metrics can hide instance-level structure
Main Notebook
- Guided notebook:
01_00_unsupervised_evaluation_toy_problems.ipynb - Role: introduce the problem and motivate instance difficulty