Unsupervised Evaluation

Section 1

Aggregate evaluation is useful, but an average score can hide the fact that some instances are consistently harder than others.

Focus

Toy clustering datasets
Model comparison under unsupervised evaluation
Why average metrics can hide instance-level structure
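
The last point can be illustrated with a minimal sketch. The score matrix below is entirely made up for illustration and is not taken from the guided notebook: rows stand for three hypothetical models, columns for six instances, and 1 means the model handled that instance correctly. Averaging over instances gives one number per model; averaging over models gives one number per instance.

```python
import numpy as np

# Hypothetical per-instance scores (1 = handled correctly, 0 = not).
# Rows are models, columns are instances; values are invented for illustration.
scores = np.array([
    [1, 1, 1, 0, 1, 0],  # model A
    [1, 1, 0, 1, 1, 0],  # model B
    [1, 0, 1, 1, 1, 0],  # model C
])

# Aggregate view: one average per model.
model_means = scores.mean(axis=1)

# Instance-level view: fraction of models that succeed on each instance.
instance_means = scores.mean(axis=0)

# Index of the instance with the lowest success rate across models.
hardest = int(np.argmin(instance_means))

print("per-model average:   ", np.round(model_means, 3))
print("per-instance average:", np.round(instance_means, 3))
print("hardest instance index:", hardest)
```

In this toy matrix the three models have identical averages, so the aggregate view cannot tell them apart, while the per-instance view shows that every model fails on the last instance and that each model fails on a different one of the middle instances.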

Main Notebook

Guided notebook: 01_00_unsupervised_evaluation_toy_problems.ipynb
Role: introduce the problem and motivate instance difficulty
