1Cademy - Fix Dev and Test Set Labels Together

Learn Before

Mislabeled Examples in the Dev Set

Concept

Fix Dev and Test Set Labels Together

Whatever process is used to fix dev set labels should also be applied to test set labels so that the dev and test sets continue to be drawn from the same distribution. Fixing them together helps avoid optimizing dev-set performance only to be judged later on a different test-set criterion.

Updated 2026-06-18

Contributors are: