Learn Before
Case Study

Diagnose a 10% dev-set error in a cat recognition app.

Case context: You are building a cat recognition application. Humans achieve near-perfect performance on this task. Your algorithm achieves a 1% error rate on your training set, and a 1.5% error rate on unseen data drawn from the exact same distribution as the training set. However, when evaluating the algorithm on your dev set, the error rate spikes to 10%.

Question: Diagnose the primary problem affecting the algorithm's performance on the dev set, and explain what specific error rates support your diagnosis.

Sample answer: The primary problem is a data mismatch between the training set and the dev set. This diagnosis is supported by the fact that the algorithm performs well on unseen data from the training distribution (1.5% error), indicating it hasn't severely overfit the training set (1% error). The sudden spike to 10% error on the dev set therefore indicates that the dev set is drawn from a different, harder distribution that the model hasn't learned.

Key points:

  • The diagnosis is a data mismatch problem.
  • The 1.5% error on unseen training-distribution data proves the model generalizes well to the training distribution.
  • The jump from 1.5% to 10% dev-set error isolates the difference in data distributions as the cause of the poor performance.

Rubric: The response must correctly identify "data mismatch" as the diagnosis and justify it by contrasting the small gap between training and unseen same-distribution error with the large gap between unseen same-distribution error and dev-set error.

0

1

Updated 2026-05-27

Contributors are:

Who are from:

Tags

Machine Learning

Deep Learning

Supervised Learning

Dive into Deep Learning @ D2L

Data Science

Machine Learning Strategy

Machine Learning Yearning @ DeepLearning.AI