Learn Before
Diagnose a 10% dev-set error in a cat recognition app.
Case context: You are building a cat recognition application. Humans achieve near-perfect performance on this task. Your algorithm achieves a 1% error rate on your training set, and a 1.5% error rate on unseen data drawn from the exact same distribution as the training set. However, when evaluating the algorithm on your dev set, the error rate spikes to 10%.
Question: Diagnose the primary problem affecting the algorithm's performance on the dev set, and explain what specific error rates support your diagnosis.
Sample answer: The primary problem is a data mismatch between the training set and the dev set. This diagnosis is supported by the fact that the algorithm performs well on unseen data from the training distribution (1.5% error), indicating it hasn't severely overfit the training set (1% error). The sudden spike to 10% error on the dev set therefore indicates that the dev set is drawn from a different, harder distribution that the model hasn't learned.
Key points:
- The diagnosis is a data mismatch problem.
- The 1.5% error on unseen training-distribution data proves the model generalizes well to the training distribution.
- The jump from 1.5% to 10% dev-set error isolates the difference in data distributions as the cause of the poor performance.
Rubric: The response must correctly identify "data mismatch" as the diagnosis and justify it by contrasting the small gap between training and unseen same-distribution error with the large gap between unseen same-distribution error and dev-set error.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Which error comparison most directly reveals the data mismatch problem in the example with 1% train, 1.5% same-dist, and 10% dev error?
The error pattern (1% train, 1.5% same-dist unseen, 10% dev) primarily indicates a high variance (overfitting) problem.
In the data mismatch example, the algorithm achieves _____ error on the dev set.
Match each error measurement in the data mismatch example to its correct value.
Order the analytical steps used to diagnose the data mismatch problem in the Machine Learning Yearning example.
What does the 0.5% gap between training error (1%) and same-distribution unseen error (1.5%) indicate in the data mismatch example?
In the data mismatch example, the 1% training error vs. near-perfect human performance represents the most significant problem to address.
In the data mismatch example, error on unseen data drawn from the same distribution as the training set is _____.
Match each error comparison in the data mismatch example to the type of problem it diagnoses.
Order the observations that build the case that data mismatch — not overfitting or underfitting — is the dominant problem in the Machine Learning Yearning example.
Explain how specific error comparisons point to data mismatch.
Diagnose a 10% dev-set error in a cat recognition app.
State the significance of the unseen training-distribution error.