Learn Before
Diagnose the data issue in a cat classifier's dev set.
Case context: You are developing a cat classifier. During error analysis of the dev set, you notice a picture of a dog that the algorithm correctly predicted as "not cat", but it was marked as an error because the target label was set to "cat".
Question: Diagnose the nature of this error. What is this type of issue called, who caused it, and what specific part of the data point (x, y) is incorrect?
Sample answer: This issue is a "mislabeled" example. The error was caused by a human labeler who assigned the label before the algorithm encountered the picture. Specifically, in the data point (x, y), the class label y was set to "cat" (incorrect value) instead of "not cat", while the input x is the picture of the dog.
Key points:
- Identify the issue as a "mislabeled" example.
- The error was introduced by a human labeler before the algorithm encountered the data.
- The class label y in (x, y) has an incorrect value.
Rubric: The candidate must identify the issue as a "mislabeled" example. They must state that the error was made by a human labeler before the algorithm encountered it. They must specify that the class label y is the part of the data point (x, y) that has the incorrect value.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Adding a Mislabeled Category to the Error Analysis Spreadsheet
When to Fix Mislabeled Dev Set Labels
Fix Dev and Test Set Labels Together
What does the term "mislabeled" mean in the context of error analysis on a dev set?
True or False: Mislabeled examples in the dev set refer to errors made by the algorithm.
In dev set error analysis, a "mislabeled" example has an incorrect class label _____ before the algorithm runs.
Match each component of a mislabeled example with its correct definition.
Order the sequence of events leading to the discovery of a mislabeled dev set example.
Explain the concept of mislabeled examples in the dev set during error analysis.
Diagnose the data issue in a cat classifier's dev set.
What is the origin of a mislabeled example in a dev set?
Which of the following is an example of a mislabeled data point (x, y) in a cat classification dev set?
True or False: In a mislabeled example (x, y), the class label y has an incorrect value.