Optimizing Resource Allocation for Mobile Image Classification
Case context: You are building a mobile image classification application. Your dev/test set consists of 5,000 images taken from mobile phones. To improve performance, you have acquired a supplemental dataset of 200,000 images scraped from the internet. However, you have limited computational resources and cannot afford to train a massive neural network.
Question: Given your limited computational budget, what strategy should you apply to the internet images during training, and why will this strategy help you avoid building an overly large neural network?
Sample answer: You should apply a much lower weight to the 200,000 internet images compared to the 5,000 mobile images during training. Because the internet images are both much larger in quantity and likely from a different distribution, treating them equally would require a massive neural network to model both distributions effectively. Down-weighting the internet images acts as a compromise, reducing the computational burden and allowing a smaller network to focus primarily on the target mobile distribution.
Key points:
- Assign a lower weight to the auxiliary internet images.
- The internet images are much larger in number and have a different distribution.
- Equal weighting requires a massive neural network to learn both distributions.
- Down-weighting focuses the model on the target distribution, reducing resource needs.
Rubric: The answer must propose down-weighting the internet images and explain that it prevents the model from expending excessive capacity on learning the auxiliary distribution, thus saving computational resources.
0
1
Tags
Machine Learning
Deep Learning
Supervised Learning
Dive into Deep Learning @ D2L
Data Science
Machine Learning Strategy
Machine Learning Yearning @ DeepLearning.AI
Related
Tuning Auxiliary Data Weight with the Dev Set
Which condition makes it appropriate to down-weight auxiliary training data according to Andrew Ng?
Down-weighting auxiliary data can reduce the computational burden of training a model on both auxiliary and target-distribution examples.
Re-weighting is needed only when you suspect the additional data has a very _____ distribution than the dev/test set.
Match each training scenario to its key implication for handling auxiliary data.
Order the reasoning steps a practitioner should follow when deciding whether to down-weight large-scale auxiliary training data.
In the 40x internet vs. mobile image scenario, what is the main cost of training equally on both sources without down-weighting?
Giving auxiliary data a lower weight in training is equivalent to removing it from the training set entirely.
By giving internet images a much lower _____, you don't have to build as massive a neural network.
Match each concept to its correct description in the context of auxiliary data down-weighting.
Order the causal steps explaining why down-weighting auxiliary data allows a smaller neural network to be used.
Explain the dual conditions and purpose of down-weighting auxiliary data.
Optimizing Resource Allocation for Mobile Image Classification
Impact of Down-Weighting on Neural Network Size