* Most datasets have "noise"
* Errors in labels
* Impossible to learn examples
* We add capacity, and the model learn special cases
* Dogs are always dogs unless they have a soccer ball
* Soccer balls are soccer balls unless held by a bird
* The capacity that is too much is $h*$ in the figure
---
## Error Vs Capacity