* Loss and accuracy track one another
* And with bigger networks, we can train forever
* In fact, this deep learning stuff looks easy!
---
## Features
* At this point, we can skip the linear layers of the network
* The features present can be used to train an SVM, but its classification accuracy is the same as the DNN
* We'll come back to this again though
---
## Noisy Labels
* Let's add some noise into the labels
* Given an error rate, change training labels to a different class
---
## 1% Noise Loss