* Consider a training set of handwritten digits
* If strokes are different because some pens have more ink that others, we could call that variance
* If strokes are different because someone leaned up against the writing, that is noise
* Meaning it is not reflective of anything in the true data
---
## Gaussian Vs Non-Gaussian Noise
* Gaussian noise can almost always be ignored
* Why? It is unbiased. Any algorithm that guesses the mean of the observations will have removed the noise.
* Example: a bad trackpad that leads to bounding boxes off by up to 5 pixels in any direction
* Biased noise cannot be ignored
* Example: a bad user interface that leads to bounding boxes off by 5 pixels to the left
---
## Separability
* Let's say we are classifying between class A and class B
* Can we draw a line or hyperplane to divide them perfectly?
* If yes, the data is linearly separable
* If not, the data is not linearly separable
* This concept will show up many times, so scream if you don't understand it