* Evaluate the veracity of the following statements about the decision boundaries in the following picture.
1. If you want them to be smoother, you can change to a smoother nonlinear function in the DNN.
2. If you want them to be smoother, you can increase the number of hidden layers in the DNN.
3. If you want them to be smoother, you can feed the features from a hidden layer into an SVM that uses a smooth kernel.
---
## Question 8
---
## Answer 1
Which of the following is not an advantage of deep neural networks?
2) Any randomly initialized DNN is guaranteed to converge on the global minimum during training.
Convergence is not guaranteed. If it was, we wouldn't worry about exploding gradients.
---
## Answer 2
What can prevent unstable gradients?
4) All of the above.
Any of the answers can help with exploding or vanishing gradients.
---
## Answer 3
What is true about the vanishing gradient problem?
1) The vanishing gradient problem only exists because floating point numbers are approximations.
If we were using real numbers, it wouldn't matter how small they were. The multiplication issue makes our floating point numbers unstable, so removing multiplications can help (as in ResNet), but scaling (as in BatchNorm) doesn't cause vanishing gradients.
---
## Answer 4
Why does an SVM work with the features from the final convolution of a neural network?
3) Feature universality seems to be an emergent property of DNNs trained on large bodies of images.
No guarantees, but, observationaly, this seems to be true.
---
## Answer 5
Which of these is not a method to escape the high cost of labelling?
4) All of the above are valid methods.
---
## Answer 6
* Given the figure at right, evaluate if the following statements are true or false. Assume that the test data is high quality.
1. True, or the curves would closely match
2. True, even if it is not a solution we like
3. False, we can find a model that has stronger regularizers
---
## Answer 7
* Given the figure at right, evaluate if the following statements are true or false. Assume that the test data is high quality.
1. False, test accuracy can be higher than train accuracy (it's harder, noisy, augmentations, etc)
2. True. They it seems to be preventing the test set from converging to 0, the errors must be biased in some way.
3. True. This could also be true.
---
## Answer 8
* Evaluate the veracity of the following statements about the decision boundaries in the picture.
1. True. These harder edges are something you see with ReLU.
2. True. I plotted more versions and can make it arbitrarily smooth.
3. True. Weird thing to do just for smoothness, but it is possible.