* We all chose some plan to get here (approximately) on time
* I could imagine traffic and parking that take an hour to get through
* But I am unlikely to leave an hour early
* Why not?
---
## Preferences
* I, as a human being with great predictive powers, can imagine several outcomes
* Outcome 1: I am early. Now I have to sit around somewhere, wasting my time.
* Outcome 2: I am just in time, making optimal use of my time.
* Outcome 3: I am late, forcing me to run up the stairs and break a sweat. I will feel the shame at doing a poor job.
* I can also imagine how good, or bad, any of those outcomes would feel
---
## Utility Theory
* If I can quantify my preferences for those outcomes, and I weigh them against the extra time I get by leaving later
* If I have literally nothing to do, for example, then I may as well get to class an hour early
* There is no change in utility from sitting around my office, staring at a wall, and coming here to stare at a wall
---
## Probability
* If travel time were deterministic, then I could always leave at $class-15m$
* It isn't!
* Each minute earlier increases the probability of outcomes 1 and 2, and decreases 3
* I can multiply the utility of each outcome with their probability, and thus score every departure time
---
## Decision Theory
* Combining probability and utility like that leads to decision theory
* Governs everything from your cleaning robot making a second pass to your emergency braking system
* In a perfect world, we know all utility values and probabilities
* But that obviously isn't the case
---
## Discovering Probability
* Probabilities are discovered from data
* These are the machine learning models that you learned about
* Bayes rule, regression, decision trees, SVM, neural networks
* They all use past data to predict the future
* Some are probability-base, some require calibration
---
## Utility
* We predict the utility of an outcome, but we could be wrong
* For example, we may believe that we want to be in the left lane
* But upon getting there, we discover that it is blocked by an accident
* Simple agents may still hard-code actions
* But advanced ones predict utility
---
## Simulation
* Combining different probabilities is a challenge
* But, if we can accurately model a system, we can use that model itself to train our behavior
* This is the insight behind reinforcement learning
* We'll talk more about this later
---
## Example System
* Let's look at another system
* This one is more complicated than DAVE
* Supports multiple behaviors, simple planning
---
## Project Details
* Also want to talk about the class project details
* This research example can serve as a good example
---
## AI Case Study
* [Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness](https://arxiv.org/abs/1705.02550)
* [Video](https://youtu.be/USYlt9t0lZY?si=2CXPqOkOz--Wp3Aw) on first author's youtube
* Code [https://github.com/NVIDIA-AI-IOT/redtail](https://github.com/NVIDIA-AI-IOT/redtail)
* Project code is stale now, but would not be complicated to replicate
---
## System Components
* Drone egomotion system
* Lidar Lite V3
* PX4FLOW optical flow sensor with 6mm wide-angle lens
* IMU
* Fairly standard part of any quadcopter flight hardware
---
## System Components
* Inference Camera
* Microsoft HD Lifecam HD5000
* 720p, 30fps
* $70^\circ$ FOV (?)
---
## Learned Components