This video is part of the Introduction to ML Safety course (https://course.mlsafety.org) and was recorded by Dan Hendrycks at the Center for AI Safety.
This video covers the following topics:
- weaponization
- proxy gaming
- treacherous turn
- deceptive alignment
- value lock-in
- persuasive AI