15:28Self Play for Safety - Online Multi-Agent Adversarial Training for Provably Robust LLMsNatasha Jaques3.2K views·10 months ago
1:53Personalized Multi-task Learning for Predicting Tomorrow's Mood, Stress, and HealthNatasha Jaques2.0K views·4 years ago
0:32Agent trained with intrinsic social influence reward - Tragedy of the CommonsNatasha Jaques597 views·7 years ago