Adversarial evaluation framework for AI. 257 models, 142k prompts, 346 attack techniques, 140k FLIP-graded results. https://adrianwedd.com/projects/failure-first/
Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.