in the past, I've tried to only post jailbreaks that work on the most advanced models on every question I can think of because to me that's the only fair assessment of current SOTA alignment methods and their limitations
Evaluating AI Alignment: Testing Jailbreaks on Most Advanced Models
By
–
Leave a Reply