At Anthropic, we're preparing for the arrival of powerful AI systems. Based on our latest research on Constitutional Classifiers, we've developed a demo app to test new safety techniques.
— Alex Albert (@alexalbert__) 3 février 2025
We want you to help us red-team the app – so far no one has been able to crack the… pic.twitter.com/CpwnqU1tAF
At Anthropic, we're preparing for the arrival of powerful AI systems. Based on our latest research on Constitutional Classifiers, we've developed a demo app to test new safety techniques. We want you to help us red-team the app – so far no one has been able to crack the
Leave a Reply