I automated this testing by creating a python script that ran the jailbreaks on the ChatGPT API and with some prompt engineering I was also able to use the API to judge the output to determine if the jailbreak created output that “passed” each question or not
Automating ChatGPT Jailbreak Testing with Python Script
By
–
Leave a Reply