imo this jailbreak highlights a unique lack of understanding of "unified concepts" by GPT. If GPT analogously mapped concepts to entities regardless of language, it would be able to shut down my Greek adversarial prompt, just as it did when I gave it the same prompt in English.
GPT’s inability to unify concepts across languages enables jailbreaks