"requests for misuse" only makes sense, to me, if defined as "request to do a thing that the system prompt asks not to do". This is not necessarily harmful or unsafe. It's just not what the developer wanted.
Redefining AI Misuse Requests: Safety vs Developer Intent