"Requests for misuse" only makes sense, to me, if defined as "requests to do a thing that the system prompt asks not to do". Such a request is not necessarily harmful or unsafe. It's just not what the developer wanted.