"prevent exploitative chat injections": nothing specific is planned right now, but many prompt injections already fail because of other filters in the game. We will need to test for prompt injections once our prompt engineering (alignment engineering) phase is over.
Preventing Exploitative Chat Injections Through Alignment Engineering