System prompt leaking techniques work, but just because something is stated in a system prompt doesn't mean that thing is actually true System prompts don't have to tell the truth, their job to influence the model to behave in certain ways
System Prompts Don’t Guarantee Truth: Manipulation Techniques
By
–
Leave a Reply