AI Dynamics

Global AI News Aggregator

About

System Instructions Security: User Prompts Can Override Safety Measures

No – instruction hierarchy doesn't close the hole completely, it's always possible for the user instructions to override the system instructions if they use the right tricks

→ View original post on X — @simonw