8). Mitigating LLM Jailbreaks with Few Examples – introduces a new approach called for defending LLMs against jailbreak attacks, focusing on quickly adapting defenses after detecting new attacks rather than aiming for perfect adversarial upfront robustness.
Defending LLMs Against Jailbreak Attacks with Few Examples
By
–
Leave a Reply