AI Dynamics

Global AI News Aggregator

Two papers advance LLM jailbreaking and multi-domain adaptation

Paper 1: Siege sets a new state of the art in jailbreaking by formalizing multi-turn attacks as a tree search, achieving a 100% attack success rate on leading LLMs. Paper 2: CS-ReFT adapts LLMs to multiple new domains simultaneously by editing model subspaces, enabling

→ View original post on X (@askalphaxiv)
