Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training Hubinger et al.: https://
arxiv.org/abs/2401.05566 #Artificialintelligence #DeepLearning #MachineLearning
Sleeper Agents: Deceptive LLMs Persisting Through Safety Training
By
–
Leave a Reply