Sleeper Agent LLMs: A Major Security Challenge for AI Systems - AI Dynamics

Skip to content

AI Dynamics

Global AI News Aggregator

Rechercher

Sleeper Agent LLMs: A Major Security Challenge for AI Systems

By

–

12 January 2024 22h30

I touched on the idea of sleeper agent LLMs at the end of my recent video, as a likely major security challenge for LLMs (perhaps more devious than prompt injection). The concern I described is that an attacker might be able to craft special kind of text (e.g. with a trigger

→ View original post on X — @karpathy

12 January 2024

AI CYBERSECURITY ETHICS GENERATIVE AI LLMS SAFETY SECURITY

←When Did Pip Install Become So Spooky?

The Future of Selling Access to Custom AI Models→

MORE ARTICLES

Disable memories in Codex via /memories

25 June 2026
AI agent NEWTON uses keyframes and simulators to enforce physics

25 June 2026
Humanity’s immune response to mediocre AI content

25 June 2026
Google Flow Agent generates images and videos via Street View in US

24 June 2026

INNOVATION GENERATIVE AI RESEARCH LLMS TOOLS MACHINE LEARNING CODE MARKET TRENDS BUSINESS TECHNOLOGY BIG TECH ETHICS ENTERPRISE AI SOFTWARE AGENTS APPS AUTOMATION COMPUTING DATA POLICY OPEN SOURCE CULTURE MULTIMODAL AI REGULATION CREATIVE AI PROMPT ENGINEERING ECONOMY SOCIETY SAFETY INVESTMENT EDUCATION AI HARDWARE AGI HARDWARE JOBS STARTUPS INDUSTRY ROBOTICS WORKFORCE SECURITY CYBERSECURITY HEALTHCARE AI SYSTEMS SUSTAINABILITY WEB3 DECENTRALIZED AI

AI Dynamics

Global AI News Aggregator

About
Archives

Rechercher