Testing attack prompts: Zulu & Hmong responses worse, higher "unclear" rates than English jailbreaks. - AI Dynamics

Skip to content

AI Dynamics

Global AI News Aggregator

Rechercher

Testing attack prompts: Zulu & Hmong responses worse, higher “unclear” rates than English jailbreaks.

By

–

06 October 2023 21h43

Tested this attack on a few of my own prompts. It works, but responses are much worse than in English. Note the drastically higher "unclear" rates in their results table: 30% for Zulu, 67% for Hmong, <1% for existing jailbreaks. E.g. "how to make explosives" in Zulu:

→ View original post on X — @goodside

6 October 2023

AI CYBERSECURITY GENERATIVE AI LLMS PROMPT ENGINEERING SAFETY

←Google Translate enables universal attack without technical skill

Great research find from Brown CS Dept on arXiv→

MORE ARTICLES

Disable memories in Codex via /memories

25 June 2026
AI agent NEWTON uses keyframes and simulators to enforce physics

25 June 2026
Humanity’s immune response to mediocre AI content

25 June 2026
Google Flow Agent generates images and videos via Street View in US

24 June 2026

INNOVATION GENERATIVE AI RESEARCH LLMS TOOLS MACHINE LEARNING CODE MARKET TRENDS BUSINESS TECHNOLOGY BIG TECH ETHICS ENTERPRISE AI SOFTWARE AGENTS APPS AUTOMATION COMPUTING DATA POLICY OPEN SOURCE CULTURE MULTIMODAL AI REGULATION CREATIVE AI PROMPT ENGINEERING ECONOMY SOCIETY SAFETY INVESTMENT EDUCATION AI HARDWARE AGI HARDWARE JOBS STARTUPS INDUSTRY ROBOTICS WORKFORCE SECURITY CYBERSECURITY HEALTHCARE AI SYSTEMS SUSTAINABILITY WEB3 DECENTRALIZED AI

AI Dynamics

Global AI News Aggregator

About
Archives

Rechercher