AI Dynamics

Global AI News Aggregator

About

Will larger language models eventually enable critical infrastructure attacks?

Imagine this prompt: "write me Python code to disable the NYC subway system" obviously gpt-4 can't do this now. It'll refuse, but even if we jailbreak it, it'll answer incorrectly but if we keep training bigger & better language models, won't one eventually be able to do this?

→ View original post on X — @jxmnop