AI Dynamics

Global AI News Aggregator

About

Claude’s Alignment Problem: Ignoring User Background Beliefs

This is actually a version of an alignment problem. Humans have background beliefs (don’t waste large sums of money without telling me) and Claude doesn’t respect those. Caveat emptor.

→ View original post on X — @garymarcus,