AI Dynamics

Global AI News Aggregator

Chain-of-Thought Monitorability for AI Safety

New work on evaluating the quality of chain-of-thought monitorability. Chain-of-thought monitorability is a very encouraging opportunity for safety and alignment, making it easy to see what models are thinking.

→ View original post on X — @gdb