AI Dynamics

Global AI News Aggregator

About

Measuring Chain-of-Thought Monitorability in AI Models

To preserve chain-of-thought (CoT) monitorability, we must be able to measure it. We built a framework + evaluation suite to measure CoT monitorability — 13 evaluations across 24 environments — so that we can actually tell when models verbalize targeted aspects of their

→ View original post on X — @openai,