AI Dynamics

Global AI News Aggregator

MASK Benchmark Tests AI Honesty Under Pressure

Part of AI alignment is staying honest under pressure. Can models hold the line when pushed to lie? @scale_ai & @cais release MASK: 1,000+ real-world scenarios designed to test AI honesty under pressure. We'll release SEAL rankings on a private set. https://mask-benchmark.ai

→ View original post on X — @alexandr_wang