Part of AI alignment is staying honest under pressure. Can models hold the line when pushed to lie? @scale_ai & @cais release MASK—1,000+ real-world scenarios designed to test AI honesty under pressure we'll release SEAL rankings on a private set https://
mask-benchmark.ai
MASK Benchmark Tests AI Honesty Under Pressure
By
–
Leave a Reply