

New from Scale: SEAL Leaderboards — a new benchmark arena for frontier LLMs – Private, novel assessments that models can’t train on
– ELO-scale rankings (via Bradley-Terry)
– Domain leaderboards (today: coding, math, instruct, Spanish — more soon!) (Links in reply)
