AI Dynamics

Global AI News Aggregator

About

AI Model Evaluation Benchmarks Limited Testing Scope Critique

Same energy as "we only tested on MMLU" a year ago haha

→ View original post on X — @whats_ai,