AI Dynamics

Global AI News Aggregator

HELM Benchmarks Don’t Predict Real-World Model Performance

HELM consists of academic tasks and doesn't reflect real-world use. A model that scores well on HELM isn't necessarily good, and a model that scores poorly isn't necessarily bad.

→ View original post on X — @yitayml