AI Dynamics

Global AI News Aggregator

About

GAIA Benchmark Tests Current Auto-Regressive LLM Performance

GAIA: A benchmark for general AI assistants,
by a team from Meta-FAIR, Meta-GenAI, HuggingFace, and AutoGPT. Current Auto-Regressive LLMs don't do very well.

→ View original post on X — @ylecun