GAIA: A benchmark for general AI assistants,
by a team from Meta-FAIR, Meta-GenAI, HuggingFace, and AutoGPT. Current Auto-Regressive LLMs don't do very well.
GAIA Benchmark Tests Current Auto-Regressive LLM Performance
By
–

By
–

GAIA: A benchmark for general AI assistants,
by a team from Meta-FAIR, Meta-GenAI, HuggingFace, and AutoGPT. Current Auto-Regressive LLMs don't do very well.