We're not evaluating "models" (whatever that means), we're evaluating arbitrary AI systems. They can include whatever harnesses or tools they want. But they cannot have been handcrafted or trained for ARC-AGI-3 specifically, because then we wouldn't be testing AGI, we would be
-
AI Testing Performance: Achieving Near-Perfect Scores
Having played all of these games, I feel strongly that I would have scored >95% in a real testing session. Even the #1 human tester replay tends to be very, very far from optimal, and we're using #2 as baseline, so it's easy to score 100% on a given environment. You don't need
-
AGI Must Build Its Own Harness for True Generality
AGI will make its own harness (or whatever else it needs to solve a new problem). As long as you need a human engineer to handcraft a task-specific harness/system for each new problem, AI isn't general. It's an automation tool to be wielded by software engineers. Harness-related
-
Governing the Agentic AI Ecosystem: Future of Autonomous Intelligence
The Era Of The 'Agentic' Ecosystem: How To Govern A World Run By #AI
by @gregoriopatino @Forbes Learn more: https://bit.ly/4d4k68G #GenAI #ArtificialIntelligence #MachineLearning #ML
-
New AGI Eval Focuses Research Efforts on Critical Gaps
If you care about the rate of AGI progress, you should be excited about a new eval that focuses research efforts by pointing out important gaps & providing a way to measure progress towards fixing them If instead you only care about having your preconceptions confirmed, too bad
-
ARC-AGI-3 Environments Mirror Scientific Method for Breakthrough AI
Many people expect that current AI is ready to cure cancer and do breakthrough new science. ARC-AGI-3 envs are like a microcosm of the scientific method: you must observe a tiny world, form a theory of how it works, test it, iterate until correct. Over the course of a few
-
AI Systems Fall Short of Human Job Performance Standards
Virtually every human job on earth has a higher bar. These are not very high expectations for AI systems that claim to be able to do everything humans can.
-
Defining ASI: Super Intelligence Beyond Human Performance
"2+ people can do it out of an unfiltered pool of 10 people that might well be a below-average sample" is not the sign of an insurmountable challenge. It's certainly not where I would set the bar for "super intelligence". ASI is when AI is better than *every single human* — for
-
Power Centralization in AI Attracts Those Seeking Control
Such a centralization of power inevitably attracts those who wish to wield it.
-
Government Surveillance Required to Pause Frontier AI Development
"pausing frontier AI development" is not an action, it's an outcome. A government must do a specific thing to make it happen. It would require, for instance, total government surveillance of all computer use. It would place the state in a uniquely powerful position