Back to work friends. Frontier models
Achieve below 1% on Arc agi 3. Let’s see if this will be saturated by end of year.
AGI
-
Frontier Models Achieve Below 1% on Arc AGI 3
By
–
-
AI as Actor: Governance Challenges and Autonomous Behavior Risks
By
–
The shift from AI as a tool to AI as an actor creates massive governance challenges, including cascading errors and unpredictable autonomous behavior. When we stop giving step-by-step instructions and start giving goals, we lose the ability to ensure the path taken is the one we… pic.twitter.com/RlwpGXas94
— Satya Mallick (@LearnOpenCV) 25 mars 2026The shift from AI as a tool to AI as an actor creates massive governance challenges, including cascading errors and unpredictable autonomous behavior. When we stop giving step-by-step instructions and start giving goals, we lose the ability to ensure the path taken is the one we
-
Harness Engineering: Controlling Powerful AI Agents
By
–
New video out!! If you’ve been hearing “harness engineering”, this one is for you! And it’s not “just” a new term. "Harnesses" matter more than ever because agents got good enough to be both useful and dangerous. They now can do more than generating text, or token. Useful
-
AGI Release Named ‘Spud’ Criticized
By
–
imagine releasing agi and calling it 'spud' x.com/theinformation…
-
Advanced Technologies Arrived in Unexpected Order
By
–
but not in the order that we had anticipated. for instance, we had thought that advanced space travel would come first, then the humanoid robots, then the super intelligent AI. we got it all backwards. https://
x.com/avischiffmann/
status/2036582508536725795
… -
AGI Race Drives Compute Demand at Expense of Popular Apps
By
–
It feels the opposite to me: the need for compute is so huge because AGI is within reach that even popular apps are being sacrificed for it.q https://t.co/VWN8iS1FCq
— Chubby♨️ (@kimmonismus) 24 mars 2026It feels the opposite to me: the need for compute is so huge because AGI is within reach that even popular apps are being sacrificed for it.q
-
OpenAI AGI Claims: Real Achievement or Elaborate Troll?
By
–
Either OpenAI officially achieved AGI or this is the biggest troll move ever: – they rename product organization to "AGI Deployment"
– Altman says the next LLM is a "very strong model"
– it very much accelerate the economy Quote: "Altman also said that the company would be -
Exponential Growth Hidden From Human Perception
By
–
There are exponentials everywhere that our reptilian inner eyes cannot see.
-
Human-in-the-Loop: The Core Feature of Safe AI Agents
By
–
This is exactly it. The approval loop is the entire product. Nobody cares how powerful your agent is if they're scared to let it run unsupervised. Human-in-the-loop isn't a limitation, it's the feature.
-
LLM Development Progress Despite LeCun’s AGI Skepticism
By
–
Despite Yann Lecun's statement that LLMs will not bring us to AGI, I would say that development is still progressing excellently!