AI Dynamics

Global AI News Aggregator

@dair_ai

  • Natural-Language Agent Harnesses: Making AI Agent Control Portable and Inspectable

    Agent harnesses are too restrictive, and that's because they're still designed as code. What if the harness itself were written in natural language and interpreted by an LLM at runtime? This research explores that idea.

    The work introduces Natural-Language Agent Harnesses (NLAHs), a structured natural-language representation that externalizes harness logic as a portable, executable artifact. Instead of scattering control flow across controller code, framework defaults, and tool adapters, NLAHs make contracts, roles, stage structure, state semantics, and failure taxonomies explicit and editable. An Intelligent Harness Runtime (IHR) places an LLM inside the runtime loop to interpret and execute these harnesses directly.

    Why does it matter? Harness design is increasingly decisive for agent performance, but it's buried in code that's hard to transfer, compare, or ablate. NLAHs make the orchestration layer a first-class scientific object. The practical implication: harnesses become portable across runtimes, composable across tasks, and directly inspectable by humans and models alike.

    Paper: arxiv.org/abs/2603.25723

    Learn to build effective AI agents in our academy: academy.dair.ai/
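    A minimal sketch of the idea, under loud assumptions: the spec schema, stage names, and the `run_harness`/`interpret` functions below are illustrative, not the paper's actual NLAH format or IHR implementation.

```python
# Hypothetical NLAH: harness logic lives in an editable natural-language
# artifact, and an IHR-style loop hands each stage to an LLM interpreter.

NLAH_SPEC = """
role: research assistant
contract: answer only from retrieved documents; cite sources
stages:
  1. plan: break the user question into sub-queries
  2. retrieve: fetch documents for each sub-query
  3. synthesize: draft an answer grounded in the retrieved text
failure_taxonomy:
  - no_documents_found: report the gap instead of guessing
"""

def run_harness(spec: str, interpret) -> list[str]:
    """Runtime loop: execute each natural-language stage in order.
    `interpret` stands in for the LLM that a real IHR would call."""
    stages = [line.split(". ", 1)[1] for line in spec.splitlines()
              if line.strip() and line.strip()[0].isdigit()]
    outputs = []
    state = {"spec": spec}  # contracts and failure modes stay visible to the LLM
    for stage in stages:
        outputs.append(interpret(stage, state))
    return outputs

# Stub interpreter standing in for the LLM:
results = run_harness(NLAH_SPEC, lambda stage, state: f"did: {stage}")
```

    Because the harness is plain text, swapping a stage or tightening the contract is an edit to the artifact, not a code change, which is what makes it portable and inspectable.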

    → View original post on X — @dair_ai, 2026-03-31 13:14 UTC

  • Meta-Harness: Automated System Achieves 6x Performance Improvement

    NEW Stanford & MIT paper on model harnesses. Changing the harness around a fixed LLM can produce a 6x performance gap on the same benchmark. What if we automated harness engineering itself?

    The work introduces Meta-Harness, an agentic system that searches over harness code by exposing the full history through a filesystem. The proposer reads source code, execution traces, and scores from all prior candidates, referencing over 20 past attempts per step.

    Results:
    – On text classification, it improves over SOTA context management by 7.7 points while using 4x fewer tokens.
    – On agentic coding, it outperforms all hand-engineered baselines on TerminalBench-2, scoring 37.6% versus Claude Code's 27.5%.

    Why it's a big deal: the harness around a model often matters as much as the model itself. Meta-Harness shows that giving an optimizer rich access to prior experience, not just compressed scores, unlocks automated engineering that beats human-designed scaffolding.

    Paper: arxiv.org/abs/2603.28052
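    A toy sketch of the filesystem-as-history idea (the directory layout and `propose_next` function are assumptions for illustration, not the paper's code): each prior candidate persists its full artifacts, and the proposer reads them all rather than just a compressed score history.

```python
import json
import pathlib
import tempfile

def propose_next(history_dir: pathlib.Path) -> dict:
    """Proposer step: load every prior candidate's source, trace, and score,
    then seed the next attempt from the best scorer. (A real proposer LLM
    would rewrite the code; here we just return what it would condition on.)"""
    candidates = []
    for cand in sorted(history_dir.iterdir()):
        candidates.append({
            "code": (cand / "harness.py").read_text(),
            "trace": (cand / "trace.log").read_text(),
            "score": json.loads((cand / "score.json").read_text())["score"],
        })
    best = max(candidates, key=lambda c: c["score"])
    return {"parent_code": best["code"], "seen": len(candidates)}

# Populate a toy history of three prior attempts:
root = pathlib.Path(tempfile.mkdtemp())
for i, score in enumerate([0.2, 0.6, 0.4]):
    d = root / f"attempt_{i}"
    d.mkdir()
    (d / "harness.py").write_text(f"# harness v{i}\n")
    (d / "trace.log").write_text("ran 10 episodes\n")
    (d / "score.json").write_text(json.dumps({"score": score}))

nxt = propose_next(root)  # seeds from attempt_1, the best scorer
```

    The design point is that traces and source survive on disk, so the proposer can diagnose *why* an attempt scored 0.2, not just that it did.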

    → View original post on X — @dair_ai, 2026-03-31 13:13 UTC

  • Coding Agents Excel at Processing Massive Long-Context Documents

    We are just scratching the surface of what's possible with coding agents. LLMs struggle with long contexts, even the ones that support massive context windows. It turns out coding agents already know how to solve this; you just need to reframe the problem.

    This work places massive text corpora into directory structures and lets off-the-shelf coding agents (Codex, Claude Code) navigate them with terminal commands and Python scripts. That means you are neither feeding massive text directly into a model's context window nor relying on semantic retrieval.

    Results:
    – On BrowseComp-Plus (750M tokens), this approach scores 88.5% vs the best published 80%.
    – On Oolong-Real (385K tokens), 33.7% vs 24.1%, a 56% relative improvement.
    – A GPT-5 full-context baseline manages only 20% on BrowseComp-Plus.
    – The approach works on corpora up to 3 trillion tokens.

    Instead of scaling context windows or building retrieval pipelines, coding agents that already know how to navigate file systems can process virtually unlimited context. The agents autonomously develop task-specific strategies: writing scripts, iteratively refining queries, and aggregating results programmatically.

    Paper: arxiv.org/abs/2603.20432
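    The setup can be sketched in a few lines; the layout and the `shard_corpus`/`grep` helpers below are assumptions for illustration, standing in for the terminal commands a real coding agent would issue over a much larger tree.

```python
import pathlib
import tempfile

def shard_corpus(docs: dict[str, str], root: pathlib.Path) -> None:
    """One file per document; a real setup might also split and nest by topic."""
    for name, text in docs.items():
        (root / f"{name}.txt").write_text(text)

def grep(root: pathlib.Path, needle: str) -> list[str]:
    """Stands in for the agent running `grep -l` over the corpus tree:
    only matching files are opened, so context usage stays tiny."""
    return [p.name for p in sorted(root.glob("*.txt"))
            if needle in p.read_text()]

root = pathlib.Path(tempfile.mkdtemp())
shard_corpus({"doc_a": "the treaty was signed in 1648",
              "doc_b": "unrelated shipping manifest"}, root)
hits = grep(root, "1648")  # agent narrows to doc_a without reading doc_b
```

    The corpus size is bounded by disk, not by the context window, which is why the same trick scales from 385K tokens to trillions.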

    → View original post on X — @dair_ai, 2026-03-30 15:12 UTC

  • CAID: Multi-Agent Asynchronous Coordination for Software Engineering

    NEW research from CMU (bookmark this one). The biggest unlock in coding agents is understanding how to run them asynchronously. Simply giving a single agent more iterations helps, but does not scale well, and multi-agent research shows that coordination > compute. This paper demonstrates it with a practical multi-agent system.

    CAID (Centralized Asynchronous Isolated Delegation) borrows proven human SWE practices: a manager builds a dependency graph and delegates tasks to engineer agents, which work in isolated git worktrees, execute concurrently, self-verify with tests, and integrate via git merge.

    CAID improves accuracy over single-agent baselines by 26.7% absolute on paper-reproduction tasks (PaperBench) and 14.3% on Python library development tasks (Commit0). The key insight is that isolation plus explicit integration beats both single-agent scaling and naive multi-agent approaches. For long-horizon software engineering tasks, multi-agent coordination using git-native primitives should be the default strategy, not a fallback.

    Paper: arxiv.org/abs/2603.21489
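    The coordination pattern can be sketched with a plain topological sort (an illustrative toy, not the paper's system): the manager releases whichever tasks have all dependencies met, those run in parallel in isolation, and results integrate in dependency order.

```python
from graphlib import TopologicalSorter

def run_caid(tasks: dict[str, set[str]], engineer) -> list[str]:
    """`tasks` maps each task to its set of prerequisite tasks.
    The manager delegates every ready task; in real CAID each would run
    concurrently in its own git worktree and merge back when verified."""
    ts = TopologicalSorter(tasks)
    ts.prepare()
    merged = []
    while ts.is_active():
        for task in ts.get_ready():   # independent tasks: run concurrently
            engineer(task)            # engineer agent self-verifies with tests
            ts.done(task)
            merged.append(task)       # integration step (git merge in CAID)
    return merged

order = run_caid({"api": set(), "db": set(), "ui": {"api", "db"}},
                 engineer=lambda t: None)  # "ui" merges only after api and db
```

    The isolation comes from the worktrees (each agent edits its own checkout), and the explicit integration comes from merging only verified, dependency-ordered work.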

    → View original post on X — @dair_ai, 2026-03-30 14:42 UTC

  • Reasoning Models: Why Listed Prices Don’t Match Actual Costs

    The model you think is cheaper might actually cost you more. New research quantifies exactly how misleading listed API prices are. Across 8 frontier reasoning models and 9 tasks, 21.8% of model-pair comparisons exhibit pricing reversal, where the cheaper-listed model costs more in practice. The magnitude reaches up to 28x.

    Examples:
    – Gemini 3 Flash is listed 78% cheaper than GPT-5.2, yet its actual cost is 22% higher.
    – Claude Opus 4.6 is listed at 2x Gemini 3.1 Pro but actually costs 35% less.

    The root cause is thinking-token heterogeneity: on the same query, one model may use 900% more thinking tokens than another. Why does it matter? Anyone choosing reasoning models for production needs to benchmark actual costs, not listed prices. Removing thinking-token costs reduces ranking reversals by 70%. The authors release code and data for per-task cost auditing.

    Paper: arxiv.org/abs/2603.23971
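    The arithmetic behind a reversal fits in a few lines. The prices and token counts below are made up to illustrate the mechanism, not the paper's measurements: a model listed 4x cheaper per token still ends up costlier once its heavier thinking-token usage is billed.

```python
def actual_cost(price_per_mtok: float, output_tokens: int,
                thinking_tokens: int) -> float:
    """Billed output = visible answer tokens + hidden reasoning tokens."""
    return price_per_mtok * (output_tokens + thinking_tokens) / 1_000_000

# Listed-cheap model that reasons verbosely on this query:
cheap_listed = actual_cost(price_per_mtok=1.0, output_tokens=500,
                           thinking_tokens=20_000)   # $0.0205
# Listed-pricey model that reasons tersely:
pricey_listed = actual_cost(price_per_mtok=4.0, output_tokens=500,
                            thinking_tokens=2_000)   # $0.0100

reversal = cheap_listed > pricey_listed  # the "cheaper" model cost 2x more
```

    This is why the paper's advice is to audit per-task actual cost: the reversal depends on how many thinking tokens each model spends on *your* queries, which the price sheet doesn't show.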

    → View original post on X — @dair_ai, 2026-03-29 15:07 UTC