@dair_ai - AI Dynamics

Meta paper: Agentic Discovery of Neural Architectures

By @dair_ai – 18 May 2026 20h02

NEW paper from Meta: Agentic Discovery of Neural Architectures. This is a hot new area of research! Keep an eye on it.


Paper: GPT-5.4 Nano with Critic-Comparator Reaches SWE-bench Parity

By @dair_ai – 18 May 2026 19h30

NEW paper worth reading. GPT-5.4 nano plus a critic-comparator orchestration loop hits 76.4% on SWE-bench Verified, matching standalone Gemini 3 Pro and Claude Opus 4.5 Thinking. The trick is to select from k=8 weak-model proposals using execution and proof signals. What does

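The selection step described in the post above (pick one of k=8 weak-model proposals using execution and proof signals) could be sketched roughly as follows. This is only an illustration under stated assumptions, not the paper's method: `Proposal`, `signals`, `prefer`, `select`, and `toy_proposals` are hypothetical names, the toy data stands in for real model calls, and ranking the proof signal ahead of the execution signal is an arbitrary choice made for this example.

```python
from dataclasses import dataclass
from functools import reduce
from typing import List, Sequence, Tuple


@dataclass
class Proposal:
    """One candidate produced by the weak model (hypothetical structure)."""
    patch: str
    tests_passed: int  # execution signal: how many checks the patch passes
    proof_ok: bool     # proof signal: whether some verifier accepted it


def signals(p: Proposal) -> Tuple[bool, int]:
    # Assumption: the proof signal outranks the execution signal.
    return (p.proof_ok, p.tests_passed)


def prefer(a: Proposal, b: Proposal) -> Proposal:
    # Pairwise comparator: keep the incumbent unless the challenger is strictly better.
    return b if signals(b) > signals(a) else a


def select(proposals: Sequence[Proposal]) -> Proposal:
    # Run the comparator across all k proposals and return the winner.
    if not proposals:
        raise ValueError("need at least one proposal")
    return reduce(prefer, proposals)


def toy_proposals(k: int = 8) -> List[Proposal]:
    # Deterministic stand-in for k samples from a weak model.
    return [
        Proposal(patch=f"patch-{i}", tests_passed=i % 4, proof_ok=(i % 3 == 0))
        for i in range(k)
    ]


if __name__ == "__main__":
    print(select(toy_proposals(8)).patch)
```

In a real system the two signals would come from actually running tests and a verifier on each patch; here they are hard-coded so the comparator logic can be read in isolation.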

Top AI Papers of the Week (May 11–17)

By @dair_ai – 17 May 2026 16h27

The Top AI Papers of the Week (May 11 – May 17)
– AEvo
– δ-mem
– AutoTTS
– AI Co-Mathematician
– Lighthouse Attention
– Is Grep All You Need?
– A Geometric Calculator Inside a Neural Network
Read on for more:


Personalization Is Key for Research Agents

By @dair_ai – 13 May 2026 1h00

Pay attention to this if you build research or knowledge-work agents. Most research-agent systems generate uniform outputs regardless of who is using them. This new work, NanoResearch, argues that personalization is a prerequisite for true usability and proposes a three-level approach.


Microsoft Research Paper on Agent-Based Interpretability for AI

By @dair_ai – 06 May 2026 22h37

NEW paper from Microsoft Research. (bookmark it) The entire interpretability literature is built around human readers. As more analysis gets delegated to agents, the right target of interpretability shifts. This paper is a recipe for designing tools that agents can actually


New Microsoft Research Paper on Long-Horizon Agent Generalization

By @dair_ai – 05 May 2026 17h06

NEW paper from Microsoft Research. Nice study on long-horizon agent generalization. (bookmark it) The team runs a study where the only variable is task horizon length. They keep the same decision rules and reasoning structure but vary the sequence length to the goal. The main


Meta FAIR Autodata: Agentic System Builds Training and Eval Data Autonomously

By @dair_ai – 04 May 2026 16h44

Banger paper from Meta FAIR. They introduce Autodata, an agentic data scientist that builds high-quality training and evaluation data autonomously. The headline result: on a CS research QA task, an Agentic Self-Instruct loop produces a 34-point gap between weak and strong
