@hardmaru - AI Dynamics

DiffusionBlocks: Training Neural Networks Block by Block

By

@hardmaru

–

28 May 2026 15h47

DiffusionBlocks: Training Neural Networks One Block at a Time

→ View original post on X — @hardmaru,

28 May 2026

Novel Block-Based Backprop Reduces AI Training Memory

By

@hardmaru

–

27 May 2026 16h52

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall.

We found a new way to break the network into blocks and train them… https://t.co/vxyMR6goTD
— hardmaru (@hardmaru) 27 mai 2026

For over a decade, we’ve accepted that end-to-end backprop is the only way to train deep networks. But holding the entire network in memory all at once is why AI training is hitting a resource wall. We found a new way to break the network into blocks and train them

→ View original post on X — @hardmaru,

27 May 2026

AI Forecasting Scientific Progress: Capabilities and Limitations

By

@hardmaru

–

26 May 2026 9h08

Forecasting Scientific Progress with Artificial Intelligence https://
arxiv.org/abs/2605.22681 Turns out AI is just as bad at forecasting biology and physics breakthroughs as we are. To be fair, most breakthroughs cannot be predicted. Science is more like an evolutionary search process.

→ View original post on X — @hardmaru,

26 May 2026

AI Tools Make Engineers More Productive, Expand SWE Teams

By

@hardmaru

–

24 May 2026 3h42

People keep asking if AI will replace software engineers. I believe the exact opposite. Thanks to the Jevons paradox, AI tools are making great engineers 10x more productive, allowing us to tackle much harder, larger-scale problems. We’re expanding our SWE teams at @SakanaAILabs

→ View original post on X — @hardmaru,

24 May 2026

Sakana Fugu: Multi-Agent Orchestration System as Foundation Model

By

@hardmaru

–

04 May 2026 0h00

Sakana Fugu: A Multi-Agent Orchestration System as a Foundation Model

→ View original post on X — @hardmaru,

4 May 2026

KAME Tandem Architecture Boosts Knowledge in Speech AI Systems

By

@hardmaru

–

30 April 2026 5h53

Two Heads Are Better Than One: Async Knowledge Injection for Speech AI with Tandem Architecture Blog: https://
pub.sakana.ai/kame/ KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI Paper: https://
arxiv.org/abs/2510.02327 #ICASSP2026

→ View original post on X — @hardmaru,

30 April 2026

Multi-Agent System Cuts SMBC Corporate Strategy Workflow to Hours

By

@hardmaru

–

30 April 2026 3h09

Today we announced a multi-agent system built with SMBC, one of Japan’s largest banks. It handles complex corporate strategy proposals, reducing a one to two week workflow down to just a few hours. https://
nikkei.com/article/DGXZQO
UB2713R0X20C26A4000000/
…

→ View original post on X — @hardmaru,

30 April 2026

Tandem Voice AI Architecture Enables Speaking While Thinking

By

@hardmaru

–

29 April 2026 19h43

For years, voice AI has been stuck in a rigid loop: think, then speak. But real human conversation is messy, overlapping, and asynchronous.

In our new #ICASSP2026 work, we built a tandem architecture that shifts the paradigm to “speak while thinking.” A fast speech model starts… https://t.co/gyRFlqDSUj
— hardmaru (@hardmaru) 29 avril 2026

For years, voice AI has been stuck in a rigid loop: think, then speak. But real human conversation is messy, overlapping, and asynchronous. In our new #ICASSP2026 work, we built a tandem architecture that shifts the paradigm to “speak while thinking.” A fast speech model starts

→ View original post on X — @hardmaru,

29 April 2026

Conductor Framework Orchestrates AI Agents Using Natural Language

By

@hardmaru

–

28 April 2026 4h37

Learning to Orchestrate Agents in Natural Language with the Conductor Fugu Blog: https://
sakana.ai/fugu-beta
Paper: https://
arxiv.org/abs/2512.04388

→ View original post on X — @hardmaru,

28 April 2026

AI Conductor Model Uses RL to Automate Prompt Engineering

By

@hardmaru

–

27 April 2026 16h55

For the past few years, humans have been doing “prompt engineering” to coax the best performance out of different LLMs. In this work, we explored what happens if we train an AI to do that job instead. By training a Conductor model with RL, we found that it naturally learns to

→ View original post on X — @hardmaru,

27 April 2026