@maximelabonne - AI Dynamics

Model generalizes to a harness it never saw during training

By @maximelabonne – 10 June 2026 14h52

Fun fact: this model has never seen this harness during training, this is pure generalization. https://t.co/P8hmXZZvuf
— Maxime Labonne (@maximelabonne) 10 June 2026

→ View original post on X — @maximelabonne

10 June 2026

High-level notes and comparisons on PivotRL paper

By @maximelabonne – 08 June 2026 13h20

Anyways, that was a cool paper. I took some high-level notes and comparisons here. PivotRL all the things! https://maximelabonne.substack.com/p/nemotron-3-ultra-what-distillation…

→ View original post on X — @maximelabonne

8 June 2026

Low 20T-token pre-training budget (vs. 33T for DSV4) and an early stop caused by a bug

By @maximelabonne – 08 June 2026 13h18

Could it be linked to the (very) low 20T token pre-training budget? DSV4 was trained on 33T tokens. More room for knowledge in the case of HLE. Looks like they had to stop it early due to a bug they never root-caused. (Not a great demo for NVFP4 pre-training to be honest.)

→ View original post on X — @maximelabonne

8 June 2026

Nemotron 3 Ultra unable to recover HLE and code performance via OPD

By @maximelabonne – 08 June 2026 13h16

Nemotron 3 Ultra can't recover perf on HLE, code, etc. via OPD. The teacher was trained on DeepSeek-V4-Pro traces (DSV4 Max achieves 37.7% on HLE!). Looks like the MOPD warmup failed to properly init the student? No good trajectory → No improvement via OPD.

→ View original post on X — @maximelabonne

8 June 2026

Confirming the growing popularity of edge models

By @maximelabonne – 08 June 2026 12h36

Can confirm, edge models keep getting more and more popular

→ View original post on X — @maximelabonne

8 June 2026

Two new specialized VLMs extract structured outputs quickly and reliably

By @maximelabonne – 05 June 2026 11h09

We released two new specialized VLMs. They extract structured outputs from images quickly and reliably. You can customize your fields directly in the system prompt.

→ View original post on X — @maximelabonne

5 June 2026