@yitayml - AI Dynamics - Page 5 of 26

Inference Time Configuration Changes in Latest Diff

By

@yitayml

–

01 August 2025 17h05

The diff is mostly the inference time config.

→ View original post on X — @yitayml,

1 August 2025

Gemini Deep Think Model Launches with IMO Gold Medal Achievement

By

@yitayml

–

01 August 2025 14h43

The Gemini Deep Think model that achieved IMO gold medal 🥇 is launched! This is a general purpose model that is not only SOTA at math/proofs but also reasoning, code and many others! 🔥

The exact config that achieved IMO gold with scaled up Deep Think is being made available… https://t.co/7Ol5iZQN3l
— Yi Tay (@YiTayML) 1 août 2025

The Gemini Deep Think model that achieved IMO gold medal is launched! This is a general purpose model that is not only SOTA at math/proofs but also reasoning, code and many others! The exact config that achieved IMO gold with scaled up Deep Think is being made available

→ View original post on X — @yitayml,

1 August 2025

Architecture Ablation Study and LLM Objective Functions Analysis

By

@yitayml

–

26 July 2025 6h36

It clearly ablates the main architectures like encoder decoder, decoder only etc. it even tried prefix lm. Also a master class in llm objective functions.

→ View original post on X — @yitayml,

26 July 2025

T5 Paper Remains the Most Insightful LLM Pretraining Classic

By

@yitayml

–

26 July 2025 5h20

I think t5 paper is still the best classic most insightful paper for LLM pretraining. Way ahead of it's time.

→ View original post on X — @yitayml,

26 July 2025

Training a Strong Model with Trust and Morale Support

By

@yitayml

–

24 July 2025 21h26

Had a super fun time training this model. A big yolo run that resulted in a super strong model. Most important thing is to trust your model and give it morale support. Was also a big eye opener to see how prep for IMO is done. Before this I knew absolutely zero about this

→ View original post on X — @yitayml,

24 July 2025

AI Progress: Imperceptible Daily Changes, Remarkable Long-term Growth

By

@yitayml

–

23 July 2025 22h42

Watching AI progress is like watching your child grow up. You don't notice the little day to day changes but all of a sudden after some period of time you look back and you become amazed and shocked by how much have changed. Every minor revision to the model, every

→ View original post on X — @yitayml,

23 July 2025

Yitay expresses pride receiving compliment from renowned AI researcher

By

@yitayml

–

23 July 2025 4h43

Thanks swaroop. Feels so proud to get complimented by the coolest ai researcher!!

→ View original post on X — @yitayml,

23 July 2025

Gemini Dominates This Year

By

@yitayml

–

23 July 2025 3h22

It's entirely Gemini this year bro.

→ View original post on X — @yitayml,

23 July 2025

Model Config Named After Cat Figure on Desk

By

@yitayml

–

22 July 2025 3h48

Another fun fact that this model config was named "imocat". I was thinking of a unique name before launching the job for the first time and saw this on my desk.

→ View original post on X — @yitayml,

22 July 2025