Things move very, very fast in AI. 2025 was a year of RL with verifiable rewards (RLVR), LLM ghosts and jagged intelligence, Cursor-like LLM apps, Claude Code and Codex, vibe coding, NanoBanana showing an early glimpse of LLM-promptable graphical interfaces (PGI, I just coined this lol), and LLM reasoning models crushing olympiad competitions (maths, physics, code). The most game-changing releases tend to come early in the year, in January and February, and then the scaling-up and small fixes begin. I'm eagerly looking forward to new paradigm shifts. What will the next NanoBanana look like: just bigger, or with new capabilities no one has thought of before? There are several stages of training now, RL(VR) being the most recent. What will RL's successor be? And will continual learning be solved in 2026, or is it a problem we will live with for a long time? There are also world models, and agents that actually work reliably in the wild for hours.

Andrej Karpathy (@karpathy) x.com/i/article/200211463822… (https://nitter.net/karpathy/status/2002118205729562949#m)
AI 2025 Breakthroughs: RL, Reasoning Models, and Future Paradigms