LLM Training: RL vs SFT with In-Context Learning

AI Dynamics

Global AI News Aggregator

LLM Training: RL vs SFT with In-Context Learning

–

01 October 2025 20h27

"Don’t be difficult. I mean this is obvious." Sutton is right ofc. The analogue in LLM land to what humans do is something along the lines of: Given this math problem AND human example solution in the context, solve the problem. Reward of 1 if correct. It's not SFT, it's RL.

→ View original post on X — @karpathy,

1 October 2025

AI Dynamics

LLM Training: RL vs SFT with In-Context Learning

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cybercab Uber: Safer, Cheaper Alternative for Single Riders

Zeekr Global Unveils Latest Electric Vehicle Model

Revolutionary New Camera Technology Unveiled

Hidden Camera Recording Family Interactions Raises Privacy Concerns