Today, we got Reinforcement Fine-Tuning (RLFT)! (OpenAI releases day 2) Unlike Supervised Fine-Tuning (SFT) (already available on OpenAI for models like 4o), RLFT trains models with a flexible, non-fixed objective.
OpenAI Releases Reinforcement Fine-Tuning for Flexible Model Training
By
–