AI Dynamics

Global AI News Aggregator

OpenAI Releases Reinforcement Fine-Tuning for Flexible Model Training

Today, we got Reinforcement Fine-Tuning (RLFT)! It arrived on day 2 of OpenAI's releases. Unlike Supervised Fine-Tuning (SFT), which is already available on OpenAI for models like 4o and trains a model to reproduce fixed target outputs, RLFT trains models with a flexible, non-fixed objective.
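
To make the contrast concrete, here is a toy sketch of the two kinds of objective. It is not OpenAI's implementation or API. It assumes a tiny "model" that picks one of three candidate answers via a softmax over logits, and every name in it (`sft_loss`, `expected_reward`, `train_with_reward`, the reward values) is hypothetical and chosen only for illustration. The supervised loss pushes probability toward one fixed target, while the reward-based objective only needs a score for each output, so partial credit is possible.

```python
import math


def softmax(logits):
    """Turn raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sft_loss(logits, target_index):
    """Supervised objective: negative log-likelihood of ONE fixed target."""
    return -math.log(softmax(logits)[target_index])


def expected_reward(logits, rewards):
    """Reward-based objective: average score of what the model would output."""
    return sum(p * r for p, r in zip(softmax(logits), rewards))


def reward_gradient(logits, rewards):
    """Exact gradient of the expected reward w.r.t. the logits (softmax policy)."""
    probs = softmax(logits)
    baseline = sum(p * r for p, r in zip(probs, rewards))
    return [p * (r - baseline) for p, r in zip(probs, rewards)]


def train_with_reward(logits, rewards, steps=200, lr=1.0):
    """Plain gradient ascent on the expected reward; returns updated logits."""
    logits = list(logits)
    for _ in range(steps):
        grad = reward_gradient(logits, rewards)
        logits = [x + lr * g for x, g in zip(logits, grad)]
    return logits


if __name__ == "__main__":
    start = [0.0, 0.0, 0.0]
    # Hypothetical scores: answer 0 is fully correct, answer 1 earns partial credit.
    rewards = [1.0, 0.2, 0.0]
    trained = train_with_reward(start, rewards)
    print("SFT loss on fixed target 0:", sft_loss(start, 0))
    print("expected reward before:", expected_reward(start, rewards))
    print("expected reward after: ", expected_reward(trained, rewards))
```

The design point is that `sft_loss` needs a single correct answer per example, whereas `train_with_reward` only needs a way to score answers, which is one reading of what "flexible, non-fixed objective" means here.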

→ View original post on X — @whats_ai