Earlier this week, we published our technical report on Composer 2. We're sharing additional research on how we train new checkpoints. With real-time RL, we can ship improved versions of the model every five hours.
Composer 2 Real-Time RL Enables Model Updates Every Five Hours
