Every AI lab is pissing away millions on compute. Macaron just ran RL on a 1 trillion parameter model at 10% cost.
Then open sourced the whole damn thing. Already integrated into NVIDIA Megatron and ByteDance Seed Verl while most companies are still writing Medium posts about
Macaron achieves 1T parameter RL training at 10% cost
By
–
