figured out how to "undo" the RL and turn gpt-oss back into a base model will drop the weights tomorrow gn
Developer Reverses GPT-OSS Reinforcement Learning, Releases Base Model
