> – Claude 3.7 is the new o3-mini-high
Wait, 3.7 is a distilled model? I thought the system card said it was RL-trained plus it uses inference time scaling?
Claude 3.7 Model: Clarifying Distillation vs RL Training Scaling
By
–
Global AI News Aggregator
By
–
> – Claude 3.7 is the new o3-mini-high
Wait, 3.7 is a distilled model? I thought the system card said it was RL-trained plus it uses inference time scaling?
Leave a Reply