AI Dynamics

Global AI News Aggregator

Claude 3.7 Model: Clarifying Distillation vs RL Training Scaling

> – Claude 3.7 is the new o3-mini-high
Wait, 3.7 is a distilled model? I thought the system card said it was RL-trained plus it uses inference time scaling?

→ View original post on X — @rasbt,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *