AI Dynamics

Global AI News Aggregator

Model trainers as RL agents: unprincipled YOLO runs reconsidered

i don't think yolo runs are actually really "yolo". model trainers (humans) are agents that are "RL-ed" by the environment of running many experiments and getting a lot feedback through their careers. it's becoming one with the model. it's not unprincipled, we just don't

→ View original post on X — @yitayml,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *