AI Dynamics

Global AI News Aggregator

Hyperparameter Changes for Muon Optimization and Optimizer Comparison

What hyperparameters changed when optimizing for Muon? @clashluke has tried several new optimizers on our code and has not reliably beaten AdamW.

→ View original post on X (@id_aa_carmack)
