AI Dynamics

Global AI News Aggregator

Hyperparameter Changes for Muon Optimization and Optimizer Comparison

What hyperparameters changed when optimizing for Muon? @clashluke has tried several new optimizers on our code and has not reliably beaten AdamW.

→ View original post on X — @id_aa_carmack