AI Dynamics

Global AI News Aggregator

About

Tripling Learning Rate: Unexpected Optimization Discovery

It’s interesting that you can 3X the LR. You’d expect the original paper to be well tuned near what is tolerable.

→ View original post on X — @karpathy