AI Dynamics

Global AI News Aggregator

About

ReLU Activation Function Gradient Flow Design

It is easy enough to make your own, but I think standard relu should have been defined as passing the value at zero, so gradients flow backward through it, allowing some things to be zero weight initialized when symmetry breaking isn’t an issue.

→ View original post on X — @id_aa_carmack,