The Complexity Dynamics of Grokking A study on how neural network complexity dynamics explain the grokking phenomenon, where models transition from memorization to generalization long after overfitting. Problem: Grokking challenges our understanding of generalization in
Grokking Complexity Dynamics: Memorization to Generalization
By
–
