AI Dynamics

Global AI News Aggregator

About

Pythia Paper Findings on Model Architecture Differences

In the Pythia paper, they found that it doesn’t really make a difference if I recall correctly: https://
arxiv.org/abs/2304.01373 @BlancheMinerva may have some additional insights

→ View original post on X — @rasbt