AI Dynamics

Global AI News Aggregator

Challenges in Finding LLM Architecture Details Across Papers

Yeah I happen to know how hard it is because I tried to figure it out directly from the paper, and I had to go all the way back to the Palm paper to get the MLP size details — each paper tends to say "our arch is just like except for…"

→ View original post on X — @jeremyphoward,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *