Great post (5mo ago) "chinchilla's wild implications" giving context to LLM goldrush shifting from model size to dataset size following Chinchilla https://lesswrong.com/posts/6Fpvch8RR29qLEWNH/chinchilla-s-wild-implications
…
Subtle but important detail: the analysis assumes 1 epoch. Recent work (e.g. Galactica) gives hope for the 1+ epoch regime.
Chinchilla’s implications: Dataset size over model size in LLMs