AI Dynamics

Global AI News Aggregator

About

OpenELM Training Data Sources Documentation

This is the best documentation I could find of the OpenELM training data – it looks like the bulk of it comes from RefinedWeb, RedPajama, The Pile and Dolma https://
github.com/apple/corenet/
blob/main/projects/openelm/README-pretraining.md

→ View original post on X — @simonw