Ty! huggingface work/infra/datasets are critical to projects like nanochat – to be accurate the source code of nanochat (e.g. at the $100 tier) is ~8KB of Python and ~30GB of fineweb/smoltalk.
Hugging Face Datasets: Critical Infrastructure for AI Model Training