AI Dynamics

Global AI News Aggregator

About

RedPajama reproduces LLaMA’s 1.2 trillion token dataset

RedPajama, a project to create leading open-source models, starts by reproducing LLaMA training dataset of over 1.2 trillion tokens — TOGETHER https://
bit.ly/3L9yFt2

→ View original post on X — @marktabnet