All our models were trained on at least 1T tokens, much more than what is typically used at this scale.
Interestingly, even after 1T tokens, the 7B model was still improving.
3/n
[Figure: models trained on 1T tokens, with continued improvement at 7B scale]
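To put "1T tokens" in perspective, here is a rough back-of-the-envelope check. It assumes the widely cited ~20-tokens-per-parameter compute-optimal heuristic from the Chinchilla paper (Hoffmann et al., 2022), which the thread itself does not mention, so treat the numbers as a sketch rather than the authors' own comparison:

```python
# Back-of-the-envelope comparison, assuming the ~20-tokens-per-parameter
# "Chinchilla" compute-optimal heuristic (Hoffmann et al., 2022).
params = 7e9                       # 7B-parameter model
chinchilla_tokens = 20 * params    # ~140B tokens: a "typical" budget at this scale
actual_tokens = 1e12               # at least 1T tokens, per the thread

print(f"Compute-optimal budget: {chinchilla_tokens:.0e} tokens")
print(f"Actual budget:          {actual_tokens:.0e} tokens "
      f"(~{actual_tokens / chinchilla_tokens:.0f}x larger)")
```

Under that assumption, 1T tokens is roughly 7x the compute-optimal budget for a 7B model, which is consistent with the claim that the model was still improving at the end of training.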