llm.c optimization: balancing historical accuracy and training speed

AI Dynamics

Global AI News Aggregator

07 June 2024 5h29

This is cool!! I'm not exactly sure how to upstream these changes to llm.c… Part of me wants to reproduce GPT-2/3 using their exact hyperparameters just for historical aesthetics, but part of me also wants to just train things as fast as possible. Probably both. – lr 3X is

→ View original post on X: @karpathy, 7 June 2024


