We train a ~10M parameter GPT for about 15 minutes on a single GPU, using all of Shakespeare concatenated into one ~1MB text file. We can then sample unlimited fake Shakespeare from our baby GPT. Can you spot which passage is real? At only 10M parameters trained from scratch on 1M characters, I hope so 🙂
Training a 10M Parameter GPT Model on Shakespeare Data
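Before any training happens, the ~1MB Shakespeare file has to be turned into a sequence of integers. The sketch below shows the simplest version of that step, a character-level tokenizer; the inline sample string and the variable names are illustrative stand-ins, not taken from the original post.

```python
# Minimal sketch of character-level tokenization for a tiny GPT.
# In practice `text` would be the full ~1MB Shakespeare file read from disk;
# a short stand-in string is used here so the example is self-contained.
text = "To be, or not to be, that is the question."

# The vocabulary is simply every unique character in the corpus.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}  # char -> integer id
itos = {i: ch for ch, i in stoi.items()}      # integer id -> char

def encode(s: str) -> list[int]:
    """Map a string to a list of integer token ids."""
    return [stoi[c] for c in s]

def decode(ids: list[int]) -> str:
    """Map a list of integer token ids back to a string."""
    return "".join(itos[i] for i in ids)

print(len(chars))                 # vocabulary size for this sample
print(decode(encode("to be")))    # encoding then decoding round-trips
```

On the real Shakespeare file this yields a vocabulary of a few dozen characters, which is why a model this small can be trained from scratch on the data.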