llm.c update: Our single file of 2,000 ~clean lines of C/CUDA code now trains GPT-2 (124M) on GPU at speeds ~matching PyTorch (fp32, no flash attention) https://github.com/karpathy/llm.c/blob/master/train_gpt2.cu
… On my A100 I'm seeing 78ms/iter for llm.c and 80ms/iter for PyTorch. Keeping in mind this is fp32 …
llm.c Matches PyTorch Performance Training GPT-2 on GPU