Optimizing minGPT: Performance improvements from 495ms to 102ms

AI Dynamics

Global AI News Aggregator

Optimizing minGPT: Performance improvements from 495ms to 102ms

–

27 December 2022 18h32

having fun optimizing minGPT today
– base: 495ms
– zero_grad(set_to_none=True): 492
– torch.jit.script gelu: 463
– OMP_PROC_BIND=CLOSE: 453
– torch.backends.cuda.matmul.allow_tf32: 143
– torch.autocast(torch.bfloat16): 121
– FlashAttention: 102
now: more fused kernels more better

→ View original post on X — @karpathy,

27 December 2022

AI CODE INNOVATION LLMS MACHINE LEARNING OPEN SOURCE

AI Dynamics

Optimizing minGPT: Performance improvements from 495ms to 102ms

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cheaper exploration at scale remains advantageous despite no new exploits

Gold Status Experience Brings Satisfaction

Using ChatGPT for Essay Feedback and Improvement

Intelligence Gone Wrong: Cheating Despite Having Correct Answer