Llama 3.1 405B: Training the Largest Model at Scale

Training a model as large and capable as Llama 3.1 405B was no simple task. The model was trained on more than 15 trillion tokens over the course of several months, requiring over 16,000 NVIDIA H100 GPUs, making it the first Llama model ever trained at this scale. We also used the 405B