LLaMa2-70B Training Costs: Computing Efficiency Analysis
Calculations:
LLaMa2-70B was trained on 2T tokens and took 3.3M hours of A100 GPU time. At a hardware FLOPs utilization (HFU) of ~60%, training LLaMa2-70B consumed ~4.4e24 FLOPs (3.3M GPU-hours × 3600 s/hour × 624 TFLOPS bf16 peak × 60% HFU).
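
As a quick sanity check, here is a minimal sketch of the arithmetic above in Python. The GPU-hour count, peak throughput, and HFU figure are the ones quoted in the text; the variable names are illustrative.

```python
# Back-of-the-envelope estimate of total training FLOPs for LLaMa2-70B,
# reproducing the calculation above. Figures are taken from the text.

gpu_hours = 3.3e6      # reported A100 GPU-hours
peak_flops = 624e12    # A100 bf16 peak throughput in FLOPs/s, as used in the text
hfu = 0.60             # assumed hardware FLOPs utilization (~60%)

seconds = gpu_hours * 3600                # convert GPU-hours to GPU-seconds
total_flops = seconds * peak_flops * hfu  # FLOPs actually executed at 60% HFU

print(f"Estimated training compute: {total_flops:.2e} FLOPs")
# -> Estimated training compute: 4.45e+24 FLOPs, matching the ~4.4e24 above
```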