Napkin math: one A100 hour currently costs ~$1 on cloud providers, so roughly 1.3M GPU hours for the 8B model (see model card) would mean ~$1.3M, and ~$6.4M for the 70B model. Keep in mind that this is just the approximate cost to hit go and wait, assuming a perfect run. And that it takes quite a bit more in
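The arithmetic above can be written out as a small sketch. It assumes the flat ~$1 per A100 hour rate quoted in the text. The 6.4M GPU-hour figure for 70B is not stated directly; it is inferred from the $6.4M estimate at that rate. The names `GPU_HOURS`, `USD_PER_GPU_HOUR`, and `training_cost_usd` are made up for illustration.

```python
# Napkin-math sketch: compute-only cost of one uninterrupted training run.
# Assumption: GPU-hour counts as quoted in the text (70B inferred from $6.4M at $1/hour).
GPU_HOURS = {"8B": 1_300_000, "70B": 6_400_000}

# Assumption: rough cloud price per A100 hour, as quoted in the text.
USD_PER_GPU_HOUR = 1.00


def training_cost_usd(gpu_hours, usd_per_hour=USD_PER_GPU_HOUR):
    """Return GPU hours times hourly price; ignores failed runs, storage, and staff."""
    return gpu_hours * usd_per_hour


for model, hours in GPU_HOURS.items():
    print(f"{model}: ${training_cost_usd(hours) / 1e6:.1f}M")
```

Passing a different `usd_per_hour` shows how sensitive the estimate is to the hourly rate, which is probably the shakiest input here.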
Cost analysis of training 8B and 70B language models on A100