Cost is definitely important, though i have two thoughts here:
1. Cost is decreasing quickly. E.g., Flan-PaLM-8B (2022) is about as good as GPT-3 175B (2020). So there is a ~10x improvement in just 2 years.
2. For cases where a model with 90% performance costs 10x more than a
Cost efficiency improvements in large language models
By
–
Leave a Reply