AI Dynamics

Global AI News Aggregator

About

Inferring Model Size from Speed and Pricing is Complex

I think model size is hard to infer from speed & pricing: it depends on – GPU type
– precision and quantization at inference
– batch scaling
– subsidized or not
– MoE or not
– etc.

→ View original post on X — @rasbt