"The biggest problem: developers don’t want GPUs. They want LLMs…. developers thinking about performance in terms of “tokens per second” aren’t counting milliseconds." — @mrkurt with a very clearheaded analysis of the gpu neocloud market. api calls are enough for most.
Developers Prefer LLMs Over Raw GPU Access for Performance
By
–
Leave a Reply