this is exactly why I tell people to NOT get a Mac anything for LLMs useless for concurrency and agents, basically a single chat interface that doesn’t scale up neither with sessions nor with kvcache you want to be ready for the agentic world you Buy a GPU
Why Macs Fall Short for Agentic LLM Workloads
By
–
