this is Gemma 4 running locally on a 3 year old mac meaning:
— clem 🤗 (@ClementDelangue) 2 avril 2026
– free (=$0 no matter how much you use)
– safe (you're not leaking all your data via unsafe APIs)
– fast (as you can see) https://t.co/yX3gwJ3LF6
this is Gemma 4 running locally on a 3 year old mac meaning: – free (=$0 no matter how much you use) – safe (you're not leaking all your data via unsafe APIs) – fast (as you can see) Georgi Gerganov (@ggerganov) Let me demonstrate the true power of llama.cpp: – Running on Mac Studio M2 Ultra (3 years old) – Gemma 4 26B A4B Q8_0 (full quality) – Built-in WebUI (ships with llama.cpp) – MCP support out of the box (web-search, HF, github, etc.) – Prompt speculative decoding The result: 300t/s (realtime video) — https://nitter.net/ggerganov/status/2039752638384709661#m