AI Dynamics

Global AI News Aggregator

About

Gemma 4 Running Locally on Mac: Free, Safe, and Fast

this is Gemma 4 running locally on a 3 year old mac meaning: – free (=$0 no matter how much you use) – safe (you're not leaking all your data via unsafe APIs) – fast (as you can see) Georgi Gerganov (@ggerganov) Let me demonstrate the true power of llama.cpp: – Running on Mac Studio M2 Ultra (3 years old) – Gemma 4 26B A4B Q8_0 (full quality) – Built-in WebUI (ships with llama.cpp) – MCP support out of the box (web-search, HF, github, etc.) – Prompt speculative decoding The result: 300t/s (realtime video) — https://nitter.net/ggerganov/status/2039752638384709661#m

→ View original post on X — @nandodf, 2026-04-02 19:48 UTC