Looks like this 4-bit quantized MLX model uses about 35GB of RAM to run a prompt and produces an almost-working Space Invaders https://t.co/S9b64sxYlq https://t.co/y8k0SspsgM
— Simon Willison (@simonw) July 31, 2025
https://huggingface.co/mlx-community/Qwen3-Coder-30B-A3B-Instruct-4bit …
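As a rough sanity check on the memory figure, here is a minimal back-of-the-envelope sketch of how much RAM the quantized weights alone might take. Everything in it is an assumption rather than something the tweet states: the 30 billion parameter count is read off the "30B" in the model name, and the group size of 64 with a 16-bit scale and 16-bit bias per group is an assumed quantization layout. The function name `quantized_weight_bytes` is hypothetical.

```python
def quantized_weight_bytes(
    n_params: float,
    bits: int = 4,
    group_size: int = 64,              # assumed quantization group size
    overhead_bits_per_group: int = 32  # assumed 16-bit scale + 16-bit bias per group
) -> float:
    """Approximate bytes needed to store group-quantized weights."""
    # Each weight costs `bits`, plus its share of the per-group scale and bias.
    bits_per_weight = bits + overhead_bits_per_group / group_size
    return n_params * bits_per_weight / 8


# Assumption: roughly 30e9 parameters, taken from "30B" in the model name.
estimate = quantized_weight_bytes(30e9)
print(f"{estimate / 1e9:.1f} GB")  # prints 16.9 GB under these assumptions
```

Under these assumptions the weights account for roughly half of the reported 35GB. The tweet does not break the rest down; it would plausibly include things like the KV cache, activations, and runtime overhead, but that is a guess.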