NVIDIA just released a quantized Gemma 4 31B on Hugging Face. NVFP4 compression delivers 4x smaller weights with frontier-level accuracy. Runs on consumer GPUs with a 256K context window.
→ View original post on X — @huggingface, 2026-04-02 17:17 UTC
