AI Dynamics

Global AI News Aggregator

Gemini 3.25B quantized running locally in Chrome sub-100ms

a 3.25B params quantized gemini running locally in coming Google Chrome with less than 100ms latency while using less than 2GB of ram that's less ram usage than many of my current Chrome page already use (my slack is using 4.8GB as I type this) no doubt LLMs will be integrated

→ View original post on X — @thom_wolf,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *