Gemini 3.25B quantized running locally in Chrome sub-100ms

AI Dynamics

Global AI News Aggregator

Gemini 3.25B quantized running locally in Chrome sub-100ms

–

24 June 2024 16h20

a 3.25B params quantized gemini running locally in coming Google Chrome with less than 100ms latency while using less than 2GB of ram

that's less ram usage than many of my current Chrome page already use (my slack is using 4.8GB as I type this)

no doubt LLMs will be integrated… https://t.co/Muy8o4qPVV
— Thomas Wolf (@Thom_Wolf) 24 juin 2024

a 3.25B params quantized gemini running locally in coming Google Chrome with less than 100ms latency while using less than 2GB of ram that's less ram usage than many of my current Chrome page already use (my slack is using 4.8GB as I type this) no doubt LLMs will be integrated

→ View original post on X — @thom_wolf,

24 June 2024

AI AI HARDWARE APPS COMPUTING GENERATIVE AI HARDWARE INNOVATION LLMS SOFTWARE TECHNOLOGY

AI Dynamics

Gemini 3.25B quantized running locally in Chrome sub-100ms

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

Cybercab Uber: Safer, Cheaper Alternative for Single Riders

Zeekr Global Unveils Latest Electric Vehicle Model

Revolutionary New Camera Technology Unveiled

Hidden Camera Recording Family Interactions Raises Privacy Concerns