1B model running over 200 tok/s in your browser https://t.co/JArzxn7FN8
- Maxime Labonne (@maximelabonne), 25 February 2026
Quoted post from Xenova (@xenovacom): Okay, this is actually insane… You can now run LFM2.5-1.2B-Thinking (a 1.2B parameter LLM from @LiquidAI) at over 200 tokens per second directly in your browser on WebGPU! 🤯 Zero install. Fully private. Blazingly fast. Powered by Transformers.js and ONNX Runtime Web. https://nitter.net/xenovacom/status/2026727703836004796#m
View original post on X: @maximelabonne, 2026-02-25 18:47 UTC