Thanks for covering this Simon, @tqchenml & @charlie_ruan are the real MVPs for developing Web-LLM – for LLMs and WebGPU it is literally the superior choice!
@reach_vb
-
Structured JSON Generation with SmolLM2 in Browser
By
–
Fuck it! Structured Generation w/ SmolLM2 running in browser & WebGPU 🔥
— Vaibhav (VB) Srivastav (@reach_vb) November 28, 2024
Powered by MLC Web-LLM & XGrammar ⚡
Define a JSON schema, Input free text, get structured data right in your browser – profit!!
To showcase how much you can do with just a 1.7B LLM, you pass free text, … pic.twitter.com/x5GYWdmTe3
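The contract that structured generation gives you can be sketched in plain Python. This is an illustration, not the WebLLM/XGrammar API: the actual constraint enforcement happens inside XGrammar during decoding, and the schema and sample output below are made up for the sketch.

```python
import json

# A hypothetical JSON schema of the kind you would hand to the engine.
schema = {
    "type": "object",
    "properties": {
        "name": {"type": "string"},
        "age": {"type": "integer"},
    },
    "required": ["name", "age"],
}

# Under grammar-constrained decoding, the model's raw text is guaranteed
# to parse against the schema -- e.g. an output like this:
raw_output = '{"name": "Ada Lovelace", "age": 36}'

data = json.loads(raw_output)  # never raises when decoding is schema-constrained
assert all(key in data for key in schema["required"])
print(data["name"], data["age"])  # structured fields, ready to use
```

The point of the technique is that the `json.loads` call and the `required`-keys check can never fail: invalid tokens are masked out at decode time, so free text in always yields schema-valid JSON out.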
-
Optimizing AWQ Deployment for Flexible Model Distribution
By
–
Think a more likely one would be to help people create the best, most optimised AWQs and then they can deploy wherever they want. Ofc you can have direct deployment options, but I'm 100% sure there's value in this.
-
AGI Accessible: Install QwQ with Two Lines of Code
By
–
You too can have AGI in just a couple lines of code! `pip install transformers` & QwQ is all you need
-
Open Source Model Challenges OpenAI o1 Moat
By
–
That's an Apache 2.0 licensed model competing with OpenAI o1 preview – the moat never existed!
-
Qwen QwQ-32B Preview Model Now Available on Hugging Face
By
–
Try it out here: https://huggingface.co/spaces/Qwen/QwQ-32B-preview
-
QwQ-32B Model Now Available on Hugging Face Hub
By
–
Model on the hub, try it out: https://huggingface.co/Qwen/QwQ-32B-Preview
-
Qwen QwQ 32B Outperforms o1 Mini Model
By
–
WTF! Qwen COOKED – QwQ 32B beats o1 mini and competes with preview!
-
UV: Fast Python package manager and environment tool
By
–
mate have you looked at uv? just do `brew install uv` followed by: `uv venv --python 3.12` that's it https://docs.astral.sh/uv/
-
Model weights and inference code now available
By
–
check out the model weights and inference code here: