agreed – automated AI slop doesn’t quite help either. But it’s good to stop every now and then and see where the field is headed too.
@reach_vb
-
Training Data Mix and Domain-Specific Evals Drive AI Model Performance
My intuition is two things:
1. Carefully curated pre-training data & finding the right data mix 2. Domain/ task specific evals for downstream use-cases In the end training data is still king! – finding which combination to go for is where the real moneys at. In addition- Post -
AI Personalization Gaming Disruption Unique Player Experiences
Unique experiences for each player – AI is going to disrupt gaming so hard! https://t.co/2RkQ7gche1
— Vaibhav (VB) Srivastav (@reach_vb) November 1, 2024
-
SmolLM2: Faster, Better, Cheaper Language Model
SmolLM2 – faster, better and cheaper! Intelligence is definitely too cheap to meter.
-
SmolLM2 1.7B Beats Larger Models with Apache 2.0 License
Fuck it – it’s raining smol LMs – SmolLM2 1.7B – beats Qwen 2.5 1.5B & Llama 3.2 1B, Apache 2.0 licensed, trained on 11 trillion tokens
> 135M, 360M, 1.7B parameter models
> Trained on FineWeb-Edu, DCLM, The Stack, along w/ new mathematics and coding datasets
> Specialises in -
VRAM Requirements for AI Models Across Hardware Architectures
It should work on CPU/CUDA/MPS across backends. W.r.t. hardware requirements:
1B should take roughly 2GB VRAM to load in fp16/bf16
600M should take ~1.2GB VRAM
350M – ~700MB VRAM
125M – ~250MB VRAM
Of course, at lower quants (Q4/Q8) you reduce this even further.
-
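The per-size numbers above follow a simple rule of thumb – weights-only VRAM ≈ parameter count × bytes per parameter (2 bytes in fp16/bf16, ~1 at Q8, ~0.5 at Q4). A minimal sketch of that arithmetic (the helper name is mine, not from the thread, and it ignores KV cache and activation memory, so real usage runs a bit higher):

```python
def estimate_vram_gb(n_params: float, bytes_per_param: float = 2.0) -> float:
    """Weights-only memory estimate: params * bytes per param, in GB.

    Default 2.0 bytes/param matches fp16/bf16; use ~1.0 for Q8, ~0.5 for Q4.
    Does not account for KV cache or activations.
    """
    return n_params * bytes_per_param / 1e9


# Matches the fp16/bf16 numbers in the thread:
print(estimate_vram_gb(1e9))    # 2.0  GB for a 1B model
print(estimate_vram_gb(600e6))  # 1.2  GB for 600M
print(estimate_vram_gb(350e6))  # 0.7  GB (~700MB) for 350M
print(estimate_vram_gb(125e6))  # 0.25 GB (~250MB) for 125M
```

At Q4 (`bytes_per_param=0.5`) the same 1B model drops to roughly 0.5GB, which is why low-bit quants make these models fit on nearly anything.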
Tiny AI Models Running on Everyday Devices
Models so smol that they'd even run on your toaster!