HUGE: a new agentic coding model that fits on 4x RTX 3090s @ 4-bit, fully local. KAT-Dev-72B-Exp by Kwaipilot – Claude Code setup guide included – ranks #2 on SWE-Bench Verified – excels at long-horizon coding + tool use – multi-stage tuned: Mid-Training, SFT + RFT, Agentic RL
@theahmadosman
-
LLM Infrastructure: Still Early, Much Work Ahead
By @theahmadosman
–
LLM infra right now is like Linux in the '90s: we're still early & there is a lot of work to do
-
Build Autograd Engine, Mini-GPT, and LoRA Fine-tuning From Scratch
By @theahmadosman
–
– build an autograd engine from scratch (see the sketch after this list)
– write a mini-GPT from scratch
– implement LoRA and fine-tune a model on real data
– hate CUDA at least once
– cry
– keep going
the roadmap – 5 phases
– if you already know something? skip
– if you're lost? rewatch
– if you're stuck? use
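for the first item on that list, a minimal sketch of a scalar autograd engine in the micrograd spirit; the `Value` class and its op set here are illustrative names, not any particular library's API:
```python
# minimal scalar autograd engine: each Value stores data, an accumulated
# gradient, and a closure that pushes gradients back to its parents
class Value:
    def __init__(self, data, _parents=()):
        self.data = data
        self.grad = 0.0
        self._parents = _parents
        self._backward = lambda: None

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad   # d(a+b)/da = 1
            other.grad += out.grad  # d(a+b)/db = 1
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad  # d(a*b)/da = b
            other.grad += self.data * out.grad  # d(a*b)/db = a
        out._backward = _backward
        return out

    def relu(self):
        out = Value(max(0.0, self.data), (self,))
        def _backward():
            self.grad += (out.data > 0) * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # topological sort, then apply the chain rule output-to-input
        topo, seen = [], set()
        def build(v):
            if v not in seen:
                seen.add(v)
                for p in v._parents:
                    build(p)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

# sanity check: z = relu(x*y + y) at x=2, y=3
x, y = Value(2.0), Value(3.0)
z = (x * y + y).relu()
z.backward()
print(x.grad, y.grad)  # 3.0 3.0 (dz/dx = y, dz/dy = x + 1)
```
-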
GLM-4.5 Air Local Deployment for Sensitive Data Projects
By @theahmadosman
–
yeah i know, i am using glm-4.5 air locally for a sensitive-data project. otherwise that plan is great for its price. they're also dropping 4.6 air soon
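one hedged sketch of what "locally" can look like: serve the model on your own box (vLLM and llama.cpp's llama-server both expose an OpenAI-compatible endpoint) and point the client at localhost. the base_url, port, and model name below are assumptions, match them to your server:
```python
# query a locally served GLM-4.5 Air over an OpenAI-compatible endpoint
# (vLLM and llama.cpp's llama-server both expose one); prompts never
# leave the machine. base_url, port, and model name are assumptions,
# match them to whatever your local server registered.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused-locally")

resp = client.chat.completions.create(
    model="GLM-4.5-Air",  # assumed model id on the local server
    messages=[{"role": "user", "content": "summarize this internal doc: ..."}],
)
print(resp.choices[0].message.content)
```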
-
Ahmad Osman Endorses Open Superintelligence Stack Initiative
By @theahmadosman
–
ahmad osman here, co-signing this with every GPU i own. open superintelligence stack or bust. i approve this message
-
Home AI Server Building Best Practices Guide
By @theahmadosman
–
the basic rules i follow when building an AI server at home (see the lane check after this list)
>direct lanes, x16 or x8, from CPU and never off chipset
>no risers unless absolutely necessary
>airflow must be front-to-back, no hot recirculation
>power budget for transient spikes, not just average draw
>always
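a quick way to verify the "direct lanes" rule above, as promised: a minimal sketch using the pynvml bindings (`pip install nvidia-ml-py`); it assumes an NVIDIA driver is installed, and note the link generation can downshift at idle, so check under load:
```python
# verify each GPU negotiated the PCIe link you expect; a x16/x8 card
# reporting x4 or x1 is likely hanging off the chipset or a flaky riser.
# requires `pip install nvidia-ml-py` and an NVIDIA driver.
import pynvml

pynvml.nvmlInit()
try:
    for i in range(pynvml.nvmlDeviceGetCount()):
        h = pynvml.nvmlDeviceGetHandleByIndex(i)
        name = pynvml.nvmlDeviceGetName(h)
        if isinstance(name, bytes):  # older pynvml versions return bytes
            name = name.decode()
        gen = pynvml.nvmlDeviceGetCurrPcieLinkGeneration(h)
        width = pynvml.nvmlDeviceGetCurrPcieLinkWidth(h)
        print(f"GPU {i}: {name} -> PCIe gen{gen} x{width}")
finally:
    pynvml.nvmlShutdown()
```
-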
RTX 5090 vs. 4x 3090s VRAM Comparison for LLM Inference
By @theahmadosman
–
a 5090 has 32GB of VRAM; 4x 3090s have 96GB. when it comes to LLM inference, we care more about memory: a model fully offloaded into VRAM runs better than one split between system RAM and a single RTX 5090's 32GB
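the napkin math behind that, as a sketch (weights only; real usage adds KV cache, activations, and runtime overhead on top):
```python
# napkin math: weight memory ~= params * bits / 8 bytes (weights only;
# KV cache, activations, and runtime overhead come on top of this)
def weight_gb(params_billion: float, bits: int) -> float:
    return params_billion * bits / 8  # billions of params * bytes/param = GB

for bits in (16, 8, 4):
    need = weight_gb(72, bits)  # e.g. a 72B model like KAT-Dev-72B-Exp
    print(f"72B @ {bits}-bit: ~{need:.0f} GB weights | "
          f"fits 1x 5090 (32GB): {need <= 32} | fits 4x 3090 (96GB): {need <= 96}")
# 72B @ 4-bit: ~36 GB of weights alone, over a single 5090's 32GB
# before any KV cache, but comfortable across 4x 3090s' 96GB
```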
-
Economic Insecurity in the Age of AI Acceleration
By @theahmadosman
–
Nobody is safe in this economy. A C C E L E R A T E
-
Ollama's Bloated Wrapper Fails to Match ggml's Efficiency
By @theahmadosman
–
do not use Ollama. ggerganov wrote blazing-fast C++ inference (ggml, llama.cpp), then Ollama wrapped it in a bloated binary and is now somehow the face of local LLMs, soaking up VC hype. and it's not even a good wrapper lol
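if the point is to use ggerganov's stack directly, a minimal sketch via the llama-cpp-python bindings (`pip install llama-cpp-python`); the GGUF path below is a placeholder:
```python
# call ggerganov's stack (ggml/llama.cpp) directly via the
# llama-cpp-python bindings; no wrapper daemon in between.
# the model path is a placeholder; point it at any local GGUF file.
from llama_cpp import Llama

llm = Llama(
    model_path="./models/your-model.gguf",  # placeholder path
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload every layer to VRAM if it fits
)

out = llm("Q: what is ggml? A:", max_tokens=64, stop=["\n"])
print(out["choices"][0]["text"])
```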