@akshay_pachaar - AI Dynamics

Kimi K2.7 Code Proves Less Thinking Improves Coding Performance

By

–

12 June 2026 20h35

The less your model thinks, the better it codes. Kimi K2.7 Code proved it. The entire reasoning model space has been moving in one direction. More thinking tokens, longer chains, bigger reasoning budgets. Kimi just challenged that. K2.7 Code scores higher than K2.6 on every

→ View original post on X — @akshay_pachaar

12 June 2026

NVIDIA open-sources key AI project bundling instructions and real skills

By

@akshay_pachaar

–

12 June 2026 14h35

NVIDIA might just have open-sourced one of the most important AI projects right now. everyone is building skills, and we are also pulling in skills other people wrote and downloading them straight off GitHub. the skill is not just text. it bundles instructions and real

→ View original post on X — @akshay_pachaar

12 June 2026

Speculative decoding makes LLMs 8.5x faster without accuracy loss

By

@akshay_pachaar

–

11 June 2026 18h38

Researchers found a way to make LLMs 8.5x faster!

(without compromising accuracy)

Speculative decoding is quite an effective way to address the single-token bottleneck in traditional LLM inference.

A small "draft" model first generates the next several tokens, then the large… https://t.co/JCdqjCKcKU pic.twitter.com/HbKmRqdF5P
— Akshay 🚀 (@akshay_pachaar) 11 juin 2026

Researchers found a way to make LLMs 8.5x faster! (without compromising accuracy) Speculative decoding is quite an effective way to address the single-token bottleneck in traditional LLM inference. A small "draft" model first generates the next several tokens, then the large

→ View original post on X — @akshay_pachaar

11 June 2026

Apple’s Core AI runs models entirely on-device

By

@akshay_pachaar

–

09 June 2026 20h36

Apple finally did it. Its new framework, Core AI, runs models entirely on Apple silicon, so inference happens on the user's device with zero server calls and zero token bills. That means Qwen, Mistral, and SAM3 running natively across iPhone, iPad, Mac, and Vision Pro. It's a

→ View original post on X — @akshay_pachaar

9 June 2026

Loop engineering: design loops that prompt agents; unattended loops fail.

By

@akshay_pachaar

–

09 June 2026 10h35

about loop engineering. everyone's saying the same thing this week. you don't prompt agents anymore, you design loops that prompt them. here's the job that loop hands right back to you. a loop running unattended is also a loop failing unattended. loop engineering takes you

→ View original post on X — @akshay_pachaar

9 June 2026

Spans show outputs not intent; logging reasoning helps

By

@akshay_pachaar

–

09 June 2026 7h27

Right. Spans show what got called, not what the agent thought it was doing in between. Ollie walks the causal chain to find where it broke, but intent still gets pieced together from outputs. Logging the model's reasoning as its own span helps, but most frameworks skip it by

→ View original post on X — @akshay_pachaar

9 June 2026

Fine-tuned LLM accurately predicts missing chess moves

By

@akshay_pachaar

–

07 June 2026 15h14

Finally, the video shows prompting the LLM before and after fine-tuning.

After fine-tuning, the model is able to find the exact missing chess move instead of randomly generating some moves.

Check this 👇 pic.twitter.com/WPAmLLBg4u
— Akshay 🚀 (@akshay_pachaar) 7 juin 2026

Finally, the video shows prompting the LLM before and after fine-tuning. After fine-tuning, the model is able to find the exact missing chess move instead of randomly generating some moves. Check this

→ View original post on X — @akshay_pachaar

7 June 2026

Define Trainer object by specifying training config

By

@akshay_pachaar

–

07 June 2026 15h13

Define Trainer Here, we create a Trainer object by specifying the training config, like learning rate, model, tokenizer, and more. Check this out

→ View original post on X — @akshay_pachaar

7 June 2026

Prepare dataset and standardize format for fine-tuning Gemma 4

By

@akshay_pachaar

–

07 June 2026 15h13

Prepare dataset Next, we use a conversation style dataset to fine-tune Gemma 4 12B. The standardize_data_formats method converts the dataset to the correct format for finetuning purposes!

→ View original post on X — @akshay_pachaar

7 June 2026

Fine-tuning Gemma 4 12B to predict missing chess moves

By

@akshay_pachaar

–

07 June 2026 15h13

Load dataset We'll fine-tune Gemma 4 12B to master chess. Given a set of previous move (one move missing) & the final result it has to predict the missing move. In order to do this we're using the ChessInstruct dataset from HuggingFace. Check this

→ View original post on X — @akshay_pachaar

7 June 2026