@alexjc - AI Dynamics - Page 12 of 77

Question Phrasing Impact on AI Technical Answers

By

–

04 November 2025 7h33

I love how both the answers about thermal throttling and speculative decoding are correct based on how you phrased the question!

→ View original post on X — @alexjc

4 November 2025

Language Models Struggle With High School Math Fundamentals

By

@alexjc

–

31 October 2025 12h25

Language models perform poorly on high-school math? You don't want to hear this, but the problems started in grade-school. The moment we (collectively) found acceptable that mid-tier models could score only 75%-85% on a GSM test set of 1.32k straightforward problems…

→ View original post on X — @alexjc

31 October 2025

Faster Coding Model Pricing Efficiency Questions

By

@alexjc

–

30 October 2025 20h25

The speed of a faster coding model is worth it, but it seems mis-priced. C1 gobbles through files, reasons more, expect extra feedback to reach similar place as slower model do with less of everything. Intuitively it feels more expensive "the fast way" with current pricing.

→ View original post on X — @alexjc

30 October 2025

Chat modes code review and conversation summarization evolution

By

@alexjc

–

30 October 2025 10h41

Not a single feature, but evolution of:
1) Making the code changes directly in the files consistently (chat modes) and marking the diffs in a nicely reviewable way.
2) Long chat summarization combined with third-party model capabilities to handle ongoing conversations so you

→ View original post on X — @alexjc

30 October 2025

Compute Trade-offs: Factual Knowledge vs Dense Core Efficiency

By

@alexjc

–

29 October 2025 13h05

There are two kinds of people in AI: those who are happily surprised a model would know random facts like these (worth burning compute and parameters), and those who think it's a complete waste of both for a dense core.

→ View original post on X — @alexjc

29 October 2025

Quality over quantity: improving training datasets for better AI models

By

@alexjc

–

24 October 2025 13h19

Great idea for a metric to further improve what datasets the models train on. It likely leads to an answer that is not web-scale crawling… Less data is often better, better data takes less.

→ View original post on X — @alexjc

24 October 2025

Meta layoffs signal shifting competitive landscape in AI research

By

@alexjc

–

23 October 2025 20h11

Isn't this a sign of the market becoming more competitive, the golden age of open research from U.S. corporations being increasingly over, and now researchers who would be publishing anyway (without it costing Meta) are being let go?

→ View original post on X — @alexjc

23 October 2025

Meta’s Soft Tokens Enable LLMs to Invent Recursive Representations

By

@alexjc

–

23 October 2025 19h16

This paper from Meta about "Soft Tokens" in RL is interesting; it allows LLMs to invent their own non-discrete (recursive) representations in order to solve problems better… Results are mixed though: it's only a few percent better on GSM8k from pass@4 onwards, and pass@32 just

→ View original post on X — @alexjc

23 October 2025

AI Browser Strategy Shifts Scraping Liability to Users

By

@alexjc

–

23 October 2025 16h13

The reason AI companies are rushing to release browsers: they don't want the responsibility / liability of scraping on their servers. They need to push that to the users! We'll be moving into an ever more gated internet soon…

→ View original post on X — @alexjc

23 October 2025

Fair Use Defense Burden of Proof in AI Legal Cases

By

@alexjc

–

22 October 2025 22h32

I'm so glad for this, looking forward to reading! Courts may drag their feet, but specifically when relying on a Fair Use defense, the burden of proof for the absence of market harms falls on the defendants.

→ View original post on X — @alexjc

22 October 2025