Many LLM errors *are* downstream of tokenization oddities, surely. But the folk wisdom that two-R’s-in-strawberry mistakes happen *only* because the letters in “strawberry” are combined into longer prompt tokens before the model can count them is demonstrably not true.
Tokenization isn’t the only cause of LLM counting errors
By
–