AI Dynamics

Global AI News Aggregator

About

LLM Performance Falls Apart on Unmemorizable Coding Benchmarks Due to Distribution Shift

Pretty shocking result (that once again confirms what I wrote about the perils of distribution shift, 25 years ago): Translate coding benchmarks into languages LLMs can’t memorize and performance utterly falls apart.

→ View original post on X — @garymarcus