AI Dynamics

Global AI News Aggregator

Why 1M-Context Models Still Don’t Work Beyond 200K Tokens

It is endlessly fascinating to me that we still don't have a true 1M-context model. It's an unusual case where the infra is far ahead of the science. Claude discontinued 1M+ context because it didn't really work past ~200k. Do we not have the right data? The right training techniques? Not sure.

→ View original post on X: @jxmnop