AI Dynamics

Global AI News Aggregator

SnorkleWordle Benchmark Evaluates LLM Performance on Rare Words

Word games yield valuable insights when evaluating LLMs. We built the SnorkleWordle benchmark to test models on 100 rare English words—and the results are

→ View original post on X — @snorkelai