AI Dynamics

Global AI News Aggregator

LLM as Judge: Are All Results a Mediocre 7 out of 10?

Now the bone-chilling question for the lazy people who use LLM-as-judge for everything: have you run this analysis on your judging results, and is everything ~7 out of 10?

→ View original post on X — @swyx
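
The post does not say which analysis it has in mind. One possible reading is a check on how judge scores are distributed across the scale. Below is a minimal Python sketch of that reading, assuming integer scores on a 1 to 10 scale. The function names (score_summary, looks_collapsed) and the thresholds are hypothetical choices for illustration, not anything from the original post.

```python
from collections import Counter
from statistics import mean, pstdev


def score_summary(scores, lo=1, hi=10):
    """Summarize how judge scores are spread across an assumed lo..hi scale."""
    if not scores:
        raise ValueError("scores must be non-empty")
    counts = Counter(scores)
    mode, mode_count = counts.most_common(1)[0]
    return {
        "mean": mean(scores),
        "stdev": pstdev(scores),                 # population standard deviation
        "mode": mode,                            # most frequent score
        "mode_share": mode_count / len(scores),  # fraction of items at the mode
        "distinct_values_used": len(counts),
        "scale_size": hi - lo + 1,
    }


def looks_collapsed(scores, max_stdev=1.0, min_mode_share=0.5):
    """Flag a score set that clusters tightly around one value.

    The thresholds are arbitrary illustrative defaults (an assumption);
    tune them for your own scale and data.
    """
    s = score_summary(scores)
    return s["stdev"] <= max_stdev and s["mode_share"] >= min_mode_share


if __name__ == "__main__":
    judge_scores = [7, 7, 7, 8, 7, 6, 7, 7, 8, 7]  # made-up example data
    print(score_summary(judge_scores))
    print("collapsed:", looks_collapsed(judge_scores))
```

If the flag fires on real judging output, the scores carry little information about which items are actually better, which is the concern the post raises.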