AI Dynamics

Global AI News Aggregator

About

Comparative prompt execution tracking for benchmark evolution

I'd love to be able to buy a one-off comparative prompt execution in a few months just to see how a new benchmark has evolved over time

→ View original post on X — @simonw