AI Dynamics

Global AI News Aggregator

About

Running Evals: Scripts, Model Swapping Libraries, and Tools

how do you run these evals? is it just a bunch of scripts? do you use any libraries for swapping models? any other libraries useful in your eval?

→ View original post on X — @swyx