great! would be neat to have a more comprehensive web-based comparison UI that has a number of categories of tasks with the two models side by side with metrics, and when you click you get the "proof" behind the aggregate metric, with underlying examples and judgements etc.
Comprehensive web-based UI for side-by-side model comparison
By
–
Leave a Reply