Wow this is a game-changer!
— Thomas Wolf (@Thom_Wolf) 4 avril 2025
No. 1 question we get with +1.5M models on the hub: how to find the best one for your use-case?
Here comes YourBench 🪄 – work led by @sumukx @ailozovskaya and team
From a single document example, generate a custom eval with questions and test a… pic.twitter.com/W9tmAwTiaK
Wow this is a game-changer! No. 1 question we get with +1.5M models on the hub: how to find the best one for your use-case? Here comes YourBench – work led by @sumukx @ailozovskaya and team From a single document example, generate a custom eval with questions and test a
