“We need a thousand times more benchmarks than we have right now” is @alexgshaw of @LaudeInstitute's take on the current moment. “Coding is an extremely broad domain, 89 tasks isn’t nearly enough.”
— Snorkel AI (@SnorkelAI) 1 avril 2026
Full Benchtalks interview posted by @vincentsunnchen and YouTube in the replies pic.twitter.com/BRxjq6uzR4
“We need a thousand times more benchmarks than we have right now” is @alexgshaw of @LaudeInstitute's take on the current moment. “Coding is an extremely broad domain, 89 tasks isn’t nearly enough.” Full Benchtalks interview posted by @vincentsunnchen and YouTube in the replies
→ View original post on X — @snorkelai, 2026-04-01 16:35 UTC
Leave a Reply