Congrats to @trycua on open-sourcing cua-bench!
— Snorkel AI (@SnorkelAI) 23 janvier 2026
We're collaborating on task design & data curation for GUI workflows – bringing the same systematic evaluation approach from Terminal-Bench to computer-use agents.
15 native tasks / 40 variations + OSWorld & Windows Agent Arena.… https://t.co/7GZkGHHTYx
Congrats to @trycua on open-sourcing cua-bench! We're collaborating on task design & data curation for GUI workflows – bringing the same systematic evaluation approach from Terminal-Bench to computer-use agents. 15 native tasks / 40 variations + OSWorld & Windows Agent Arena.
Leave a Reply