AI Dynamics

Global AI News Aggregator

CUA-Bench Open-Sourced for GUI Agent Evaluation

Congrats to @trycua on open-sourcing cua-bench! We're collaborating on task design & data curation for GUI workflows – bringing the same systematic evaluation approach from Terminal-Bench to computer-use agents. 15 native tasks / 40 variations + OSWorld & Windows Agent Arena.

→ View original post on X — @snorkelai,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *