Which agents did you try? Love to know which one actually worked, we're also thinking about making benchmarks for agents. Would love to collaborate on a benchmark — we have a few internal datasets sitting around.
Which AI agents worked best? Benchmark collaboration proposal
By
–
Leave a Reply