AI Dynamics

Global AI News Aggregator

Open-source agent traces dataset: crowdsourcing AI training data

Very cool open-source traces from @TheZachMueller @LambdaAPI: huggingface.co/datasets/lamb… 150M tokens for @NousResearch's Hermes harness with Kimi-K2.5 & GLM 5.1 that was just released!

clem 🤗 (@ClementDelangue):

We keep saying we want open-source frontier agents. Fine. Then let’s build the dataset.

@badlogicgames, creator of Pi, just shared some of his agent traces used to build Pi on @huggingface. I’m now sharing some of mine too, exporting them from @hermes, @opencode, and Claude via @tracesdotcom, and I’ll keep going.

Why this matters: one of the biggest bottlenecks for open-source agent models is the data. And all of us are generating that data every day through our conversations with agents. If enough builders share even a slice of their traces publicly, we can create the largest crowdsourced open dataset for agents.

Time to put your tokens where your mouth is and give a chance for open source to win!

Source: https://nitter.net/ClementDelangue/status/2041189872556269697#m

→ View original post on X — @clementdelangue, 2026-04-07 17:57 UTC
