With coSTAR, we replaced manual agent iteration with an automated test and refine loop. Agents run against defined scenarios. MLflow captures execution traces, and LLM judges score the results. A coding assistant then updates the agent until it passes the tests. This cuts
coSTAR Automates Agent Testing and Refinement Loop
By
–
