Generating candidates is now easy compared to back then. An AI can generate: 100 prompt variants 50 code changes 20 tool-routing ideas 10 eval rewrites 5 new workflows
The question is not “can it come up with changes?”
The question is “which changes are actually better?”
AI Generation Quality Assessment Beyond Candidate Quantity
By
–
Leave a Reply