AI Dynamics

Global AI News Aggregator

About

Megaprompt Development Through Human Evaluation and Experimentation

This one has just been a megaprompt with tons of riffing/experimenting and working with a team of human evaluators on the test cases. It's not the kind of thing that can be accurately evaluated by LLMs or programmatically, so it's been fairly labor intensive to get right. Also,

→ View original post on X — @thatroblennon,