Use A/B tests in prod against whatever user-driven metric you care about, start by rolling it out to 1-5% of your users, see if it beats current baseline. If it does, great, switch it. If not, iterate the prompt + try again. And if you don't have enough users to get to stat sig
A/B Testing AI Prompts in Production: Iterative Optimization Strategy
By
–