In our LangChain Labs study with @Harvey
, we looked at how to measure efficiency across verifier designs. We benchmarked 5 setups against Sonnet per-criterion as the reference.
LangChain Labs study with Harvey on verifier efficiency benchmarking
By
–
