Track Eval Performance Across Revisions in LangSmith Want to make sure that performance on a benchmark dataset isn't degrading as you try new models, tweak the prompt, or change your retrieval strategy? This is now very easy to do with the introduction of test run charts in
LangSmith Test Run Charts Track Model Performance Across Revisions
By
–
Leave a Reply