🦜💪 Public LangSmith Benchmarks
— LangChain (@LangChain) November 22, 2023
Deploying LLM apps requires great evaluation, but writing evals can be painstaking.
We're launching a Q&A benchmark dataset on LangSmith so you can easily compare architectures.
Dataset: https://t.co/UEuqpyb15Q
Blog: https://t.co/Y85mNP1clJ pic.twitter.com/JkZ0MlbHRO
