1/ use the application and provide feedback
2/ if you have a dataset of questions/answer, share those (PR on the GitHub)
3/ once we get a full dataset, trying out different methods and benchmarking (not there yet) https://
t.co/Dj0Xqtpz13
@hwchase17
-
Building AI Applications: Dataset Collection and Benchmarking Methodology
By
–
-
SQL Question Answering Benchmarking Initiative Launch
By
–
Benchmarking SQL Question Answering The next end-to-end application we'll be looking to add benchmarking for is questions over a SQL database Similarly to last time, we need your help!
-
SQL-LangChain App: LLM-Powered Database Querying Tool
By
–
Access the app here: https://
sql-langchain.streamlit.app Please leave feedback on questions that the current implementation does a particularly good or bad job at answering! If you want to see the underlying code, its at this repo: https://
github.com/hwchase17/sql-
qa
… -
Local Models Power Generative Agents Fork Cost Reduction
By
–
Forks of generative agents repo hooked up to local models!!! Keeps costs down but fun up
-
LangSmith: Real-time Monitoring of AI Model Runs
By
–
Fun to watch cool langsmith runs in real time! https://t.co/WwN4uhNHyL
— Harrison Chase (@hwchase17) 15 août 2023Fun to watch cool langsmith runs in real time!
-
LangChain Code Splitter Demo and Sweep AI Webinar
By
–
If you enjoyed our text splitter playground from last week (
https://
langchain-text-splitter.streamlit.app) then you'll want to check out @sweepai
's code splitter demo https://
huggingface.co/spaces/sweepai
/chunker
… And then you'll want to register for our webinar with them (tmrw!) https://
crowdcast.io/c/kkn1f3ewlqvq -
Dataset Evaluation: Key Factor for AI Model Improvement
By
–
"OpenAI said before that evaluating their dataset was one of the major factors for the huge jump from ChatGPT3.5 to 4. Let's do the same for our own applications!" @SimonNom1 expressed better than I could why we're focusing so much on end-to-end evaluations!
-
Benchmarking CSV Question Answering Systems with LangChain
By
–
Full blog here: https://
blog.langchain.dev/benchmarking-q
uestion-answering-over-csv-data/
… And full code here: https://
github.com/langchain-ai/l
angchain-benchmarks
… Just benchmarking CSV question answering for now, but others up next -
Amazon Bedrock LLMs Integration Represents Major Cloud AI Development
By
–
Amazon bedrocks LLMs are a big integration!