Update: llama3-8B-Instruct beats GPT3.5
@aymericroucher
-

Llama3-70B-Instruct matches GPT4 in agent benchmark test
By
–
Preliminary testing on my agent benchmark (based on https://
github.com/aymeric-rouche
r/benchmark_agents
…): Llama3-70B-Instruct is on par with GPT4! cc @lvwerra -

Canceling ChatGPT subscription to switch to HuggingChat
By
–
My ChatGPT subscription > Cancel > switch to HuggingChat only.
-
AI Travel Planner Helps Plan Your Next Vacation Quickly
By
–
𝗡𝗲𝘄 𝗦𝗽𝗮𝗰𝗲: 𝘼𝙄 𝙏𝙧𝙖𝙫𝙚𝙡 𝙥𝙡𝙖𝙣𝙣𝙚𝙧 Plan your next vacation in a few minutes! Describe your ideal trip, and it will come up with nice places and recommendations! Try it here
-
Recommended papers for speculative decoding
By
–
@jxmnop what are some good papers you'd recommend to get up to speed on speculative decoding?
-

Beam Search Visualizer featured in Hugging Face Spaces of the Week
By
–
My Beam Search visualizer space is now featured in Spaces of the week! It allows you to check how beam search decoding works in practice. Go check it out https://
huggingface.co/spaces/m-ric/b
eam_search_visualizer
… -
Multi-agent system with orchestrator for flexible error handling
By
–
Great result! This system based on 3 "team member" agents (Web search, Terminal, & Assistant) managed by an Orchestrator. It has no fixed graph structure : at each step the Orchestrator decides which actions should be taken. Advantage: more flexible in case of error!
-

Claude-3 Opus beats GPT-4, GPT-4 Turbo stays top
By
–
The Chatbot Arena was just updated with the ELO rankings for the new Claude-3 models! – 𝗖𝗹𝗮𝘂𝗱𝗲-𝟯 𝙊𝙥𝙪𝙨 𝗯𝗲𝗮𝘁𝘀 𝗚𝗣𝗧-𝟰!
It's the first LLM to beat GPT-4 since its release 1 year ago. – GPT-4 Turbo stays on top with a comfortable lead of ~20 points. -
Find LLM leaderboards: Aymeric Roucher shares ultimate collection by C. Le Fourrier
By
–
Are you trying to find good leaderboards to compare LLMs? @clefourrier is building the ultimate collection here:
-

New HF Space for visualizing chunk splitting methods in RAG
By
–
I've built a new HF Space to let you visualize how different splitting methods affect the chunks you fet for RAG! Try it out here: https://
huggingface.co/spaces/m-ric/c
hunk_visualizer
… It's heavily inspired from @GregKamradt 's http://
chunkviz.com – all credits to him for the idea!
