@aymericroucher - AI Dynamics

Llama3-8B-Instruct outperforms GPT3.5

By

@aymericroucher

–

19 April 2024 16h48

Update: llama3-8B-Instruct beats GPT3.5

→ View original post on X — @aymericroucher

19 April 2024

Llama3-70B-Instruct matches GPT4 in agent benchmark test

By

@aymericroucher

–

19 April 2024 14h41

Preliminary testing on my agent benchmark (based on https://
github.com/aymeric-rouche
r/benchmark_agents
…): Llama3-70B-Instruct is on par with GPT4! cc @lvwerra

→ View original post on X — @aymericroucher

19 April 2024

Canceling ChatGPT subscription to switch to HuggingChat

By

@aymericroucher

–

12 April 2024 23h35

My ChatGPT subscription > Cancel > switch to HuggingChat only.

→ View original post on X — @aymericroucher

12 April 2024

AI Travel Planner Helps Plan Your Next Vacation Quickly

By

@aymericroucher

–

10 April 2024 16h27

𝗡𝗲𝘄 𝗦𝗽𝗮𝗰𝗲: 𝘼𝙄 𝙏𝙧𝙖𝙫𝙚𝙡 𝙥𝙡𝙖𝙣𝙣𝙚𝙧 Plan your next vacation in a few minutes! Describe your ideal trip, and it will come up with nice places and recommendations! Try it here

→ View original post on X — @aymericroucher

10 April 2024

Recommended papers for speculative decoding

By

@aymericroucher

–

05 April 2024 11h54

@jxmnop what are some good papers you'd recommend to get up to speed on speculative decoding?

→ View original post on X — @aymericroucher

5 April 2024

Beam Search Visualizer featured in Hugging Face Spaces of the Week

By

@aymericroucher

–

03 April 2024 13h22

My Beam Search visualizer space is now featured in Spaces of the week! It allows you to check how beam search decoding works in practice. Go check it out https://
huggingface.co/spaces/m-ric/b
eam_search_visualizer
…

→ View original post on X — @aymericroucher

3 April 2024

Multi-agent system with orchestrator for flexible error handling

By

@aymericroucher

–

11 March 2024 15h48

Great result! This system based on 3 "team member" agents (Web search, Terminal, & Assistant) managed by an Orchestrator. It has no fixed graph structure : at each step the Orchestrator decides which actions should be taken. Advantage: more flexible in case of error!

→ View original post on X — @aymericroucher

11 March 2024

Claude-3 Opus beats GPT-4, GPT-4 Turbo stays top

By

@aymericroucher

–

08 March 2024 14h39

The Chatbot Arena was just updated with the ELO rankings for the new Claude-3 models! – 𝗖𝗹𝗮𝘂𝗱𝗲-𝟯 𝙊𝙥𝙪𝙨 𝗯𝗲𝗮𝘁𝘀 𝗚𝗣𝗧-𝟰!
It's the first LLM to beat GPT-4 since its release 1 year ago. – GPT-4 Turbo stays on top with a comfortable lead of ~20 points.

→ View original post on X — @aymericroucher

8 March 2024

Find LLM leaderboards: Aymeric Roucher shares ultimate collection by C. Le Fourrier

By

@aymericroucher

–

08 March 2024 9h09

Are you trying to find good leaderboards to compare LLMs? @clefourrier is building the ultimate collection here:

→ View original post on X — @aymericroucher

8 March 2024

New HF Space for visualizing chunk splitting methods in RAG

By

@aymericroucher

–

21 February 2024 13h05

I've built a new HF Space to let you visualize how different splitting methods affect the chunks you fet for RAG! Try it out here: https://
huggingface.co/spaces/m-ric/c
hunk_visualizer
… It's heavily inspired from @GregKamradt 's http://
chunkviz.com – all credits to him for the idea!

→ View original post on X — @aymericroucher

21 February 2024