AI Dynamics

Global AI News Aggregator

@aymericroucher

New Step-by-Step Guide to Spinning Up in LLMs

By @aymericroucher, 13 February 2024 16h32

𝙉𝙚𝙬 𝙎𝙩𝙚𝙥-𝙗𝙮-𝙨𝙩𝙚𝙥 𝙂𝙪𝙞𝙙𝙚: 𝙎𝙥𝙞𝙣𝙣𝙞𝙣𝙜 𝙐𝙥 𝙞𝙣 𝙇𝙇𝙈𝙨! I've made a step-by-step intro to LLMs. I've explained a lot in detail, and added plentiful resources to go further: tutorials, papers, blog posts… Dive in! https://shorturl.at/aHJR3

→ View original post on X — @aymericroucher

Open-source Mixtral beats GPT3.5 in LangChain agents

By @aymericroucher, 24 January 2024 14h30

We've worked with @andrewrreed and Joffrey Thomas on making it possible to build LangChain agents powered by open-source models. On our benchmark, Mixtral is the most competitive of all open-source models, even surpassing GPT3.5. Read our blog post here: https://huggingface.co/blog/open-source-llms-as-agents…

Encouraging development of agents with open-source models

By @aymericroucher, 22 December 2023 18h16

Happy to have helped on this. Let's keep developing the use of agents based on open-source models!

RAG relevance for questions using many distant document elements

By @aymericroucher, 02 December 2023 13h10

RAG can handle all sorts of different questions; it's really appropriate for extracting information from a document. An interesting question is: "when the question requires combining many distant elements from the document, is RAG still relevant?"

RAG retrieval failures due to random variations, need more tests

By @aymericroucher, 02 December 2023 13h07

Well spotted!
Both variations (blue and green) are also due to random variation. For instance, the RAG fails to retrieve the relevant snippets on just one example. I haven't run enough tests yet to smooth these variations out. But maybe I'll run more.
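
To give a feel for why a single failed retrieval can move a curve when there are only a few runs, here is a minimal sketch of the standard error of a success rate. The counts used below (9 out of 10, 90 out of 100) are hypothetical and are not the numbers from the test discussed in this post.

```python
import math


def success_rate_with_error(successes, trials):
    """Return the observed success rate and its binomial standard error."""
    rate = successes / trials
    # Standard error of a proportion: sqrt(p * (1 - p) / n).
    std_err = math.sqrt(rate * (1 - rate) / trials)
    return rate, std_err


# Hypothetical counts: the same 90% success rate at two sample sizes.
small_rate, small_err = success_rate_with_error(9, 10)
large_rate, large_err = success_rate_with_error(90, 100)
```

With 10 runs, one extra failure shifts the rate by 10 points, and the standard error is roughly three times larger than with 100 runs, which is the kind of noise that more tests would smooth out.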

RAG system reduces token input from 128k to 2k

By @aymericroucher, 02 December 2023 13h03

Exactly!
To add to that last point: thanks to the RAG system, the model was fed around 2k tokens each time, down from the 128k tokens of the original document.
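
As a rough illustration of how retrieval can shrink the input like this, the sketch below splits a document into chunks, ranks them by word overlap with the question, and keeps the best ones until a budget is reached. This is not the RAG system used in the test: the word-overlap scoring, the whitespace "tokens", and the names `retrieve`, `chunk_size`, and `budget` are all assumptions made for the example. A real system would more likely use an embedding model and the LLM's own tokenizer.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase word split; a crude stand-in for a real LLM tokenizer."""
    return re.findall(r"\w+", text.lower())


def chunk_words(words, chunk_size):
    """Cut a word list into consecutive chunks of at most chunk_size words."""
    return [words[i:i + chunk_size] for i in range(0, len(words), chunk_size)]


def retrieve(question, document, chunk_size=200, budget=2000):
    """Return the best-matching chunks whose total size fits within budget."""
    q_counts = Counter(tokenize(question))
    chunks = chunk_words(tokenize(document), chunk_size)
    # Score = number of word occurrences a chunk shares with the question.
    ranked = sorted(
        chunks,
        key=lambda c: sum((Counter(c) & q_counts).values()),
        reverse=True,
    )
    selected, used = [], 0
    for chunk in ranked:
        if used + len(chunk) > budget:
            break
        selected.append(chunk)
        used += len(chunk)
    return selected, used
```

Called on a 128k-word document with the default budget, `retrieve` returns at most 2,000 words of context, with the chunk that best matches the question first.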

RAG beats GPT-4-Turbo in long document retrieval test

By @aymericroucher, 01 December 2023 18h13

Information retrieval: who wins, GPT-4-Turbo or a RAG system based on GPT-4? I extended the "Needle in a Haystack" test created by @GregKamradt, and the result is clear: 𝗥𝗔𝗚 𝘄𝗶𝗻𝘀. Its edge is most visible at the longest document sizes. @huggingface Links below
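
For readers unfamiliar with the idea, a "Needle in a Haystack" test hides one known fact inside filler text at varying depths and context lengths, then checks whether the system can recall it. The sketch below only builds the test cases and scores an answer. It is an assumed, simplified setup, not the code behind the result above: `build_cases`, `insert_needle`, and `found` are hypothetical names, and lengths are counted in words rather than model tokens.

```python
def insert_needle(haystack_words, needle, depth):
    """Place needle at a relative depth (0.0 = start, 1.0 = end) of the haystack."""
    position = int(len(haystack_words) * depth)
    return haystack_words[:position] + needle.split() + haystack_words[position:]


def build_cases(filler_words, needle, lengths, depths):
    """Yield (length, depth, text) for every combination of length and depth."""
    for length in lengths:
        # Repeat the filler until it covers the requested length, then trim.
        repeats = length // len(filler_words) + 1
        haystack = (filler_words * repeats)[:length]
        for depth in depths:
            yield length, depth, " ".join(insert_needle(haystack, needle, depth))


def found(answer, expected):
    """Score one answer: does it contain the expected fact?"""
    return expected.lower() in answer.lower()
```

Each generated text would then be sent either directly to the long-context model or through a retrieval step first, and `found` applied to the answer, giving a grid of success rates by document length and needle depth.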
