@arthurmensch - AI Dynamics

Deploying AI Models with vLLM and Temperature Control

By

@arthurmensch

–

28 September 2023 14h10

You can try deploying our models using our packaged solution (with vLLM); you'll get an API with control over temperature (which is how you get rid of repetition).
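To illustrate the point about temperature, here is a minimal sketch of temperature-scaled sampling in plain Python. It is not vLLM's implementation or the packaged API mentioned in the post; the function names and the logit values are hypothetical and chosen only for illustration. The idea it shows is the standard one: logits are divided by the temperature before the softmax, so a very low temperature behaves almost like greedy decoding (which tends to loop on the same tokens), while a higher temperature spreads probability across more candidates.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(logits, temperature, rng):
    """Draw one token index according to the temperature-scaled distribution."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical next-token scores for a four-token vocabulary (illustrative only).
logits = [2.0, 1.0, 0.5, 0.1]

cold = softmax_with_temperature(logits, 0.1)  # nearly all mass on the top token
warm = softmax_with_temperature(logits, 1.0)  # mass shared among alternatives

rng = random.Random(0)  # seeded so that runs are reproducible
cold_draws = [sample_token(logits, 0.1, rng) for _ in range(20)]
warm_draws = [sample_token(logits, 1.0, rng) for _ in range(20)]
```

With the cold setting the top token is selected almost every time, which is the regime where repeated output is most likely; the warm setting leaves room for other tokens to be drawn.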

→ View original post on X — @arthurmensch

28 September 2023

Mistral 7B Now Available in Production

By

@arthurmensch

–

28 September 2023 9h19

Mistral 7B is now in prod, nice work @perplexity_ai! https://t.co/mckgXbvPcw
— Arthur Mensch (@arthurmensch) 28 September 2023

→ View original post on X — @arthurmensch

28 September 2023

Mistral AI Releases First Model, Best Open Source 7B

By

@arthurmensch

–

27 September 2023 16h37

At @MistralAI we're releasing our very first model, the best 7B in town (outperforming Llama 13B on all metrics, and good at code), Apache 2.0. We believe in open models and we'll push them to the frontier https://mistral.ai/news/about-mistral-ai/ … Very proud of the team!

→ View original post on X — @arthurmensch

27 September 2023

Llama II Release Advances Open-Source Language Model Progress

By

@arthurmensch

–

19 July 2023 11h55

Great to see the release of Llama II, open-source LLMs are making good progress! Still a lot of room to improve OS models positioning on the efficiency/performance front — so that they eventually catch up with proprietary solutions. An interesting challenge

→ View original post on X — @arthurmensch

19 July 2023

Mistral AI Founded: Guillaume Lample and Team Launch New Venture

By

@arthurmensch

–

14 June 2023 11h50

Totally thrilled to be alongside @GuillaumeLample and @tlacroix6 to create Mistral AI. A lot of work ahead of us!

→ View original post on X — @arthurmensch

14 June 2023

ChatGPT’s Ease with Boomer Requests Raises Questions

By

@arthurmensch

–

29 March 2023 17h58

ChatGPT indeed seems at ease with boomer requests

→ View original post on X — @arthurmensch

29 March 2023

Open vs Closed Source Operating Systems Competition Ahead

By

@arthurmensch

–

24 March 2023 23h17

If that's indeed an OS, let's prepare to see an interesting replay of closed vs open-source operating systems in the coming years 🙂

→ View original post on X — @arthurmensch

24 March 2023

Deep Learning vs Tree Methods on Small Datasets and Multitask Fine-tuning

By

@arthurmensch

–

11 March 2023 21h13

I can see how deep learning methods may struggle to catch up with tree methods on small datasets (and you say it in the thread). Wondering if you tried multitask fine-tuning on, e.g., Transformers and saw a positive benefit? (We observe it in text; see e.g. Flan/T0.)

→ View original post on X — @arthurmensch

11 March 2023

Empirical Modelling: Approximation Theory and Optimization Analysis

By

@arthurmensch

–

27 February 2023 22h02

Both are empirical modelling of experimental outcomes. The bottom one has some grounding in approximation theory and optimisation analysis. It also does not tend to 0 when N, D → ∞, which is a sound property.

→ View original post on X — @arthurmensch

27 February 2023