@chipro - AI Dynamics

Weak Models Match Strong Models on Simple Prompts

By

–

29 February 2024 17h00

In my experiment, a preference predictor was able to pick up the performance patterns of different models. One pattern is that for simple prompts, weak models can do (nearly) as well as strong models. For more challenging prompts, however, users are much more likely to prefer

→ View original post on X — @chipro,

29 February 2024

Predicting Best AI Model Selection for User Queries

By

@chipro

–

29 February 2024 16h59

A challenge of building AI applications is choosing which model to use. What if we don’t have to? What if we can predict the best model for any prompt? Predictive human preference aims to predict which model users might prefer for a specific query. One use case is model

→ View original post on X — @chipro,

29 February 2024

Voltron Data Acquires Claypot for Real-Time AI Applications

By

@chipro

–

24 January 2024 18h22

Claypot AI is joining Voltron Data! AI starts from data. By joining forces, we can further help companies leverage both batch and real-time data for AI applications, on top of Voltron Data’s GPU-native distributed engine Theseus. https://
venturebeat.com/data-infrastru
cture/exclusive-voltron-data-acquires-claypot-to-unlock-real-time-ai-with-modular-data-systems/
… For AI, GPUs are mostly

→ View original post on X — @chipro,

24 January 2024

Sampling Strategies for AI Text Generation: Temperature, Top-K, Top-P

By

@chipro

–

17 January 2024 5h13

New post: Sampling for Text Generation https://
huyenchip.com/2024/01/16/sam
pling.html
… Many challenges (and opportunities) in working with AI today stem from the way models sample their outputs. This post covers: 1. Sampling strategies and variables including temperature, top-k, and top-p.
2. How

→ View original post on X — @chipro,

17 January 2024

Gemini Technical Report: TPU Training, Performance vs GPT Models

By

@chipro

–

06 December 2023 18h53

Summary of Gemini's 60-page technical report. 1. Written in Jax and trained using TPUs. The architecture, while not explained in details, seems similar to Flamigo's. 2. Gemini Pro's performance is similar to GPT-3.5 and Gemini Ultra is reported to be better than GPT-4. Nano-1

→ View original post on X — @chipro,

6 December 2023

Multimodality and Large Multimodal Models Explained

By

@chipro

–

11 October 2023 7h00

New blog post: Multimodality and Large Multimodal Models (LMMs) Being able to work with data of different modalities — e.g. text, images, videos, audio, etc. — is essential for AI to operate in the real world. This post covers multimodal systems in general, including Large

→ View original post on X — @chipro,

11 October 2023

Open Challenges in Large Language Model Research Today

By

@chipro

–

16 August 2023 19h02

Open challenges in LLM research The first two challenges, hallucinations and context learning, are probably the most talked about today. I’m the most excited about 3 (multimodality), 5 (new architecture), and 6 (GPU alternatives). Number 5 and number 6, new architectures and

→ View original post on X — @chipro,

16 August 2023

Generative AI Strategy: Slides and Insights from Expert Talk

By

@chipro

–

08 June 2023 17h34

I had so much fun preparing this talk. Per request, here are the slides: https://
huyenchip.com/2023/06/07/gen
erative-ai-strategy.html
… The idea came from many conversations I’ve had recently with friends who need to figure out their generative AI strategy. I’d love to hear about your experience through this process.

→ View original post on X — @chipro,

8 June 2023

RLHF: Reinforcement Learning from Human Feedback Explained

By

@chipro

–

03 May 2023 17h43

New post: RLHF – Reinforcement Learning from Human Feedback Discussing 3 phases of ChatGPT development, where RLHF fits in, how RLHF works, hypotheses on why it works, and relationship between RLHF and hallucination. https://
huyenchip.com/2023/05/02/rlh
f.html
…

→ View original post on X — @chipro,

3 May 2023

In-House LLMs: Benefits and Drawbacks for Companies

By

@chipro

–

25 April 2023 18h44

Many companies seem to want their own in-house LLMs: finetune an open-source LLM on their own data. Here are a few reasons for and against in-house LLMs I can think of. Would love to hear your thoughts.

→ View original post on X — @chipro,

25 April 2023