@chipro - AI Dynamics

Why Open Source AI Models Lag Behind Commercial Solutions

By

–

01 May 2024 18h53

Several people told me to add "open source performance is still lagging compared to commercial models". Is there any fundamental reason that stops open source models from catching up? The biggest reason I can think of is what @soumithchintala told me: open source models don't

→ View original post on X — @chipro,

1 May 2024

Open Source vs Commercial AI Models: Privacy and Security Considerations

By

@chipro

–

01 May 2024 18h48

I’m making a list of things to consider when using open source models and commercial models. What else should I add? Commercial models
1. Data privacy: employees might accidentally include confidential information in the prompt, e.g. when Samsung employees leaked the company’s

→ View original post on X — @chipro,

1 May 2024

Enterprise AI applications easiest to evaluate gain adoption first

By

@chipro

–

29 April 2024 16h53

I believe that the most common enterprise AI applications today aren’t the ones that solve the most important problems or make the most money. The most common applications are the ones that are easiest to evaluate. 1. Recommender system: evaluated by increase in engagement or

→ View original post on X — @chipro,

29 April 2024

New Book on AI Engineering with Foundation Models

By

@chipro

–

23 April 2024 22h58

I’m excited to share that I’m working on a new book about building applications with foundation models! AI Engineering builds upon Machine Learning Systems Design, but with a focus on large scale, ready made models. The book covers: – The new AI stack (e.g. how it differs from

→ View original post on X — @chipro,

23 April 2024

Theseus GPU-native query engine benchmarks massive data processing

By

@chipro

–

17 April 2024 17h54

Excited to show what our team has been working on over the last 2.5 years: Theseus, our GPU-native query engine! This benchmark compares data queries of different scales — 10TB, 30TB, and 100TB — on Spark (run on CPUs) and Theseus (run on GPUs). Moving the same queries from

→ View original post on X — @chipro,

17 April 2024

Decentralized Web of Trust: Personal Reputation Systems Explained

By

@chipro

–

01 April 2024 6h34

I like the idea of the web of trust. How would trust be established/signed? Would that be like page rank but for trust? Would trust be personal: e.g. I might not necessarily trust the same website my mom does?

→ View original post on X — @chipro,

1 April 2024

Data synthesis challenges for AI startups

By

@chipro

–

01 April 2024 1h45

Problems I'd do if I'm to do a startup again (though I probably won't any time soon because startups are hard). If you’re solving any of them, I’d love to chat. 1. Data synthesis: AI has become really good both at generating and annotating data. The challenge now is to make sure

→ View original post on X — @chipro,

1 April 2024

845 Generative AI Repos on GitHub: Growth Analysis

By

@chipro

–

14 March 2024 22h26

I went through the most popular AI repos on GitHub, categorized them, and studied their growth trajectories. Here are some of the learnings: 1. There are 845 generative AI repos with at least 500 stars on GitHub. They are built with contributions from over 20,000 developers,

→ View original post on X — @chipro,

14 March 2024

Human Preference Predictor Architecture and Results

By

@chipro

–

29 February 2024 17h01

This is the architecture of my preference predictor. You can read more about how it works and the results here. https://
huyenchip.com/2024/02/28/pre
dictive-human-preference.html
… As always, feedback is much appreciated!

→ View original post on X — @chipro,

29 February 2024

Bradley-Terry Model Ranking for AI Model Preference Predictions

By

@chipro

–

29 February 2024 17h01

Using the preference predictions for different model pairs for a given prompt, I fit a Bradley-Terry model (the same ranking algorithm that Chatbot Arena uses) to compute a model ranking specific to that prompt.

→ View original post on X — @chipro,

29 February 2024