10B tokens of FineWeb! Ilya said WebText was 40B tokens (https://youtube.com/watch?v=13CZPWmke6A&t=3645s – for GPT-2 1.5B). What accounts for the improved loss/accuracy you got over GPT-2 – have we improved our dataset filtering? Were there smarter hyperparameter choices made here? Any ballpark attributions?
FineWeb Dataset: Improvements Over GPT-2 Training Data
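For concreteness on the token count in the question: the 10B-token FineWeb sample is distributed as the `sample-10BT` config of the `HuggingFaceFW/fineweb` dataset on the Hugging Face Hub (its token counts are reported with the GPT-2 tokenizer). The sketch below is an illustration of loading that sample, not the training setup used here; the streaming flag and the 100-document cutoff are just a quick sanity check.

```python
# Minimal sketch: stream the 10B-token FineWeb sample and count GPT-2 tokens.
from datasets import load_dataset
from transformers import AutoTokenizer

# "sample-10BT" is the ~10B-token subset of FineWeb on the Hub.
fineweb = load_dataset(
    "HuggingFaceFW/fineweb",
    name="sample-10BT",
    split="train",
    streaming=True,  # avoid downloading the whole sample up front
)

tokenizer = AutoTokenizer.from_pretrained("gpt2")

# Tokenize the first 100 documents as a rough check of tokens per doc.
total_tokens = 0
for i, doc in enumerate(fineweb):
    total_tokens += len(tokenizer(doc["text"]).input_ids)
    if i >= 99:
        break

print(f"GPT-2 tokens in first 100 documents: {total_tokens}")
```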