Insufficient Training Data for Spelling Tasks in LLMs

I think the internet naturally doesn’t contain enough training data for spelling tasks, relative to how hard those tasks are for an LLM. The reason is how text is chopped up into sequences of chunks (tokens), each of which is a single distinct unit: the model operates on token IDs, not on individual characters. I have a whole video on Tokenization.
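A toy sketch of why this matters. The vocabulary and the greedy longest-match rule below are hypothetical stand-ins (real tokenizers such as BPE learn ~100k chunks from data), but they show the core issue: a word arrives at the model as a couple of opaque chunks, not as a sequence of letters.

```python
# Hypothetical toy vocabulary; real LLM vocabularies have ~100k entries
# learned from data, not hand-written like this.
VOCAB = {"straw", "berry", "st", "raw", "b", "e", "r", "y"}

def tokenize(text):
    """Greedily match the longest vocabulary entry at each position."""
    tokens = []
    i = 0
    while i < len(text):
        for j in range(len(text), i, -1):  # try longest piece first
            piece = text[i:j]
            if piece in VOCAB:
                tokens.append(piece)
                i = j
                break
        else:
            raise ValueError(f"no vocab entry covers {text[i]!r}")
    return tokens

print(tokenize("strawberry"))  # ['straw', 'berry']
```

The model sees two chunks, not ten characters, so a question like "how many r's are in strawberry?" asks it to recover character-level structure that the tokenization has hidden from it.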