Here’s Deepseek r1 1.5B thinking through a problem — it’s comparable to 4o and Claude 3.5 Sonnet in a number of domains like math. Except…
it’s a 1.5B model…
and can run on virtually any hardware. Truly a huge efficiency leap. pic.twitter.com/CjvsCaGiU3
— Aaron Ng (@localghost) January 24, 2025