Big hopes for the speakers dinner menu
@petergostev
-
Trinity Large scores lower than expected with high thinking
By
–
Trinity Large also not scoring high – 73th and 82nd, no thinking is doing better than xhigh thinking.
-
Bullshit Benchmark Data Viewer and GitHub Repository Released
By
–
Data viewer: https://
petergpt.github.io/bullshit-bench
mark/viewer/index.v2.html
… Github with all data & code: -
Gemma 4 Scores Low on BullshitBench Evaluation
By
–
-
Gemma 4 Scores Low on BullshitBench Evaluation
By
–
-
Google’s AI Strategy: Investing in Anthropic Over Gemini Development
By
–
It's so curious to me, doesn't look like Google is serious about AI. They've been investing in Anthropic for years, selling them TPUs, and basically diverting resources from Gemini while it is cracking under capacity constraints. Imagine OpenAI selling capacity to their core
-
Google’s AI Strategy: Investing in Anthropic Over Gemini Development
By
–
It's so curious to me, doesn't look like Google is serious about AI. They've been investing in Anthropic for years, selling them TPUs, and basically diverting resources from Gemini while it is cracking under capacity constraints. Imagine OpenAI selling capacity to their core
-
Winter coat evening shorts afternoon layering
By
–
Easy – winter coat for the evening and shorts for the afternoon
-
London Tech Conference Week: AI Engineer Event and Perfect Weather
By
–
Get ready for another week of 'londonmaxxing' – there's a big @aiDotEngineer conference in London paired with an absolutely gorgeous weather that would trick everyone that London is just like SF (+things to do) and not grey and miserable
-
The Information app notifications disrupt sleep at night
By
–
The worst one in my day to day is @theinformation – I literally close the app the moment it shines in my face in the middle of the night