News: @aiDotEngineer is headed to NYC!! https://
latent.space/p/2025-summit Announcing the 2025 AI Engineer Summit. Applications open now!
@latentspacepod
-
2025 AI Engineer Summit in NYC Now Accepting Applications
By
–
-
2024 AI Year in Review: NeurIPS Insights and Major Themes
By
–
π Thank you for a great 2024!
— Latent.Space (@latentspacepod) 1 janvier 2025
Presenting our 2024 year in review: NeurIPS takeaways, Biggest Themes of the Year, the Four Wars of AI, and a month-by-month recap of the Year in AI.
00:00 Welcome to the 100th Episode!
00:19 Reflecting on the Journey
00:47 AI Engineering: The⦠pic.twitter.com/VXezTxlJZQThank you for a great 2024! Presenting our 2024 year in review: NeurIPS takeaways, Biggest Themes of the Year, the Four Wars of AI, and a month-by-month recap of the Year in AI. 00:00 Welcome to the 100th Episode!
00:19 Reflecting on the Journey
00:47 AI Engineering: The -
Open Source Projects Available on GitHub and Hacker News
By
–
yea tons on github/hn. nothing BIG but lots of little open source projects, search around
-
Prompt Engineering Importance in AI: Evolution and Recognition
By
–
## Prompting, ICL & Chain of Thought We initially started writing about the AI Engineer as a reaction against "Prompt Engineer" being a fulltime role in 2023; now the pendulum has swung the other way and not enough employers appreciate the importance of good prompting in AI
-
Llama 4 versus Claude 4: The 2025 AI Model Showdown
By
–
remains to be seen but they have slightly different goals. Llama 4 vs Claude 4 will be a VERY interesting fight in 2025.
-
Vision Becomes Table Stakes Across Major LLM Platforms
By
–
2024 was the year Vision became table stakes – we went from "only OAI has it" in jan to now Gemini, Claude, Grok, Mistral, Olmo and even Llama having vision support! Where to get started in Vision + LLMs: 1. just get hands on experience with using vision in 4o/claude/gemini
-
Essential AI Benchmarks and Evaluation Metrics for LLMs
By
–
Must Know Benchmarks and Evals: Knowledge: @hendrycks
' MMLU and MATH, @idavidrein
's GPQA and BIG-Bench and their polyunsaturated 2025 variants. Ditto Math lvl 5, AIME, @tamaybes
's FrontierMath, etc Long Context: @ZayneSprague
's MuSR, @realYushiBai
's LongBench, -
Frontier LLMs: OpenAI, Anthropic, Google, Meta and competitors
By
–
Section 1: Frontier LLMs the @openai canon, @AnthropicAI Claude and @GoogleDeepMind Gemini, @AIatMeta Llama 1/2/3, @MistralAI
, @deepseek_ai 1/2/3, @apple intelligence special mention: @soldni
/
@natolambert AI2, @xai grok, @awsai Nova, @LoubnaBenAllal1 SmolLM, et al. -
2025 AI Engineering Reading List: Weekly Papers
By
–
Presenting: The 2025 AI Engineering Reading List https://
latent.space/p/2025-papers 1 paper/blog/model family per week for every week of 2025, for you to run paper clubs or binge over the break.