pod: Efficiency is Coming: 3000x Faster, Cheaper, Better AI Inference with @nyla_worker of @nvidia
, @convaitech
, @googleai
! The commoditization of intelligence takes on a few dimensions: Time to Open Model Equivalent: 15 months between GPT-4 and Llama 3.1 405B (h/t
@latentspacepod
-
AI Inference Efficiency: 3000x Faster, Cheaper, Better
By
–
-
Latent Space Podcast Episode with Carlini Video and Audio
By
–
video: https://
youtube.com/watch?v=X8il0u
VIsFs
… audio: https://
latent.space/p/carlini we'd love your feedback! -
Writing Custom LLM Benchmarks with Nicholas Carlini
By
–
🆕 Why you should write your own LLM benchmarks
— Latent.Space (@latentspacepod) 29 août 2024
w/ Nicholas Carlini of @GoogleDeepMind
Covering his greatest hits:
– How I Use AI
– My benchmark for large language models
– Extracting Training Data from Large Language Models (RIP @openai logprobs)
Full episode below! pic.twitter.com/TtVkNyIa9cWhy you should write your own LLM benchmarks w/ Nicholas Carlini of @GoogleDeepMind Covering his greatest hits:
– How I Use AI
– My benchmark for large language models
– Extracting Training Data from Large Language Models (RIP @openai logprobs) Full episode below! -
Speculative Decoding State of the Art Paper Club Discussion
By
–
This is entirely speculative but… Tomorrow's LS paper club with @picocreator is going to be extremely lit! come learn about the state of the art in Speculative Decoding!
-
AI-in-Action Club Launches DSPy Framework Training Session
By
–
our next #ai-in-action club is starting now: on @lateinteraction
’s DSPy! led by @ProgramWithAi and @kbal11 -
Answer AI Founding Journey and AI Governance Crisis Discussion
By
–
Building AI for The People Never has so much been shipped for so many by so few. https://
latent.space/p/answerai @jeremyphoward is back on the pod! sharing the founding journey of @AnswerAI
, predicting the @OpenAI governance crisis, hiring 1000x researchers and developers like -
Paper Club Discussion on ReFT Representation Fine Tuning
By
–
thanks to @honicky and @vibhuuuus for another great Paper Club on ReFT: Representation Fine Tuning by @aryaman2020 et al! signups below
-
xAI Grok-2-Large Achieves Promising Arena Results
By
–
Some very promising arena results, congrats @xai & @elonmusk
! Open invitation for a grok-2-large release podcast with us, people would love to hear the technical details -
Emergency Paper Club Discusses Llama 3 Herd Models
By
–
EMERGENCY PAPER CLUB The @latentspacepod discord is meeting in 2hrs to talk thru @lvdmaaten et al's The Llama 3 Herd of Models, early contender to win the POTY* Awards! Join us (link below) with @swyx
, @vibhuuuus
, @picocreator
, @eugeneyan
, et al! *Paper of The Year, totally