How lucky are you to have been born when and where you are? Had Opus 4.8 in Claude Code whip up a new visualization of all humans who ever lived. In addition to being neat, it is an interesting test of combining research, code, design and stats for an AI. https://
veil-of-history.netlify.app
MULTIMODAL AI
-

Visualization of all humans generated by Opus 4.8 in Claude Code
By
–
-

Step 3.7 Flash MoE Model with 256K Context Released
By
–
Step 3.7 Flash is here ICYMI: 198B MoE with 11B active params, 256K context, native image + video support. Day 0 support is live on http://
build.nvidia.com with GPU-accelerated endpoints, deploy with NVIDIA NIM inference microservices, and fine-tune with the NVIDIA NeMo -
ElevenLabs Dubbing v2 Alpha: Multilingual Speech Translation
By
–
ElevenLabs introduced a new Dubbing v2 Alpha model that can translate speech across all languages while preserving the emotional tone of the original content.
— 🚨 AI News | TestingCatalog (@testingcatalog) 28 mai 2026
Big for creators 👀 https://t.co/W7dTh3IxAT pic.twitter.com/gnC1FLeG4kElevenLabs introduced a new Dubbing v2 Alpha model that can translate speech across all languages while preserving the emotional tone of the original content. Big for creators
-
Shader test as a measure of AI coding capability
By
–
Except a shader like this is a very good measure of model capability because of the technical difficulty of building this sort of code. It translates to other coding as well. Feel free to see my many other tweets and substack posts (& book) about AI applications in businesses.
-
Aleph 2.0 Targeted Video Editing: Selective Color Change
By
–
took a clip from a kids’ ball pit fight and used Aleph 2.0 to change the wall colors from yellow and blue to white and red
— SONIA (@S0N_IA_) 28 mai 2026
what stood out is how targeted the edit was
the wall changes
but the movement, lighting and chaos of the scene stay intact
same clip
different visual… pic.twitter.com/ZRM9AdIaSntook a clip from a kids’ ball pit fight and used Aleph 2.0 to change the wall colors from yellow and blue to white and red what stood out is how targeted the edit was the wall changes
but the movement, lighting and chaos of the scene stay intact same clip
different visual -
Decentralized Training Scales Video Models Without Data Centers
By
–
This is the part nobody is talking about yet.
— AI Highlight (@AIHighlight) 28 mai 2026
You can now train a serious video model without owning a single data center.
Bagel proved decentralized training scales from images to video, and the next stop is world models. https://t.co/vhPzDocy2BThis is the part nobody is talking about yet. You can now train a serious video model without owning a single data center. Bagel proved decentralized training scales from images to video, and the next stop is world models.
-
LocateAnything: Vision-Language Detection Model for AI Agents
By
–
This #CVPR2026 paper from our research team is trending #1 on @HuggingFace 🤗
— NVIDIA AI (@NVIDIAAI) 28 mai 2026
Meet LocateAnything: a vision-language detection model that rethinks bounding box prediction. For AI agents and robots, “seeing” is only useful if a model can pinpoint where something is fast enough to… pic.twitter.com/2OGaQnUCnXThis #CVPR2026 paper from our research team is trending #1 on @HuggingFace Meet LocateAnything: a vision-language detection model that rethinks bounding box prediction. For AI agents and robots, “seeing” is only useful if a model can pinpoint where something is fast enough to
-
AI Animates Physical Materials for Google I/O Film
By
–
We wanted to see if we could take simple, physical materials (like cardboard and markers) and use AI to bring them to life. What was the result? A short film starring a bunch of TPUs getting ready for the big stage at Google I/O 2026!
— Google AI (@GoogleAI) 28 mai 2026
Working with director Laurie Rowan and Nexus… pic.twitter.com/hBk8PRka1xWe wanted to see if we could take simple, physical materials (like cardboard and markers) and use AI to bring them to life. What was the result? A short film starring a bunch of TPUs getting ready for the big stage at Google I/O 2026! Working with director Laurie Rowan and Nexus
-

Gemini API Video-to-Image Generation: Frame Optimization
By
–
API docs here: https://
ai.google.dev/gemini-api/doc
s/image-generation#video-to-image
… For a YouTube video, if your video is long, scale down the `fps` to keep the number of frames within the context window.