AI Dynamics

Global AI News Aggregator

@jiqizhixin

  • Error-Entropy Scaling Law Surpasses Traditional Cross-Entropy for LLM Development

    Is the fundamental scaling law guiding large language model development broken? Researchers from Tsinghua University have found the answer. They've decomposed cross-entropy loss into three components: Error-Entropy, Self-Alignment, and Confidence, finding that only Error-Entropy truly scales with model size. This new "Error-Entropy scaling law" provides a far more accurate guide for LLM development, outperforming the traditional cross-entropy law, especially for the largest models. Crucial for future AI design. What Scales in Cross-Entropy Scaling Law? Paper: arxiv.org/abs/2510.04067 Code: github.com/yanjx2021/Rethink… Our report: mp.weixin.qq.com/s/ngn6YY6Aj… 📬 #PapersAccepted by Jiqizhixin
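
    The paper's three-way decomposition (Error-Entropy, Self-Alignment, Confidence) is defined in the arXiv preprint; as background for why such splits exist at all, the classic identity H(q, p) = H(q) + KL(q‖p) already separates cross-entropy into an irreducible entropy term and a reducible divergence term. A quick numeric check on toy distributions (all values below are illustrative, not from the paper):

```python
import numpy as np

def cross_entropy(q, p):
    # H(q, p) = -sum_i q_i * log(p_i)
    return -np.sum(q * np.log(p))

def entropy(q):
    # H(q) = -sum_i q_i * log(q_i), skipping zero-probability entries
    q = q[q > 0]
    return -np.sum(q * np.log(q))

def kl(q, p):
    # KL(q || p) = sum_i q_i * log(q_i / p_i)
    mask = q > 0
    return np.sum(q[mask] * np.log(q[mask] / p[mask]))

q = np.array([0.7, 0.2, 0.1])  # toy "true" next-token distribution
p = np.array([0.5, 0.3, 0.2])  # toy model distribution

# Cross-entropy splits exactly into entropy plus KL divergence.
assert np.isclose(cross_entropy(q, p), entropy(q) + kl(q, p))
```

    The paper's claim goes further: of its three finer-grained components, only the Error-Entropy term scales predictably with model size.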

    → View original post on X — @jiqizhixin, 2026-04-04 08:49 UTC

  • Shop-R1: AI Framework for Understanding Human Online Shopping Behavior

    Ever wonder if an AI could truly understand how you shop online? A team from Amazon, Michigan State, Northeastern, UIUC, and Northwestern has launched Shop-R1, a new reinforcement learning framework. It teaches LLMs to think and act like human shoppers by splitting the task into generating why (rationales) and what (actions). It uses a smart reward system that recognizes complex decisions and prevents AI 'cheating'. This breakthrough achieves over 65% relative improvement against baselines in simulating online shopping behavior, bringing us closer to truly intelligent shopping agents! Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning Paper: arxiv.org/abs/2507.17842 Project: damon-demon.github.io/shop-r… Our report: mp.weixin.qq.com/s/Dvst0Oirm… 📬 #PapersAccepted by Jiqizhixin
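
    The summary mentions a reward that scores both rationales and actions while blocking reward hacking; a minimal sketch of that shape, where all names, weights, and the format gate are hypothetical rather than Shop-R1's actual design:

```python
def shop_reward(rationale_ok: bool, action_match: float, format_ok: bool,
                w_rationale: float = 0.3, w_action: float = 0.7) -> float:
    """Toy composite reward: weighted rationale and action terms,
    with malformed outputs zeroed out to discourage reward hacking."""
    if not format_ok:  # hard gate: no credit for ill-formed responses
        return 0.0
    return w_rationale * float(rationale_ok) + w_action * action_match

# A well-formed response with a sound rationale and an exact action match
# earns full reward; a malformed one earns nothing regardless of content.
assert abs(shop_reward(True, 1.0, True) - 1.0) < 1e-9
assert shop_reward(True, 1.0, False) == 0.0
```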

    → View original post on X — @jiqizhixin, 2026-04-04 05:43 UTC

  • Qwen 3.6 Plus Breaks 1 Trillion Tokens Record on OpenRouter

    Huge accomplishment! Congrats to the @Alibaba_Qwen team. OpenRouter (@OpenRouter) Qwen 3.6 Plus from @Alibaba_Qwen is officially the first model on OpenRouter to break 1 Trillion tokens processed in a single day! At ~1,400,000,000,000 tokens, it’s the strongest full-day performance of any new model dropped this year. Congrats to the Qwen team! — https://nitter.net/OpenRouter/status/2040239467865489874#m

    → View original post on X — @jiqizhixin, 2026-04-04 02:47 UTC

  • Intelligent Remote Sensing Agents Transform Earth Observation with AI

    What if Earth observation could truly think for itself? A collaborative team from Hong Kong University of Science and Technology, Northwestern Polytechnical University, Tsinghua University, and international partners has released a seminal survey on "Intelligent Remote Sensing Agents." This new paradigm shows how AI agents integrate perception, planning, memory, and tool execution to autonomously achieve complex geospatial understanding. This transforms remote sensing from passive data collection into proactive, intelligent decision support, far surpassing previous capabilities in urban governance, precision agriculture, ecological monitoring, and emergency response. Intelligent Remote Sensing Agents: A Survey Paper: github.com/PolyX-Research/Aw… Repo: github.com/PolyX-Research/Aw… Our report: mp.weixin.qq.com/s/QYAyTjaAa… 📬 #PapersAccepted by Jiqizhixin
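
    The perceive-plan-act loop the survey describes can be caricatured in a few lines; everything here (the class name, the trivial planner, the tool registry) is a hypothetical stand-in, not an API from the survey:

```python
class RSAgent:
    """Toy remote-sensing agent: perception feeds memory, a (trivial)
    planner picks tools relevant to the goal, and tools are executed."""

    def __init__(self, tools):
        self.tools = tools   # name -> callable, e.g. an NDVI analyzer
        self.memory = []     # accumulated observations

    def step(self, observation, goal):
        self.memory.append(observation)                        # perception -> memory
        plan = [name for name in self.tools if name in goal]   # planning
        return {name: self.tools[name](observation) for name in plan}  # tool execution

# Toy run: one 'tool' that maps a scene to a vegetation product.
agent = RSAgent({"ndvi": lambda scene: "vegetation map"})
result = agent.step("multispectral tile", goal={"ndvi"})
```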

    → View original post on X — @jiqizhixin, 2026-04-04 01:40 UTC

  • MoGraphGPT: Creating Interactive 2D Scenes Without Coding

    What if building complex, interactive 2D scenes was as simple as describing them, with no coding needed? Enter MoGraphGPT! This new system leverages modular LLMs, using specialized AIs for individual scene elements and a central AI to manage their interactions, all through an intuitive graphical user interface with auto-generated sliders. It offers precise visual control for scene creation. MoGraphGPT significantly outperforms Cursor Composer, making the creation of multi-element 2D interactive scenes easier, more controllable, and higher performing, all without writing a single line of code. MoGraphGPT: Creating Interactive Scenes Using Modular LLM and Graphical Control Paper: ieeexplore.ieee.org/abstract… Our report: mp.weixin.qq.com/s/objKgAzNO… 📬 #PapersAccepted by Jiqizhixin
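
    The modular pattern described (one specialist model per scene element, a central coordinator merging their outputs) reduces to a routing function; the names below are hypothetical, and real element agents would be LLM calls rather than lambdas:

```python
def coordinate(scene_request, element_agents):
    """Route each element's spec to its specialist agent, then merge
    the per-element outputs into one scene description."""
    scene = {}
    for element, spec in scene_request.items():
        specialist = element_agents[element]  # one specialist per element type
        scene[element] = specialist(spec)
    return scene

# Toy specialists standing in for per-element LLMs.
agents = {
    "ball":   lambda spec: {"shape": "circle", "radius": spec["size"]},
    "paddle": lambda spec: {"shape": "rect", "width": spec["size"]},
}
scene = coordinate({"ball": {"size": 5}, "paddle": {"size": 20}}, agents)
```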

    → View original post on X — @jiqizhixin, 2026-04-03 19:50 UTC

  • VLMgineer: AI-Powered Robots Design Their Own Tools

    Can AI truly empower robots to invent their own solutions? George Jiayuan Gao, Tianyu Li, and colleagues from UPenn present VLMgineer. This framework leverages Vision Language Models (VLMs) to brainstorm initial tool designs and action plans. It then refines these ideas using evolutionary search in simulation, optimizing both the tool's geometry and how the robot uses it. VLMgineer consistently outperforms existing human-crafted tools and VLM-generated designs from human specifications across diverse, challenging everyday manipulation tasks, transforming complex robotics problems into straightforward executions. VLMgineer: Vision Language Models as Robotic Toolsmiths Project: vlmgineer.github.io Paper: arxiv.org/abs/2507.12644 Our report: mp.weixin.qq.com/s/FXdeQhAeq… 📬 #PapersAccepted by Jiqizhixin
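
    The refinement stage is described as evolutionary search in simulation; a generic elitist loop of that kind, where the 'design' is a single tool-length parameter and the fitness is a toy objective, neither taken from VLMgineer:

```python
import random

def evolve(init_design, fitness, mutate, generations=50, pop=16, keep=4):
    """Generic elitist evolutionary search: keep the best designs each
    generation and refill the population with mutated copies of them."""
    population = [init_design] * pop
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        elites = population[:keep]
        population = elites + [mutate(random.choice(elites))
                               for _ in range(pop - keep)]
    return max(population, key=fitness)

# Toy problem: the 'design' is a tool length whose ideal value is 3.0.
random.seed(0)
best = evolve(0.0,
              fitness=lambda length: -(length - 3.0) ** 2,
              mutate=lambda length: length + random.gauss(0, 0.3))
```

    In VLMgineer the VLM proposes the initial designs and the search then optimizes both the geometry and the accompanying action plan; this sketch shows only the search skeleton.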

    → View original post on X — @jiqizhixin, 2026-04-03 14:45 UTC

  • HACRL: AI Agents Learn Together Without Losing Autonomy

    What if diverse AI agents could mutually learn and improve without sacrificing their autonomy? Researchers from Beihang University, Bytedance China, Tsinghua University, and Peking University have just unveiled Heterogeneous Agent Collaborative Reinforcement Learning (HACRL)! This innovative framework allows different types of AI agents to share verified learning experiences during training, creating a bidirectional flow of knowledge to enhance performance for everyone. Unlike other multi-agent systems, it requires no coordinated deployment and fosters true peer-to-peer growth, not one-way teaching. Their HACPO algorithm consistently boosts all participating agents, outperforming GSPO by 3.3% on diverse reasoning benchmarks while dramatically cutting training data costs in half. Heterogeneous Agent Collaborative Reinforcement Learning Paper: arxiv.org/abs/2603.02604 Github Page: zzx-peter.github.io/hacrl/ Huggingface: huggingface.co/papers/2603.0… Our report: mp.weixin.qq.com/s/ggzim_4Pc… 📬 #PapersAccepted by Jiqizhixin
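
    The key mechanism in the summary is sharing only verified experiences across heterogeneous agents, in both directions; a toy buffer capturing that shape (class and method names are hypothetical, not HACRL's API):

```python
class SharedExperiencePool:
    """Toy cross-agent pool: trajectories enter only if their final
    answer passes verification, and every agent can read the others'."""

    def __init__(self, verify):
        self.verify = verify  # e.g. checks a final answer against ground truth
        self.pool = []        # (agent_id, trajectory) pairs

    def contribute(self, agent_id, trajectory, answer):
        if self.verify(answer):  # filter before sharing
            self.pool.append((agent_id, trajectory))

    def sample_for(self, agent_id):
        # Bidirectional flow: each agent trains on its peers' verified data.
        return [traj for src, traj in self.pool if src != agent_id]

pool = SharedExperiencePool(verify=lambda answer: answer == 42)
pool.contribute("agent_a", "trajectory_a", 42)  # verified, shared
pool.contribute("agent_b", "trajectory_b", 7)   # fails verification, dropped
```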

    → View original post on X — @jiqizhixin, 2026-04-03 12:43 UTC

  • Streamo: Real-time Streaming Video LLM for Intelligent Assistance

    What if an AI could truly understand live video streams and act as your intelligent assistant in real time? Researchers from Hong Kong Baptist University and Tencent Youtu Lab just unveiled a major step forward! They present Streamo, a real-time streaming video LLM. It's trained on a new, massive instruction dataset (Streamo-Instruct-465K) to enable unified understanding across many streaming video tasks. Streamo excels at real-time narration, complex action understanding, event captioning, and time-sensitive Q&A. It bridges the gap between static video analysis and genuinely interactive, intelligent multimodal AI assistants in continuous streams! Streaming Instruction Tuning Project: jiaerxia.github.io/Streamo/ Code: github.com/maifoundations/St… Our report: mp.weixin.qq.com/s/Q28azqwk-… 📬 #PapersAccepted by Jiqizhixin

    → View original post on X — @jiqizhixin, 2026-04-03 03:36 UTC

  • Anthropic Research: Emotion Concepts in Large Language Models

    Interesting 🤔 Anthropic (@AnthropicAI) New Anthropic research: Emotion concepts and their function in a large language model. All LLMs sometimes act like they have emotions. But why? We found internal representations of emotion concepts that can drive Claude’s behavior, sometimes in surprising ways. — https://nitter.net/AnthropicAI/status/2039749628737019925#m

    → View original post on X — @jiqizhixin, 2026-04-03 01:30 UTC

  • Action-to-Action Flow Matching: Ultra-Fast Robot Control Method

    What if real-time robot control didn't have to wait for slow, iterative action generation? MARS Lab at Nanyang Technological University (Jindou Jia et al.) introduces Action-to-Action Flow Matching (A2A). This novel method uses a robot's own historical actions to directly predict the next move, skipping the slow, random noise sampling of traditional diffusion models. A2A enables lightning-fast, single-step action generation (0.56 ms!), vastly outperforming existing methods in speed, training efficiency, robustness to visual noise, and generalization to unseen configurations. It even shows versatility in video generation! Action-to-Action Flow Matching Website: lorenzo-0-0.github.io/A2A_Fl…  arXiv: arxiv.org/pdf/2602.07322  Code: github.com/JIAjindou/A2A_Flo… Our report: mp.weixin.qq.com/s/mrSUcVLUA… 📬 #PapersAccepted by Jiqizhixin
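
    The speedup claim comes from replacing an iterative denoising chain with a single integration step that starts from the previous action rather than random noise; in flow-matching terms that is one Euler step, sketched here with a toy velocity field (the learned, observation-conditioned field in the paper is of course far richer):

```python
import numpy as np

def single_step_action(prev_action, velocity_field):
    """One Euler step from the previous action (the source sample) to
    the next action, instead of iterating from Gaussian noise."""
    return prev_action + velocity_field(prev_action)  # dt = 1, one step

# Toy velocity field pointing straight at a known target action.
target = np.array([0.5, 0.3])
next_action = single_step_action(np.array([0.2, -0.1]),
                                 lambda a: target - a)
assert np.allclose(next_action, target)
```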

    → View original post on X — @jiqizhixin, 2026-04-02 18:30 UTC