DeepSeek-VL2: Enhanced Multimodal Model with Dynamic Tiling

AI Dynamics

Global AI News Aggregator

13 December 2024 13h48

Improvements from v1:
> 2x high-quality training data vs DeepSeek-VL1
> Dynamic image tiling for flexible resolutions, plus an efficient DeepSeek-MoE language model
> 3-stage training and new multimodal parallel strategies for efficiency
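
The post does not say how dynamic tiling chooses a layout. As a rough illustration of the general idea only, the sketch below picks a grid of fixed-size tiles whose canvas best matches an image's aspect ratio, so that little area is wasted on padding. The helper name `choose_tile_grid`, the 384-pixel tile size, and the cap of 9 tiles are assumptions made for this example, not details taken from the post.

```python
from itertools import product


def choose_tile_grid(width, height, tile=384, max_tiles=9):
    """Return (cols, rows) for a tile grid that wastes the least padding.

    Hypothetical helper: `tile` and `max_tiles` are illustrative defaults.
    """
    best_key, best_grid = None, None
    for cols, rows in product(range(1, max_tiles + 1), repeat=2):
        if cols * rows > max_tiles:
            continue  # respect the tile budget
        canvas_w, canvas_h = cols * tile, rows * tile
        # Resize the image to fit inside the canvas, keeping its aspect ratio.
        scale = min(canvas_w / width, canvas_h / height)
        used_area = (width * scale) * (height * scale)
        padding = round(canvas_w * canvas_h - used_area, 6)
        # Prefer less padding; on ties, prefer fewer tiles.
        key = (padding, cols * rows)
        if best_key is None or key < best_key:
            best_key, best_grid = key, (cols, rows)
    return best_grid


if __name__ == "__main__":
    for size in [(384, 384), (768, 384), (384, 1152)]:
        print(size, "->", choose_tile_grid(*size))
```

Under these assumptions, a wide image gets more columns than rows and a tall image the reverse, which is what lets a single model handle varied resolutions without distorting the input.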

Source: original post on X by @reach_vb, 13 December 2024

Tags: AI, Generative AI, Innovation, LLMs, Machine Learning, Multimodal AI, Open Source, Research
