Improvements from v1: > 2x high-quality training data vs DeepSeek-VL1
> Dynamic image tiling for flexible resolutions + efficient DeepSeek-MoE for LM
> 3-stage training + new multi-modal parallel strategies for efficiency
DeepSeek-VL2: Enhanced Multimodal Model with Dynamic Tiling
By
–
Leave a Reply