The whale strikes again! DeepSeekVL 2 > DeepSeek-VL2-Tiny, DeepSeek-VL2-Small, and DeepSeek-VL2, with 1.0B, 2.8B, and 4.5B activated parameters
> SoTA perf with similar or fewer activated parameters compared to Qwen 2 VL
> Excels at visual question answering, optical
DeepSeek-VL2 Achieves SoTA Vision Performance with Fewer Parameters
By
–
Leave a Reply