VLMs are going through quite an open revolution AND on-device friendly sizes: > Google DeepMind w/ PaliGemma2 – 3B, 10B & 28B > OpenGVLabs w/ InternVL 2.5 – 1B, 2B, 4B, 8B, 26B, 38B & 78B > Qwen w/ Qwen 2 VL – 2B, 7B & 72B > Microsoft w/ FlorenceVL – 3B & 8B (Links below)
Major VLM Revolution: Open-Source Models from Google, OpenGVLabs, Qwen, Microsoft
By
–
Leave a Reply