GPT-4o level multimodal LLM running on your phone.
— Shubham Saboo (@Saboo_Shubham_) 5 septembre 2025
MiniCPM-V 4.5 outperforms GPT-4o, Gemini-2.0 Pro, and Qwen2.5-VL 72B on vision and language AI tasks.
It can even understand videos and perform OCR in 30+ languages.
And it's 100% Opensource. pic.twitter.com/wY70HlmzDS
GPT-4o level multimodal LLM running on your phone. MiniCPM-V 4.5 outperforms GPT-4o, Gemini-2.0 Pro, and Qwen2.5-VL 72B on vision and language AI tasks. It can even understand videos and perform OCR in 30+ languages. And it's 100% Opensource.
Leave a Reply