Alibaba dropped Qwen2.5-Omni-7B, a new multimodal AI
— Rowan Cheung (@rowancheung) 27 mars 2025
—Processes across text, audio, image, and video in real-time
—Text and speech outputs
—Strong speech understanding, outperforms specialized audio models
—Can run on phones, laptops
—Open-sourcepic.twitter.com/WpOTdqv0N6
Alibaba dropped Qwen2.5-Omni-7B, a new multimodal AI —Processes across text, audio, image, and video in real-time
—Text and speech outputs
—Strong speech understanding, outperforms specialized audio models
—Can run on phones, laptops
—Open-source
Leave a Reply