Vision Language: (3/6) Earlier this year, we launched the Yi Vision Language models (Yi-VL-34B and Yi-VL-6B), which offer bilingual multimodal understanding and generation capabilities. Check out the architecture of the Yi-VL models and our three-stage training process.
Yi Vision Language Models: Architecture and Three-Stage Training Process