Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks paper page: https://huggingface.co/papers/2306.04362
… To promote the development of Vision-Language Pre-training (VLP) and multimodal Large Language Model (LLM) in the Chinese community, we
Youku-mPLUG: 10M Chinese Video-Language Dataset for Pre-training