📺 Learning Video Representations from Large Language Models
— AI at Meta (@AIatMeta) 20 juin 2023
This work repurposed pre-trained LLMs to be conditioned on visual input, and finetune them to create automatic video narrators.
Paper ➡️ https://t.co/OsZt3AnAEM
3/7 pic.twitter.com/DLunz5bcgP
Learning Representations from Large Language Models This work repurposed pre-trained LLMs to be conditioned on visual input, and finetune them to create automatic video narrators. Paper https://
bit.ly/3NDCWpR 3/7
Leave a Reply