Yann LeCun and Saining Xie: an insane crossover of the two biggest visual representation researchers in the AI field. “Beyond Language Modeling: An Exploration of Multimodal Pretraining.” Right now, most multimodal models are basically a language model with a vision adapter bolted on,
Beyond Language Modeling: An Exploration of Multimodal Pretraining
