AI Dynamics

Global AI News Aggregator

About

Exploration of Multimodal Pretraining Beyond Language Modeling

Yann LeCun Saining Xie insane crossover of the 2 biggest visual representation researchers in the AI field “Beyond Language Modeling: An Exploration of Multimodal Pretraining” Right now, most multimodal models are basically a language model with a vision adapter bolted on,

→ View original post on X — @askalphaxiv