AI Dynamics

Global AI News Aggregator

About

Multimodal AI: Architectural Challenges Beyond Text Generation

Everyone is working on multimodal systems.
The question is how to do it.
And the problem is that the kind of generative architecture that works for text does not work for images and video.

→ View original post on X — @ylecun