AI Dynamics

Global AI News Aggregator

Google’s Omni multimodal model and blending potential

Google has the only true Omni model, but the elements aren't hooked up. It appears it can take in & output audio, images, video, songs, text, code, etc. But right now each type of output is separate. Once you can access the model directly and blend modes, a lot becomes possible.

→ View original post on X: @emollick