ImageBind: One Embedding Space To Bind Them All ImageBind is a new and the first model capable of learning from six modalities(images, text, audio, depth, thermal, and IMU data). ImageBind extends zero-shot capabilities of vision-language systems to new modalities by using
ImageBind: Revolutionary Multi-Modal AI Model for Six Data Types
By
–
