AI Dynamics

Global AI News Aggregator

GPT-4 Vision and Language Capabilities with Visual Input Examples

GPT-4 accepts two important modalities that humans rely on: vision and language. Its output is text only, though (maybe in the future it will be many-to-many). GPT-4 can explain images, help you understand what's in them, decode memes, and more. Here are a few examples of GPT-4 on visual input.

→ View original post on X: @jeande_d
