GPT-4 can take two important modalities that humans rely on – vision and language. Its output is text though(maybe in the future it will be many-to-many). GPT-4 can explain images, helps you understand what's in images, decode memes, etc. A few examples of GPT-4 on visual input.
GPT-4 Vision and Language Capabilities with Visual Input Examples
By
–
Leave a Reply