Now let's get into the details. GPT-4 is multimodal and it now accepts the images as inputs and generates captions, classifications, and analyses. Below is one such example of giving an input image of ingredients and asking GPT-4 to generate a list of recipes.
GPT-4 Multimodal Capabilities: Image Input and Recipe Generation
By
–
Leave a Reply