They've been feeding the image to the model as raw input since GPT-4 Vision back in late 2023. Here's a decent explanation of how that kind of model works:
GPT-4 Vision: Image Processing in Large Language Models