I remember that 4o was meant to become multimodal, so that it could generate images directly without needing DALL-E underneath. Is it happening? I first noticed this on the Cyberpunk Artist GPT.
Discussion on GPT-4o Multimodal Image Generation Capabilities