AI Model Struggles With Analog Clock Reading Task

It got a horse riding an astronaut on the first try, but it didn't pull off an analog clock reading 3pm, which remains the great undefeated test (better prompting might help).