I made this plot as a quick vibe test for how "mini" o3-mini's world knowledge is. You can build small heuristic evals for the domains you care about to see how much o3-mini's degradation vs. o1 matters to you, and to track how far prompting gets you back toward parity.
Plot testing o3-mini’s world knowledge degradation compared to o1
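A small heuristic eval of the kind described above could look roughly like this. It is a minimal sketch, not the code behind the plot: the `Model` callable, the substring-match scoring, and the names `accuracy`, `compare`, and `parity_gap` are all assumptions made for illustration. Wire `Model` up to whatever client you actually use.

```python
from typing import Callable, Dict, List, Tuple

# Assumption: a "model" is any function that maps a prompt string to an answer string.
# In practice this would wrap your own API client; none is assumed here.
Model = Callable[[str], str]

# Each eval item is a question plus the list of strings accepted as a correct answer.
Item = Tuple[str, List[str]]


def accuracy(model: Model, items: List[Item]) -> float:
    """Fraction of items whose answer contains any accepted string (case-insensitive).

    Substring matching is a crude heuristic and can over- or under-count.
    """
    if not items:
        return 0.0
    hits = 0
    for question, accepted in items:
        answer = model(question).lower()
        if any(target.lower() in answer for target in accepted):
            hits += 1
    return hits / len(items)


def compare(models: Dict[str, Model], items: List[Item]) -> Dict[str, float]:
    """Score every named model on the same item set."""
    return {name: accuracy(model, items) for name, model in models.items()}


def parity_gap(scores: Dict[str, float], baseline: str, candidate: str) -> float:
    """Baseline accuracy minus candidate accuracy; 0.0 means parity on this item set."""
    return scores[baseline] - scores[candidate]
```

To use it, write a few dozen questions for your domain, pass in one callable per model (for example, one for o1 and one for o3-mini with your current prompt), and rerun `parity_gap` after each prompt change to see whether the gap is shrinking.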