It isn’t important; this task isn’t practical at all. It’s just a compact illustration of the broadened class of tasks o1 can do vs. earlier models, e.g. tasks that seem to require a guess-and-check loop with dozens of iterations. A REPL is still best for many real-world tasks.