the idea of analyzing things over a long derivation is about 60 years old, and will be part of the final answer; the particular way that o* does it (presumably via Chain of thought and RL but maybe other mechanisms) has not been transparently explained, and may or may not play a
Chain of Thought and RL in Extended AI Reasoning Analysis
By
–
Leave a Reply