Thanks for clarifying. In my view, "pre-training, post-training, post-inference" would be too general, because that is also the flow for LLMs that are not specialized reasoning models. In a sense, all reasoning techniques are post-training techniques because they all are
Reasoning Models vs General LLMs Post-Training Techniques