Having another token type would add complexity. Do you track and store reasoning tokens? If the initial reasoning is off, and you provide it feedback, how well will it guide the next reasoning, especially if the model drops those tokens (or so it seems that way based on how I’m
Reasoning tokens: tracking, feedback, and model consistency challenges
By
–
Leave a Reply