Yeah, it's going to be interesting. You can do it either with 1) a 2nd cheap LLM that classifies the prompt as "reasoning" / "non-reasoning" and then modifies the system prompt and/or adds a special token as a toggle
2) do RLFH-style preference tuning to teach the LLM when to
LLM Prompt Classification and Reasoning Toggle Strategies
By
–
Leave a Reply