Interesting. Maybe it's one variant with thinking always on and one with it always off, similar to the IBM Granite model with the toggle. And on the Claude website, maybe they have another "small" LLM that routes the user prompt and controls that toggle.
LLM thinking toggle routing and model selection strategies