10). Inner Thinking Transformers (ITT) – A new method that improves reasoning efficiency in small-scale LLMs via dynamic depth scaling. ITT aims to mitigate parameter bottlenecks in these models, so that reasoning can scale without increasing model size.
Inner Thinking Transformers: Scaling Reasoning in Small LLMs
