LMs can struggle w/open-ended tasks that have constraints, such as advanced puzzles & math proofs. MIT’s "DisCIPL" uses an LLM to steer smaller LMs to collaborate on these prompts. It achieves accuracy & efficiency comparable to much leading models: https://
bit.ly/4rLcYmi
MIT’s DisCIPL: Steering Smaller LMs for Complex Problem-Solving
By
–
Leave a Reply