We saw the same with StarCoder in @BigCodeProject: trained on code, the 15B StarCoder beats much larger models, such as Cohere Command beta 52B, Anthropic-LM v4 52B, Aleph-Alpha Luminous Supreme 70B, and OPT 175B, on HELM synthetic reasoning and logic tasks.
StarCoder 15B Outperforms Larger Models on Reasoning Tasks