In Davis Blalock's latest arXiv roundup, BTLM is mentioned as achieving 7B parameter performance in a 3B parameter model. Davis mentions, "I’m not sure I’ve seen any other non-GPU hardware vendors do this with a model of this size and quality." Read here: https://
dblalock.substack.com/i/136702777/bt
lm-b-k-b-parameter-performance-in-a-b-parameter-model
…
BTLM Achieves 7B Performance in 3B Parameter Model
By
–
