My favorite part about @honicky
's Paper Club session this week on the 1-bit LLMs paper – relating it to @jefrankle
's Beyond Chinchilla laws and adjusting the equations for the memory/latency characteristics of 1-bit LLMs to derive an optimal param count/data size to aim for. no
1-bit LLMs: Optimizing Parameters and Data with Chinchilla Laws
By
–
Leave a Reply