My @MLSysConf keynote is now online. https://
mlsys.org/virtual/2025/i
nvited-talk/2887
… The scaling of large language models has led to impressive gains in language understanding, but at a cost of insatiable memory and bandwidth requirements. I advocated a principled approach of designing optimization
Optimizing Large Language Models: Memory and Bandwidth Solutions
By
–