LLaMA 65B can run on a MacBook! With a different model architecture it could probably run quite faster (we didn't use multi query, for instance) https://t.co/SMdlZC5NOo
— Guillaume Lample @ NeurIPS 2024 (@GuillaumeLample) March 11, 2023
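To give a rough sense of why multi-query attention matters for inference, here is a minimal sketch that compares the per-token key/value cache size under standard multi-head attention and under multi-query attention, where all query heads share a single key/value head. The helper name `kv_cache_bytes_per_token` is hypothetical, the 2-byte (fp16) cache entries are an assumption, and the model shape (80 layers, 64 heads, model dimension 8192) is the one reported for LLaMA 65B. It only counts cache memory and says nothing about actual MacBook throughput.

```python
def kv_cache_bytes_per_token(n_layers, n_kv_heads, head_dim, bytes_per_value=2):
    """Bytes of key/value cache added per generated token (hypothetical helper)."""
    # Keys and values are both cached in every layer, hence the factor of 2.
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_value


# LLaMA 65B shape: 80 layers, 64 attention heads, model dimension 8192.
N_LAYERS = 80
N_HEADS = 64
HEAD_DIM = 8192 // N_HEADS  # 128

# Multi-head attention: one key/value head per query head.
mha = kv_cache_bytes_per_token(N_LAYERS, N_HEADS, HEAD_DIM)

# Multi-query attention: a single key/value head shared by all query heads.
mqa = kv_cache_bytes_per_token(N_LAYERS, 1, HEAD_DIM)

print(f"multi-head:  {mha} bytes per token")
print(f"multi-query: {mqa} bytes per token")
print(f"reduction:   {mha // mqa}x")
```

Under these assumptions the cache shrinks by a factor equal to the number of heads, which reduces the memory that has to be read for every generated token.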