AI Dynamics

Global AI News Aggregator

About

QLoRA Bug Consumes 4888GB VRAM in Llama 3 405B

One bug in bitsandbytes caused QLoRA to consume an extra *read notes* 4888 GB of VRAM. Even with QLoRA, fine-tuning Llama 3 405B on long sequence lengths ain't cheap.

→ View original post on X — @maximelabonne,