Whisper Model 8-bit Loading: Memory-Efficient Inference

Like most models in the transformers library, all Whisper checkpoints can be loaded in a memory-efficient way. Passing load_in_8bit=True when loading the model quantizes its weights to 8-bit precision, so even a Whisper-large checkpoint fits in under 6.6 GB of VRAM.
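A minimal sketch of 8-bit loading, assuming a CUDA GPU and that transformers, accelerate, and bitsandbytes are installed; the checkpoint name `openai/whisper-large-v2` is one example, any Whisper checkpoint should work the same way:

```python
# Sketch: load a Whisper checkpoint with 8-bit weights via bitsandbytes.
# Assumes: pip install transformers accelerate bitsandbytes, plus a CUDA GPU.
from transformers import WhisperForConditionalGeneration, WhisperProcessor

model_id = "openai/whisper-large-v2"  # example checkpoint (assumption)
processor = WhisperProcessor.from_pretrained(model_id)
model = WhisperForConditionalGeneration.from_pretrained(
    model_id,
    load_in_8bit=True,   # quantize linear-layer weights to int8
    device_map="auto",   # let accelerate place layers on available devices
)

# Rough check of how much memory the quantized weights occupy.
print(f"{model.get_memory_footprint() / 1024**3:.2f} GiB")
```

The memory saving follows from simple arithmetic: Whisper-large has roughly 1.55 billion parameters, so fp32 weights alone need about 6.2 GB, while int8 weights need about 1.55 GB, leaving plenty of headroom for activations during inference.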