And if you are looking for inference-mode quantization, you can pass `--quantize "bnb.nf4"` to the `generate/base.py` script as well.
I am currently using that for running CodeLlama 34B models.
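As a minimal sketch of that invocation, assuming a Lit-GPT-style checkout with the CodeLlama 34B weights already downloaded and converted (the checkpoint directory and prompt below are illustrative; adjust them for your setup):

```shell
# Run inference with 4-bit NormalFloat (NF4) quantization via bitsandbytes.
# The checkpoint path is an assumption -- point it at your converted weights.
python generate/base.py \
  --checkpoint_dir checkpoints/codellama/CodeLlama-34b-hf \
  --quantize "bnb.nf4" \
  --prompt "Write a function that reverses a linked list."
```

NF4 quantization loads the weights in 4-bit precision at inference time, which is what makes a 34B-parameter model fit on a single consumer GPU.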
Inference Mode Quantization with BNB NF4 for CodeLlama