llama.cpp inference and gptq quantization techniques exploration

AI Dynamics

Global AI News Aggregator

llama.cpp inference and gptq quantization techniques exploration

–

23 August 2023 23h25

Oh, reading a bit more about llama.cpp (
https://
github.com/ggerganov/llam
a.cpp
…), that's only inference, not training? I haven't tried since I don't have the model checkpoints on my laptop, but you may be able to use gptq.int4 quantization then: https://
github.com/Lightning-AI/l
it-gpt/blob/main/tutorials/quantize.md
…

→ View original post on X — @rasbt,

23 August 2023

AI Dynamics

llama.cpp inference and gptq quantization techniques exploration

Commentaires

Leave a Reply Cancel reply

MORE ARTICLES

AI Generates Perfect Jokes Using Image Generation Skills

Codex App Transformation: Atlas Integration Reshapes User Experience

AI File Access Limitations: Screenshot vs Disk Storage Issues

Synthetic Aperture Radar: Satellite Tech for Global Monitoring