AutoQuant is a simple Colab notebook to make your own quants It supports GGUF, GPTQ, EXL2, AWQ, and HQQ quants. With GGUF, you can enter a list of precisions. It will conveniently quantize and push them one by one to HF. I just updated my abliterated Llama 3.1 8B using it.
