I have never trained a ternary LLM or BERT language model. It looks like there are only a few publicly available repos out there. I would probably start with the "TernaryBERT: Distillation-aware Ultra-low Bit BERT" paper and go from there.
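For orientation, here is a rough sketch of the kind of weight ternarization that paper builds on, as far as I understand it: a TWN-style approximation where each weight is mapped to -1, 0, or +1 and rescaled by a single factor. This is not the authors' code. The function name `ternarize` and the parameter `delta_scale` are made up for illustration, the 0.7 threshold factor is the usual TWN heuristic, and the distillation part of the paper is not shown at all.

```python
def ternarize(weights, delta_scale=0.7):
    """TWN-style ternarization of a flat list of floats (illustrative only).

    Returns (alpha, codes) with codes in {-1, 0, +1}, so that each
    original weight is approximated by alpha * code.
    """
    if not weights:
        return 0.0, []
    abs_w = [abs(w) for w in weights]
    # Threshold: a fraction of the mean absolute weight (assumed heuristic).
    delta = delta_scale * sum(abs_w) / len(abs_w)
    # Weights at or below the threshold become 0; the rest keep their sign.
    codes = [0 if a <= delta else (1 if w > 0 else -1)
             for w, a in zip(weights, abs_w)]
    # Scale: mean magnitude of the weights that survived the threshold.
    kept = [a for a in abs_w if a > delta]
    alpha = sum(kept) / len(kept) if kept else 0.0
    return alpha, codes


weights = [0.9, -0.05, 0.4, -1.1, 0.02, -0.6]
alpha, codes = ternarize(weights)
approx = [alpha * c for c in codes]  # dequantized approximation of weights
print(codes)  # [1, 0, 1, -1, 0, -1]
```

In actual training you would keep the full-precision weights around, ternarize them on each forward pass, and apply the gradients to the full-precision copy. The sketch above covers only the quantization step.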
Ternary LLM and BERT Training: Starting with TernaryBERT Paper