4/ LM-Guided Chain-of-Thought – applies knowledge distillation to a small LM, using rationales generated by a large LM. At inference time, the lightweight LM generates the rationale and the frozen large LM predicts the answer.
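The two-stage flow described above can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the models are plain text-in, text-out callables, the prompt templates are made up, and "distillation" is replaced by a lookup table standing in for actual fine-tuning of the small LM. All function names (`build_distillation_set`, `distill_small_lm`, `answer_with_guidance`, `toy_large_lm`) are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

# Assumption: any model is represented as a text-in, text-out callable.
TextModel = Callable[[str], str]

# Hypothetical prompt templates, not taken from the original method.
RATIONALE_PROMPT = "Question: {q}\nExplain step by step:"
ANSWER_PROMPT = "Question: {q}\nRationale: {r}\nAnswer:"


def build_distillation_set(large_lm: TextModel,
                           questions: List[str]) -> List[Tuple[str, str]]:
    """Stage 1a: the large LM writes one rationale per question."""
    return [(q, large_lm(RATIONALE_PROMPT.format(q=q))) for q in questions]


def distill_small_lm(pairs: List[Tuple[str, str]]) -> TextModel:
    """Stage 1b: stand-in for fine-tuning the small LM on the rationales.

    A real setup would train a lightweight model on (question, rationale)
    pairs; here the pairs are simply memorized in a dictionary.
    """
    table: Dict[str, str] = dict(pairs)
    return lambda question: table.get(question, "")


def answer_with_guidance(small_lm: TextModel, large_lm: TextModel,
                         question: str) -> str:
    """Stage 2: the small LM supplies the rationale, and the frozen
    large LM predicts the answer conditioned on it."""
    rationale = small_lm(question)
    return large_lm(ANSWER_PROMPT.format(q=question, r=rationale))


def toy_large_lm(prompt: str) -> str:
    """Toy large LM: returns a canned rationale, or, when a rationale is
    present in the prompt, returns the last word of that rationale."""
    if "Rationale:" in prompt:
        rationale = prompt.split("Rationale:")[1].split("\nAnswer:")[0].strip()
        return rationale.rstrip(".").split()[-1]
    return "2 plus 3 equals 5."


if __name__ == "__main__":
    question = "What is 2 + 3?"
    pairs = build_distillation_set(toy_large_lm, [question])
    small_lm = distill_small_lm(pairs)
    print(answer_with_guidance(small_lm, toy_large_lm, question))
```

The large LM's weights are never updated in this sketch: it is called once to produce training rationales and once more to predict the answer, while only the small LM is "trained".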
LM-Guided Chain-of-Thought Knowledge Distillation Technique