Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning. A theory to explain why ensemble and knowledge distillation work for Deep Learning. Paper https://
openreview.net/forum?id=Uuf2q
9TfXGA
… 4/6
Ensemble, Knowledge Distillation and Self-Distillation Theory
By
–
Leave a Reply