The team's approach: Introducing Self-Distilled Sparse Drafters (SD²), a novel methodology that leverages self-data distillation and fine-grained weight sparsity to produce highly efficient and well-aligned draft models.
Self-Distilled Sparse Drafters: Efficient AI Model Methodology
By
–