LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation
Discuss: https://huggingface.co/papers/2408.15881
… We introduce LLaVA-MoD, a novel framework designed to enable the efficient training of small-scale Multimodal Language Models (s-MLLM) by distilling knowledge from large-scale MLLM
LLaVA-MoD: Efficient Small Multimodal Models via Knowledge Distillation
