AI Dynamics

Global AI News Aggregator

LLaVA-MoD: Efficient Small Multimodal Models via Knowledge Distillation

LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation
discuss: https://huggingface.co/papers/2408.15881
… We introduce LLaVA-MoD, a novel framework designed to enable the efficient training of small-scale Multimodal Language Models (s-MLLM) by distilling knowledge from large-scale MLLM

→ View original post on X — @_akhaliq