Treat Visual Tokens as Text? But Your MLLM Only Needs Fewer Efforts to See
https://arxiv.org/abs/2410.06169
https://github.com/ZhangAIPI/YOPO_MLLM_Pruning/tree/main?tab=readme-ov-file
…
YOPO Pruning: Efficient Visual Token Processing for MLLMs