mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding paper page: https://
huggingface.co/papers/2307.02
499
… Document understanding refers to automatically extract, analyze and comprehend information from various types of digital documents, such as a web page.
mPLUG-DocOwl: Multimodal LLM for Document Understanding
By
–
