Just PyMuPdf! So figures are dropped, a good direction for improvement would be to keep images, but then we'd need to switch to a VLM instead of LLM.
PyMuPdf drops figures; keeping images requires switching to VLM
By
–
By
–
Just PyMuPdf! So figures are dropped, a good direction for improvement would be to keep images, but then we'd need to switch to a VLM instead of LLM.