There you go: https://github.com/Vaibhavs10/scratchpad/blob/main/langchain_hf_api_arxiv.ipynb
… Uses PyPDF2. Let me know if you face any issues, happy to hack a space later if you want 🙂
LangChain PyPDF2 Integration for ArXiv Document Processing