PyPDF2 pytesseract langchain langchain_google_genai sentence-transformers faiss-cpu langchain_huggingface langchain_community langchain_openai