PyPDF2 pytesseract langchain langchain_google_genai sentence-transformers faiss-cpu