typing beautifulsoup4 pdf2image gradio pdfplumber python-docx gradio python-pptx numpy<2 torch>=2 spaces transformers loadimg torchvision pillow scikit-image