Spaces: chunking-ai/smoldocling-preview

taprosoft committed on Feb 25

Commit 204f5db

1 Parent(s): 16e4c39

fix: update torch version

Files changed (4)

Dockerfile ADDED

+FROM nvidia/cuda:12.1.1-cudnn8-devel-ubuntu22.04
+ARG DEBIAN_FRONTEND=noninteractive
+ENV PYTHONUNBUFFERED=1
+RUN apt-get update && apt-get install --no-install-recommends -y \
+    build-essential \
+    python3.10-dev \
+    python3-pip \
+    git \
+    ffmpeg \
+    poppler-utils \
+    libpoppler-dev \
+    tesseract-ocr \
+    && apt-get clean && rm -rf /var/lib/apt/lists/*
+WORKDIR /code
+COPY ./requirements.txt /code/requirements.txt
+# Set up a new user named "user" with user ID 1000
+RUN useradd -m -u 1000 user
+# Switch to the "user" user
+USER user
+# Set home to the user's home directory
+ENV HOME=/home/user \
+    PATH=/home/user/.local/bin:$PATH \
+    PYTHONPATH=$HOME/app \
+    PYTHONUNBUFFERED=1 \
+    GRADIO_SERVER_NAME=0.0.0.0
+RUN pip3 install --no-cache-dir --upgrade -r /code/requirements.txt
+# Set the working directory to the user's home directory
+WORKDIR $HOME/app
+# Copy the current directory contents into the container at $HOME/app setting the owner to the user
+COPY --chown=user . $HOME/app
+CMD ["python3", "app.py"]

README.md CHANGED

@@ -3,9 +3,7 @@ title: PDFParsersPlayground
 emoji: 🐢
 colorFrom: blue
 colorTo: green
-sdk: gradio
-sdk_version: 5.7.1
-app_file: app.py
+sdk: docker
 pinned: false
 short_description: Convert PDFs to Markdown with open-source parsers
 ---

app.py CHANGED

@@ -1,4 +1,4 @@
-from utils import fix_problematic_imports  # noqa
+from utils import fix_problematic_imports
 fix_problematic_imports()  # noqa
@@ -19,7 +19,7 @@ from backends import (
 from backends.settings import ENABLE_DEBUG_MODE
 from utils import remove_images_from_markdown, trim_pages
-TRIMMED_PDF_PATH = Path("/tmp/gradio/trim")
+TRIMMED_PDF_PATH = Path("/tmp/trimmed_input")
 TRIMMED_PDF_PATH.mkdir(exist_ok=True)
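
Note on the second app.py hunk: `Path.mkdir(exist_ok=True)` without `parents=True` raises `FileNotFoundError` when the parent directory is missing. The old two-level path therefore depends on `/tmp/gradio` already existing, while the new one-level path only needs `/tmp`. The commit does not state that this is the reason for the change; that reading is an assumption. The sketch below reproduces the behaviour in a scratch directory. The helper `make_dir` and the scratch layout are hypothetical and not part of the repository.

```python
import tempfile
from pathlib import Path


def make_dir(path: Path, parents: bool = False) -> bool:
    """Try to create `path`; return False if a parent directory is missing."""
    try:
        path.mkdir(parents=parents, exist_ok=True)
        return True
    except FileNotFoundError:
        return False


# Work inside a fresh scratch directory so the real /tmp layout is untouched.
root = Path(tempfile.mkdtemp())

nested = root / "gradio" / "trim"  # mirrors the old two-level path
flat = root / "trimmed_input"      # mirrors the new one-level path

nested_ok = make_dir(nested)  # parent "gradio" does not exist yet
flat_ok = make_dir(flat)      # parent is the existing scratch root
nested_with_parents_ok = make_dir(nested, parents=True)  # creates "gradio" too

print(nested_ok, flat_ok, nested_with_parents_ok)
```

If a nested location is ever needed again, passing `parents=True` alongside `exist_ok=True` removes the dependency on another component creating the parent first.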

cuda_requirements.txt ADDED


+torch @ https://download.pytorch.org/whl/test/cu118/torch-2.6.0%2Bcu118-cp310-cp310-linux_x86_64.whl
+torchvision @ https://download.pytorch.org/whl/test/cu118/torchvision-0.21.0%2Bcu118-cp310-cp310-linux_x86_64.whl
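
Note on the pinned wheels: each URL encodes the package version, a local version label, and the interpreter, ABI, and platform tags the wheel was built for. The sketch below splits the two filenames into those fields. It assumes the standard wheel naming scheme with no optional build tag, which holds for both entries here. `parse_wheel_url` is a hypothetical helper, not something defined in the repository.

```python
from urllib.parse import unquote, urlsplit

# The two direct references added in cuda_requirements.txt.
WHEEL_URLS = [
    "https://download.pytorch.org/whl/test/cu118/torch-2.6.0%2Bcu118-cp310-cp310-linux_x86_64.whl",
    "https://download.pytorch.org/whl/test/cu118/torchvision-0.21.0%2Bcu118-cp310-cp310-linux_x86_64.whl",
]


def parse_wheel_url(url: str) -> dict:
    """Split a wheel URL into name, version, and tag fields.

    Assumes name-version-python-abi-platform with no build tag.
    """
    filename = unquote(urlsplit(url).path.rsplit("/", 1)[-1])  # decode %2B to "+"
    stem = filename[: -len(".whl")]
    name, version, python_tag, abi_tag, platform_tag = stem.split("-")
    public, _, local = version.partition("+")  # "2.6.0+cu118" -> "2.6.0", "cu118"
    return {
        "name": name,
        "version": public,
        "local": local,
        "python_tag": python_tag,
        "abi_tag": abi_tag,
        "platform_tag": platform_tag,
    }


parsed = [parse_wheel_url(u) for u in WHEEL_URLS]
for info in parsed:
    print(info)
```

Both wheels carry the `cp310` tag, which matches the `python3.10-dev` package installed in the Dockerfile, and the `cu118` local version label.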