Spaces: davanstrien/ocr-time-machine

davanstrien HF Staff committed on Jun 30

Commit 4af31e3

1 Parent(s): beca8ab

description

Files changed (1)

app.py CHANGED

@@ -408,13 +408,14 @@ with gr.Blocks() as demo:
         "For decades, galleries, libraries, archives, and museums (GLAMs) have used Optical Character Recognition "
         "to transform digitized books, newspapers, and manuscripts into machine-readable text. Traditional OCR "
         "produces complex XML formats like ALTO, packed with layout details but difficult to use. "
-        "Now, cutting-edge Vision-Language Models (VLMs) are revolutionizing OCR with simpler, cleaner Markdown output. "
-        "This Space makes it easy to compare these two approaches and see which works best for your historical documents. "
-        "Upload a historical document image and its XML file to compare these approaches side-by-side. "
+        "Now, Vision-Language Models (VLMs) are revolutionizing OCR with simpler, cleaner output. "
+        "This Space lets you compare three leading VLM-based OCR models against traditional approaches. "
+        "Upload a historical document image and its XML file to see them side-by-side. "
         "We'll extract the reading order from your XML for an apples-to-apples comparison of the actual text content.\n\n"
-        "**Available models:** [RolmOCR](https://huggingface.co/reducto/RolmOCR) | "
-        "[Nanonets-OCR-s](https://huggingface.co/nanonets/Nanonets-OCR-s) | "
-        "[olmOCR](https://huggingface.co/allenai/olmOCR-7B-0225-preview)"
+        "**Available models:**\n"
+        "• [RolmOCR](https://huggingface.co/reducto/RolmOCR) - Fast & general-purpose\n"
+        "• [Nanonets-OCR-s](https://huggingface.co/nanonets/Nanonets-OCR-s) - Advanced with table/math support\n"
+        "• [olmOCR](https://huggingface.co/allenai/olmOCR-7B-0225-preview) - Allen AI's pioneering 7B document specialist"
     )
     gr.Markdown("---")
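
The intro text above says the Space extracts the reading order from the uploaded XML so that the traditional OCR text can be compared with VLM output. The diff does not show how app.py does this, so the following is only a minimal sketch of one way it could work, assuming the XML is ALTO and that document order of `TextLine` elements is an acceptable stand-in for reading order. The names `local_name`, `alto_to_text`, and `SAMPLE_ALTO` are hypothetical and do not come from the commit; hyphenation (`HYP`) elements are ignored for brevity.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip any '{namespace}' prefix so the code works across ALTO versions."""
    return tag.rsplit("}", 1)[-1]


def alto_to_text(xml_string: str) -> str:
    """Return plain text from an ALTO document, one TextLine per output line.

    Assumption: lines are emitted in document order, which is treated here
    as the reading order.
    """
    root = ET.fromstring(xml_string)
    lines = []
    for element in root.iter():
        if local_name(element.tag) != "TextLine":
            continue
        # Each word in ALTO is a String element with a CONTENT attribute.
        words = [
            child.attrib.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        if words:
            lines.append(" ".join(words))
    return "\n".join(lines)


# Hypothetical two-line ALTO fragment used only for illustration.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout><Page><PrintSpace>
    <TextBlock ID="b1">
      <TextLine><String CONTENT="Optical"/><SP/><String CONTENT="Character"/></TextLine>
      <TextLine><String CONTENT="Recognition"/></TextLine>
    </TextBlock>
  </PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    print(alto_to_text(SAMPLE_ALTO))
```

Matching on the local tag name rather than a fixed namespace URI is a deliberate choice in this sketch, since ALTO files from different digitization projects declare different schema versions.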