Spaces:
Paused
Paused
Update README.md
Browse files
README.md
CHANGED
@@ -12,9 +12,9 @@ short_description: An OCR application integrated with GOT OCR 2.0
|
|
12 |
|
13 |
<mark>OCR Model Integration Using Gradio:</mark>
|
14 |
|
15 |
-
|
16 |
|
17 |
-
|
18 |
torch
|
19 |
transformers
|
20 |
gradio
|
@@ -27,28 +27,28 @@ accelerate
|
|
27 |
|
28 |
all these libraries are included in requirements.txt to install them : pip install -r requirements.txt
|
29 |
|
30 |
-
|
31 |
|
32 |
The current web page is running on the hugging face space which is using paid GPU that is Nvidia T4 medium.
|
33 |
|
34 |
|
35 |
-
|
36 |
OCR Model: This project uses the GOT-OCR 2.0 model from Hugging Face.
|
37 |
Frontend: The frontend is built using Gradio, which provides an easy-to-use web interface.
|
38 |
Keyword Search: Users can search for specific keywords in the extracted text. The search is case-insensitive, and the matching keywords are highlighted using HTML <mark> tags with customizable colors.
|
39 |
|
40 |
-
|
41 |
The project uses a pre-trained OCR model from Hugging Face:
|
42 |
|
43 |
-
|
44 |
Architecture: Transformer-based model, fine-tuned for Optical Character Recognition.
|
45 |
Framework: Hugging Face's transformers library.
|
46 |
The model is loaded using the AutoTokenizer and AutoModel classes from Hugging Face and runs on a CUDA-enabled device.
|
47 |
|
48 |
-
|
49 |
The project uses Gradio to create an easy-to-use web interface for interacting with the model. The interface allows users to upload images, extract text, and search for keywords in the extracted text.
|
50 |
|
51 |
-
|
52 |
Image Upload: The user uploads an image, and the text is extracted using the OCR model.
|
53 |
Keyword Search: Users input a keyword to search within the extracted text.
|
54 |
Highlighting: Keywords found in the text are highlighted with a customizable color using HTML <mark> tags
|
|
|
12 |
|
13 |
<mark>OCR Model Integration Using Gradio:</mark>
|
14 |
|
15 |
+
**This project integrates a pre-trained OCR (Optical Character Recognition) model with a Gradio-based web interface. Users can upload an image (JPEG format), extract the text using the model, and search for specific keywords in the extracted text. The keywords are highlighted within the displayed results.**
|
16 |
|
17 |
+
**dependencies / libraries required:**
|
18 |
torch
|
19 |
transformers
|
20 |
gradio
|
|
|
27 |
|
28 |
all these libraries are included in requirements.txt to install them : pip install -r requirements.txt
|
29 |
|
30 |
+
**ALSO this model requires a GPU to run , so make sure you have NVIDIA CUDA or similar technologies.**
|
31 |
|
32 |
The current web page is running on the hugging face space which is using paid GPU that is Nvidia T4 medium.
|
33 |
|
34 |
|
35 |
+
**Project Overview**
|
36 |
OCR Model: This project uses the GOT-OCR 2.0 model from Hugging Face.
|
37 |
Frontend: The frontend is built using Gradio, which provides an easy-to-use web interface.
|
38 |
Keyword Search: Users can search for specific keywords in the extracted text. The search is case-insensitive, and the matching keywords are highlighted using HTML <mark> tags with customizable colors.
|
39 |
|
40 |
+
**Model Description**
|
41 |
The project uses a pre-trained OCR model from Hugging Face:
|
42 |
|
43 |
+
**Model Name: GOT-OCR 2.0**
|
44 |
Architecture: Transformer-based model, fine-tuned for Optical Character Recognition.
|
45 |
Framework: Hugging Face's transformers library.
|
46 |
The model is loaded using the AutoTokenizer and AutoModel classes from Hugging Face and runs on a CUDA-enabled device.
|
47 |
|
48 |
+
**Gradio Web Interface**
|
49 |
The project uses Gradio to create an easy-to-use web interface for interacting with the model. The interface allows users to upload images, extract text, and search for keywords in the extracted text.
|
50 |
|
51 |
+
**Gradio Setup**
|
52 |
Image Upload: The user uploads an image, and the text is extracted using the OCR model.
|
53 |
Keyword Search: Users input a keyword to search within the extracted text.
|
54 |
Highlighting: Keywords found in the text are highlighted with a customizable color using HTML <mark> tags
|