omvishesh commited on
Commit
89decca
·
verified ·
1 Parent(s): cca617a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -12,9 +12,9 @@ short_description: An OCR application integrated with GOT OCR 2.0
12
 
13
  <mark>OCR Model Integration Using Gradio:</mark>
14
 
15
- ####This project integrates a pre-trained OCR (Optical Character Recognition) model with a Gradio-based web interface. Users can upload an image (JPEG format), extract the text using the model, and search for specific keywords in the extracted text. The keywords are highlighted within the displayed results.
16
 
17
- ###dependencies / libraries required:
18
  torch
19
  transformers
20
  gradio
@@ -27,28 +27,28 @@ accelerate
27
 
28
  all these libraries are included in requirements.txt to install them : pip install -r requirements.txt
29
 
30
- ####ALSO this model requires a GPU to run , so make sure you have NVIDIA CUDA or similar technologies.
31
 
32
  The current web page is running on the hugging face space which is using paid GPU that is Nvidia T4 medium.
33
 
34
 
35
- ##Project Overview
36
  OCR Model: This project uses the GOT-OCR 2.0 model from Hugging Face.
37
  Frontend: The frontend is built using Gradio, which provides an easy-to-use web interface.
38
  Keyword Search: Users can search for specific keywords in the extracted text. The search is case-insensitive, and the matching keywords are highlighted using HTML <mark> tags with customizable colors.
39
 
40
- ##Model Description
41
  The project uses a pre-trained OCR model from Hugging Face:
42
 
43
- ####Model Name: GOT-OCR 2.0
44
  Architecture: Transformer-based model, fine-tuned for Optical Character Recognition.
45
  Framework: Hugging Face's transformers library.
46
  The model is loaded using the AutoTokenizer and AutoModel classes from Hugging Face and runs on a CUDA-enabled device.
47
 
48
- ##Gradio Web Interface
49
  The project uses Gradio to create an easy-to-use web interface for interacting with the model. The interface allows users to upload images, extract text, and search for keywords in the extracted text.
50
 
51
- ##Gradio Setup
52
  Image Upload: The user uploads an image, and the text is extracted using the OCR model.
53
  Keyword Search: Users input a keyword to search within the extracted text.
54
  Highlighting: Keywords found in the text are highlighted with a customizable color using HTML <mark> tags
 
12
 
13
  <mark>OCR Model Integration Using Gradio:</mark>
14
 
15
+ **This project integrates a pre-trained OCR (Optical Character Recognition) model with a Gradio-based web interface. Users can upload an image (JPEG format), extract the text using the model, and search for specific keywords in the extracted text. The keywords are highlighted within the displayed results.**
16
 
17
+ **dependencies / libraries required:**
18
  torch
19
  transformers
20
  gradio
 
27
 
28
  all these libraries are included in requirements.txt to install them : pip install -r requirements.txt
29
 
30
+ **ALSO this model requires a GPU to run , so make sure you have NVIDIA CUDA or similar technologies.**
31
 
32
  The current web page is running on the hugging face space which is using paid GPU that is Nvidia T4 medium.
33
 
34
 
35
+ **Project Overview**
36
  OCR Model: This project uses the GOT-OCR 2.0 model from Hugging Face.
37
  Frontend: The frontend is built using Gradio, which provides an easy-to-use web interface.
38
  Keyword Search: Users can search for specific keywords in the extracted text. The search is case-insensitive, and the matching keywords are highlighted using HTML <mark> tags with customizable colors.
39
 
40
+ **Model Description**
41
  The project uses a pre-trained OCR model from Hugging Face:
42
 
43
+ **Model Name: GOT-OCR 2.0**
44
  Architecture: Transformer-based model, fine-tuned for Optical Character Recognition.
45
  Framework: Hugging Face's transformers library.
46
  The model is loaded using the AutoTokenizer and AutoModel classes from Hugging Face and runs on a CUDA-enabled device.
47
 
48
+ **Gradio Web Interface**
49
  The project uses Gradio to create an easy-to-use web interface for interacting with the model. The interface allows users to upload images, extract text, and search for keywords in the extracted text.
50
 
51
+ **Gradio Setup**
52
  Image Upload: The user uploads an image, and the text is extracted using the OCR model.
53
  Keyword Search: Users input a keyword to search within the extracted text.
54
  Highlighting: Keywords found in the text are highlighted with a customizable color using HTML <mark> tags