Hugging Face
Models: 508
Sort: Trending
Active filters: image-to-text, transformers
Model • Task • Last updated • Downloads • Likes
Salesforce/blip-image-captioning-base • Image-to-Text • Updated 9 days ago • 1.69M • 590
Salesforce/blip-image-captioning-large • Image-to-Text • Updated 9 days ago • 897k • 1.27k
microsoft/trocr-base-handwritten • Image-to-Text • Updated about 20 hours ago • 159k • 372
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 58.7k • 191
microsoft/trocr-large-printed • Image-to-Text • Updated May 27, 2024 • 291k • 158
unum-cloud/uform-gen2-qwen-500m • Image-to-Text • Updated Apr 24, 2024 • 25.3k • 76
kazars24/trocr-base-handwritten-ru • Image-to-Text • Updated Oct 27, 2024 • 2.2k • 8
kha-white/manga-ocr-base • Image-to-Text • Updated Jun 22, 2022 • 89.1k • 136
microsoft/trocr-base-printed • Image-to-Text • Updated May 27, 2024 • 85.1k • 161
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 319k • 105
microsoft/trocr-large-stage1 • Image-to-Text • Updated May 27, 2024 • 2.63k • 23
microsoft/trocr-small-handwritten • Image-to-Text • Updated May 27, 2024 • 617k • 43
nlpconnect/vit-gpt2-image-captioning • Image-to-Text • Updated Feb 27, 2023 • 1.84M • 871
naver-clova-ix/donut-base-finetuned-cord-v2 • Image-to-Text • Updated Aug 13, 2022 • 16.6k • 94
jinhybr/OCR-Donut-CORD • Image-to-Text • Updated Nov 5, 2022 • 1.22k • 199
microsoft/git-base-coco • Image-to-Text • Updated Feb 8, 2023 • 66.6k • 20
microsoft/git-large-coco • Image-to-Text • Updated Jun 26, 2023 • 22.9k • 103
nathansutton/generate-cxr • Image-to-Text • Updated Feb 23, 2024 • 358 • 8
ddobokki/ko-trocr • Image-to-Text • Updated Oct 22, 2024 • 712 • 24
google/pix2struct-base • Image-to-Text • Updated Dec 24, 2023 • 3.8k • 67
Flova/omr_transformer • Image-to-Text • Updated Oct 5, 2023 • 495 • 9
purna419/invoice-parser • Image-to-Text • Updated Jul 10, 2023 • 81 • 6
Gregor/mblip-mt0-xl • Image-to-Text • Updated May 7, 2024 • 992 • 14
codedrainer/uae-license-detection • Image-to-Text • Updated Jul 22, 2023 • 60 • 2
uf-aice-lab/BLIP-Math • Image-to-Text • Updated Sep 14, 2023 • 110 • 2
facebook/nougat-base • Image-to-Text • Updated Nov 20, 2023 • 8.6k • 161
microsoft/kosmos-2-patch14-224 • Image-to-Text • Updated Nov 28, 2023 • 301k • 155
Norm/nougat-latex-base • Image-to-Text • Updated Feb 26, 2024 • 5.37k • 76
unum-cloud/uform-gen • Image-to-Text • Updated Dec 31, 2023 • 941 • 43
AdamCodd/donut-receipts-extract • Image-to-Text • Updated Jan 11 • 16 • 33