Edit Models filters

Tasks

Text Generation

Image-Text-to-Text

Parameters

Libraries

Transformers.js

Apps

Inference Providers

Models

4,889

Full-text search

Active filters: image-text-to-text

internlm/Intern-S1

Image-Text-to-Text • 241B • Updated about 13 hours ago • 1.37k • 124

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated 14 days ago • 294k • 647

kakaocorp/kanana-1.5-v-3b-instruct

Image-Text-to-Text • 4B • Updated 5 days ago • 3.14k • 28

zai-org/GLM-4.1V-9B-Thinking

Image-Text-to-Text • 10B • Updated 20 days ago • 81.8k • • 673

internlm/Intern-S1-FP8

Image-Text-to-Text • 241B • Updated about 16 hours ago • 512 • 25

google/medgemma-4b-it

Image-Text-to-Text • 5B • Updated 19 days ago • 108k • 544

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 1.78M • 742

google/gemma-3n-E4B-it-litert-preview

Image-Text-to-Text • Updated May 26 • 1.4k

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5.7M • • 1.08k

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 687k • • 1.03k

google/gemma-3n-E2B-it-litert-preview

Image-Text-to-Text • Updated May 20 • 508

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20 • 170k • 1.44k

unsloth/GLM-4.1V-9B-Thinking-GGUF

Image-Text-to-Text • 9B • Updated 3 days ago • 3.17k • 15

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 323k • • 1.52k

microsoft/Florence-2-large

Image-Text-to-Text • Updated Dec 8, 2024 • 998k • 1.61k

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated May 16 • 119k • 1.5k

unsloth/gemma-3n-E4B-it-GGUF

Image-Text-to-Text • 7B • Updated 29 days ago • 208k • 142

google/medgemma-27b-it

Image-Text-to-Text • 29B • Updated 19 days ago • 10.8k • 145

google/gemma-3n-E2B-it

Image-Text-to-Text • 5B • Updated 14 days ago • 84.6k • 147

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 83.8k • 330

merve/smol-vision

Image-Text-to-Text • Updated 6 days ago • 92

HuggingFaceTB/SmolVLM-Instruct

Image-Text-to-Text • 2B • Updated Apr 8 • 96.5k • 525

google/gemma-3-12b-it

Image-Text-to-Text • 12B • Updated Mar 21 • 254k • • 462

prithivMLmods/Qwen2.5-VL-7B-Abliterated-Caption-it

Image-Text-to-Text • 8B • Updated 6 days ago • 61 • 8

google/gemma-3-27b-it-qat-q4_0-gguf

Image-Text-to-Text • 27B • Updated Apr 11 • 6.44k • 320

OpenGVLab/InternVL3-78B

Image-Text-to-Text • 78B • Updated May 29 • 284k • 208

nvidia/Eagle2.5-8B

Image-Text-to-Text • 8B • Updated 11 days ago • 10.1k • 17

internlm/Intern-S1-GGUF

Image-Text-to-Text • 6B • Updated about 16 hours ago • 160 • 7

vikhyatk/moondream2

Image-Text-to-Text • 2B • Updated 22 days ago • 573k • 1.23k

meta-llama/Llama-4-Maverick-17B-128E-Instruct

Image-Text-to-Text • 402B • Updated May 22 • 35.8k • • 384