Update README.md
README.md (CHANGED)

````diff
@@ -9,9 +9,11 @@ tags:
 
 
 
-Ivy\-VL is a lightweight multimodal model with only 3B parameters. It accepts both image and text inputs to generate text outputs.
+[Ivy\-VL] is a lightweight multimodal model with only 3B parameters. It accepts both image and text inputs to generate text outputs.
 
-Thanks to its lightweight design, it can be deployed on edge devices such as AI glasses and smartphones, offering low memory usage and high speed while maintaining strong performance on multimodal tasks.
+Thanks to its lightweight design, it can be deployed on edge devices such as AI glasses and smartphones, offering low memory usage and high speed while maintaining strong performance on multimodal tasks.
+
+The model is built upon the [Qwen/Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct) language model, with [google/siglip-so400m-patch14-384](https://huggingface.co/google/siglip-so400m-patch14-384) serving as the vision encoder.
 
 # Model Summary:
 
@@ -103,7 +105,7 @@ print(text_outputs)
 
 ```plaintext
 @misc{ivy2024ivy-vl,
-title={
+title={Ivy-VL:Compact Vision-Language Models Achieving SOTA with Optimal Data},
 url={https://huggingface.co/AI-Safeguard/Ivy-VL},
 author={Ivy Zhang,Jenny N,Theresa Yu and David Qiu},
 month={December},
````
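For readers skimming the diff, the revised description boils down to: image plus text in, text out, with Qwen2.5-3B-Instruct as the language backbone and SigLIP as the vision encoder. A minimal inference sketch is below. It is an illustration under stated assumptions, not the model card's official usage code (which ends in `print(text_outputs)` per the second hunk's context): loading through `transformers` with `trust_remote_code=True`, the placeholder image URL, and the prompt are all hypothetical.

```python
# Hypothetical sketch of Ivy-VL inference (image + text -> text), assuming the
# checkpoint exposes a transformers-compatible processor and model via remote
# code. The model card's own usage section is authoritative; this only
# illustrates the input/output contract described in the README diff above.
import requests
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "AI-Safeguard/Ivy-VL"  # repo id taken from the citation URL in the diff
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

# Any RGB image works; this URL is a placeholder from the HF documentation assets.
url = "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/pipeline-cat-chonk.jpeg"
image = Image.open(requests.get(url, stream=True).raw)

# Combine the text prompt and image into model inputs, then generate text.
inputs = processor(text="Describe this image.", images=image, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```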