Tags: Text Generation, Transformers, PyTorch, English, llava, Inference Endpoints

This is a preview version of Q-Instruct LLaVA. The weights are not finalized.

@misc{wu2023qinstruct,
      title={Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models}, 
      author={Haoning Wu and Zicheng Zhang and Erli Zhang and Chaofeng Chen and Liang Liao and Annan Wang and Kaixin Xu and Chunyi Li and Jingwen Hou and Guangtao Zhai and Geng Xue and Wenxiu Sun and Qiong Yan and Weisi Lin},
      year={2023},
      eprint={2311.06783},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Datasets used to train teowu/llava_v1.5_7b_qinstruct_preview_v0.1