This is a preview version of Q-Instruct LLaVA. The weights are not finalized.

@misc{wu2023qinstruct,
      title={Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models}, 
      author={Haoning Wu and Zicheng Zhang and Erli Zhang and Chaofeng Chen and Liang Liao and Annan Wang and Kaixin Xu and Chunyi Li and Jingwen Hou and Guangtao Zhai and Geng Xue and Wenxiu Sun and Qiong Yan and Weisi Lin},
      year={2023},
      eprint={2311.06783},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

teowu/llava_v1.5_7b_qinstruct_preview_v0.1
