Spaces:

aletrn
/

lisa-on-cuda

Paused

Yukang commited on Aug 1, 2023

Commit

7f58721

1 Parent(s): c75dc84

Update README.md

Former-commit-id: e4f35a6c9aec643e3e5d54affca5889b99419f18

Files changed (1) hide show

README.md CHANGED Viewed

@@ -3,6 +3,8 @@
 This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
 For more details, please refer to:
 **LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
 [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
 [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
@@ -12,9 +14,10 @@ For more details, please refer to:
 [Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
 [Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
-<p align="center"> <img src="docs/VoxelNeXt-Pipeline.png" width="100%"> </p>
 ### Experimental results
 ## Citation

 This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
 For more details, please refer to:
+<p align="center"> <img src="img/fig_teaser4_crop.png" width="100%"> </p>
 **LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
 [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
 [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
 [Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
 [Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
+<p align="center"> <img src="img/fig_overview_v6_crop.png" width="100%"> </p>
 ### Experimental results
+<p align="center"> <img src="img/Table1.png" width="100%"> </p>
 ## Citation