Yukang committed on
Commit
7f58721
·
1 Parent(s): c75dc84
Files changed (1)
  1. README.md +4 -1
README.md CHANGED
@@ -3,6 +3,8 @@
  This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
  For more details, please refer to:
 
+ <p align="center"> <img src="img/fig_teaser4_crop.png" width="100%"> </p>
+
  **LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
  [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
  [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
@@ -12,9 +14,10 @@ For more details, please refer to:
  [Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
  [Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
 
- <p align="center"> <img src="docs/VoxelNeXt-Pipeline.png" width="100%"> </p>
+ <p align="center"> <img src="img/fig_overview_v6_crop.png" width="100%"> </p>
 
  ### Experimental results
+ <p align="center"> <img src="img/Table1.png" width="100%"> </p>
 
 
  ## Citation