Spaces:
Paused
Paused
Update README.md
Browse filesFormer-commit-id: e4f35a6c9aec643e3e5d54affca5889b99419f18
README.md
CHANGED
@@ -3,6 +3,8 @@
|
|
3 |
This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
|
4 |
For more details, please refer to:
|
5 |
|
|
|
|
|
6 |
**LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
|
7 |
[Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
|
8 |
[Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
|
@@ -12,9 +14,10 @@ For more details, please refer to:
|
|
12 |
[Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
|
13 |
[Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
|
14 |
|
15 |
-
<p align="center"> <img src="
|
16 |
|
17 |
### Experimental results
|
|
|
18 |
|
19 |
|
20 |
## Citation
|
|
|
3 |
This is the official implementation of ***LISA (Large-language Instructed Segmentation Assistant)***. In this work, we propose a new segmentation task --- ***reasoning segmentation***. The task is designed to output a segmentation mask given a complex and implicit query text. We establish a benchmark comprising over one thousand image-instruction pairs, incorporating intricate reasoning and world knowledge for evaluation purposes. Finally, we present LISA: Large-language Instructed Segmentation Assistant, which inherits the language generation capabilities of the multi-modal Large Language Model (LLM) while also possessing the ability to produce segmentation masks.
|
4 |
For more details, please refer to:
|
5 |
|
6 |
+
<p align="center"> <img src="img/fig_teaser4_crop.png" width="100%"> </p>
|
7 |
+
|
8 |
**LISA: Reasoning Segmentation Via Large Language Model [[Paper]()]** <br />
|
9 |
[Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
|
10 |
[Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
|
|
|
14 |
[Shu Liu](https://scholar.google.com.hk/citations?user=BUEDUFkAAAAJ&hl=zh-CN)
|
15 |
[Jiaya Jia](https://scholar.google.com/citations?user=XPAkzTEAAAAJ&hl=en)<br />
|
16 |
|
17 |
+
<p align="center"> <img src="img/fig_overview_v6_crop.png" width="100%"> </p>
|
18 |
|
19 |
### Experimental results
|
20 |
+
<p align="center"> <img src="img/Table1.png" width="100%"> </p>
|
21 |
|
22 |
|
23 |
## Citation
|