Commit f43a8c0, committed by x-lai
1 Parent(s): a891f97
Update README.md
Former-commit-id: 3406019b0a1e779e29497c7a4e8a603783a4259f
README.md
CHANGED
@@ -1,6 +1,6 @@
 # LISA: Reasoning Segmentation Via Large Language Model
 
-This is the official implementation of
+This is the official implementation of **LISA** (large **L**anguage **I**nstructed **S**egmentation **A**ssistant).
 
 ## News
 - [x] [2023.8.2] Paper is released and github repo is created.
@@ -10,13 +10,13 @@ This is the official implementation of ***LISA*** (large ***L***anguage ***I***n
 - [ ] ReasonSeg Dataset Release
 - [ ] Codes and models Release
 
-
+**LISA** unlocks the new segmentation capabilities of multi-modal LLM, and can handle cases involving:
 1. complex reasoning;
 2. world knowledge;
 3. explanatory answers;
 4. multi-turn conversation.
 
-
+**LISA** also demonstrates robust zero-shot capability when trained exclusively on reasoning-free datasets. In addition, fine-tuning the model with merely 239 reasoning segmentation image-instruction pairs results in further performance enhancement.
 <p align="center"> <img src="imgs/fig_teaser4_crop.png" width="100%"> </p>
 
 ## Abstract