Spaces:
Paused
Paused
x-lai
commited on
Commit
·
b61402a
1
Parent(s):
be3b66a
Release training script
Browse filesFormer-commit-id: 88e5ed3291935fe5dafe719fbd6b6b31a9c68216
- README.md +2 -4
- model/LISA.py +1 -1
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
|
3 |
<font size=7><div align='center'><b>LISA</b>: Large <b>L</b>anguage <b>I</b>nstructed <b>S</b>egmentation <b>A</b>ssistant</div></font>
|
4 |
|
5 |
-
<font size=7><div align='center' > <a href=https://arxiv.org/pdf/2308.00692.pdf>**Paper**</a> | <a href="https://huggingface.co/xinlai">**Models**</a> |
|
6 |
|
7 |
<!-- <p align="center"> <img src="imgs/teaser.jpg" width="100%"> </p> -->
|
8 |
|
@@ -69,14 +69,12 @@
|
|
69 |
<p align="center"> <img src="imgs/fig_overview.jpg" width="100%"> </p>
|
70 |
|
71 |
## News
|
|
|
72 |
- [x] [2023.8.4] [Online Demo](http://103.170.5.190:7860/) is released!
|
73 |
- [x] [2023.8.4] [*ReasonSeg* Dataset](https://drive.google.com/drive/folders/125mewyg5Ao6tZ3ZdJ-1-E3n04LGVELqy?usp=sharing) and the [LISA-13B-llama2-v0-explanatory](https://huggingface.co/xinlai/LISA-13B-llama2-v0-explanatory) model are released!
|
74 |
- [x] [2023.8.3] Inference code and the [LISA-13B-llama2-v0](https://huggingface.co/xinlai/LISA-13B-llama2-v0) model are released. Welcome to check out!
|
75 |
- [x] [2023.8.2] [Paper](https://arxiv.org/pdf/2308.00692.pdf) is released and GitHub repo is created.
|
76 |
|
77 |
-
## TODO
|
78 |
-
- [ ] Training Code Release
|
79 |
-
|
80 |
**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
|
81 |
[Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
|
82 |
[Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
|
|
|
2 |
|
3 |
<font size=7><div align='center'><b>LISA</b>: Large <b>L</b>anguage <b>I</b>nstructed <b>S</b>egmentation <b>A</b>ssistant</div></font>
|
4 |
|
5 |
+
<font size=7><div align='center' > <a href=https://arxiv.org/pdf/2308.00692.pdf>**Paper**</a> | <a href="https://huggingface.co/xinlai">**Models**</a> | [Training](#training) | [**Inference**](#inference) | [**Dataset**](#dataset) | <a href="http://103.170.5.190:7860/">**Online Demo**</a></div></font>
|
6 |
|
7 |
<!-- <p align="center"> <img src="imgs/teaser.jpg" width="100%"> </p> -->
|
8 |
|
|
|
69 |
<p align="center"> <img src="imgs/fig_overview.jpg" width="100%"> </p>
|
70 |
|
71 |
## News
|
72 |
+
- [x] [2023.8.9] Training Code Release
|
73 |
- [x] [2023.8.4] [Online Demo](http://103.170.5.190:7860/) is released!
|
74 |
- [x] [2023.8.4] [*ReasonSeg* Dataset](https://drive.google.com/drive/folders/125mewyg5Ao6tZ3ZdJ-1-E3n04LGVELqy?usp=sharing) and the [LISA-13B-llama2-v0-explanatory](https://huggingface.co/xinlai/LISA-13B-llama2-v0-explanatory) model are released!
|
75 |
- [x] [2023.8.3] Inference code and the [LISA-13B-llama2-v0](https://huggingface.co/xinlai/LISA-13B-llama2-v0) model are released. Welcome to check out!
|
76 |
- [x] [2023.8.2] [Paper](https://arxiv.org/pdf/2308.00692.pdf) is released and GitHub repo is created.
|
77 |
|
|
|
|
|
|
|
78 |
**LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
|
79 |
[Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
|
80 |
[Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
|
model/LISA.py
CHANGED
@@ -6,7 +6,7 @@ import torch.nn.functional as F
|
|
6 |
from peft import (LoraConfig, get_peft_model)
|
7 |
from transformers import BitsAndBytesConfig, CLIPVisionModel
|
8 |
|
9 |
-
from transformers import
|
10 |
from .llava.model.llava import LlavaLlamaForCausalLM
|
11 |
from .segment_anything import build_sam_vit_h
|
12 |
from utils.utils import (DEFAULT_IM_END_TOKEN, DEFAULT_IM_START_TOKEN,
|
|
|
6 |
from peft import (LoraConfig, get_peft_model)
|
7 |
from transformers import BitsAndBytesConfig, CLIPVisionModel
|
8 |
|
9 |
+
from transformers import CLIPVisionModel, BitsAndBytesConfig
|
10 |
from .llava.model.llava import LlavaLlamaForCausalLM
|
11 |
from .segment_anything import build_sam_vit_h
|
12 |
from utils.utils import (DEFAULT_IM_END_TOKEN, DEFAULT_IM_START_TOKEN,
|