x-lai committed on
Commit b61402a · 1 Parent(s): be3b66a

Release training script

Former-commit-id: 88e5ed3291935fe5dafe719fbd6b6b31a9c68216

Files changed (2)
  1. README.md +2 -4
  2. model/LISA.py +1 -1
README.md CHANGED
@@ -2,7 +2,7 @@
 
  <font size=7><div align='center'><b>LISA</b>: Large <b>L</b>anguage <b>I</b>nstructed <b>S</b>egmentation <b>A</b>ssistant</div></font>
 
- <font size=7><div align='center' > <a href=https://arxiv.org/pdf/2308.00692.pdf>**Paper**</a> | <a href="https://huggingface.co/xinlai">**Models**</a> | **Training** (Coming Soon) | [**Inference**](#inference) | [**Dataset**](#dataset) | <a href="http://103.170.5.190:7860/">**Online Demo**</a></div></font>
+ <font size=7><div align='center' > <a href=https://arxiv.org/pdf/2308.00692.pdf>**Paper**</a> | <a href="https://huggingface.co/xinlai">**Models**</a> | [Training](#training) | [**Inference**](#inference) | [**Dataset**](#dataset) | <a href="http://103.170.5.190:7860/">**Online Demo**</a></div></font>
 
  <!-- <p align="center"> <img src="imgs/teaser.jpg" width="100%"> </p> -->
 
@@ -69,14 +69,12 @@
  <p align="center"> <img src="imgs/fig_overview.jpg" width="100%"> </p>
 
  ## News
+ - [x] [2023.8.9] Training Code Release
  - [x] [2023.8.4] [Online Demo](http://103.170.5.190:7860/) is released!
  - [x] [2023.8.4] [*ReasonSeg* Dataset](https://drive.google.com/drive/folders/125mewyg5Ao6tZ3ZdJ-1-E3n04LGVELqy?usp=sharing) and the [LISA-13B-llama2-v0-explanatory](https://huggingface.co/xinlai/LISA-13B-llama2-v0-explanatory) model are released!
  - [x] [2023.8.3] Inference code and the [LISA-13B-llama2-v0](https://huggingface.co/xinlai/LISA-13B-llama2-v0) model are released. Welcome to check out!
  - [x] [2023.8.2] [Paper](https://arxiv.org/pdf/2308.00692.pdf) is released and GitHub repo is created.
 
- ## TODO
- - [ ] Training Code Release
-
  **LISA: Reasoning Segmentation Via Large Language Model [[Paper](https://arxiv.org/abs/2308.00692)]** <br />
  [Xin Lai](https://scholar.google.com/citations?user=tqNDPA4AAAAJ&hl=zh-CN),
  [Zhuotao Tian](https://scholar.google.com/citations?user=mEjhz-IAAAAJ&hl=en),
model/LISA.py CHANGED
@@ -6,7 +6,7 @@ import torch.nn.functional as F
  from peft import (LoraConfig, get_peft_model)
  from transformers import BitsAndBytesConfig, CLIPVisionModel
 
- from transformers import LlamaForCausalLM, CLIPVisionModel, BitsAndBytesConfig
+ from transformers import CLIPVisionModel, BitsAndBytesConfig
  from .llava.model.llava import LlavaLlamaForCausalLM
  from .segment_anything import build_sam_vit_h
  from utils.utils import (DEFAULT_IM_END_TOKEN, DEFAULT_IM_START_TOKEN,