# MaskAdapter

Mask-Adapter

Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation

YongKang Li^1,\*, [Tianheng Cheng](https://scholar.google.com/citations?user=PH8rJHYAAAAJ&hl=zh-CN)^1,\*, [Wenyu Liu](http://eic.hust.edu.cn/professor/liuwenyu)¹, [Xinggang Wang](https://xwcv.github.io/)^1,📧 ¹ Huazhong University of Science and Technology, (\* equal contribution, 📧 corresponding author) [![arxiv paper](https://img.shields.io/badge/arXiv-Paper-red)]() [![checkpoints](https://img.shields.io/badge/HuggingFace-🤗-orange)]() [![🤗 HuggingFace Demo](https://img.shields.io/badge/Mask_Adapter-🤗_HF_Demo-orange)]()

## Highlights * Mask-Adapter is a simple yet remarkably effective method and can be seamlessly integrated into open-vocabulary segmentation methods, e.g., [FC-CLIP](https://github.com/bytedance/fc-clip) and [MAFT-Plus](https://github.com/jiaosiyu1999/MAFT-Plus), to tackle the existing bottlenecks. * Mask-Adapter effectively extends to SAM without training, achieving impressive results across multiple open-vocabulary segmentation benchmarks. ## Updates - [x] Release code - [x] Release weights - [x] Release demo with SAM-2👉 [🤗 Mask-Adapter]() - [ ] Release weights training with addtional data ## Installation Please follow [installation](INSTALL.md). ## Getting Started See [Preparing Datasets for Mask-Adapter](datasets/README.md).Following [FC-CLIP](https://github.com/bytedance/fc-clip) and [MAFT-Plus](https://github.com/jiaosiyu1999/MAFT-Plus) to prepare datasets. See [Getting Started with Mask-Adapter](GETTING_STARTED.md). ## Citing Mask-Adapter If you use Mask-Adapter in your research, please use the following BibTeX entry. ```BibTeX ``` ## Acknowledgement [Mask2Former](https://github.com/facebookresearch/Mask2Former) [ODISE](https://github.com/NVlabs/ODISE) [FC-CLIP](https://github.com/bytedance/fc-clip) [MAFTP](https://github.com/jiaosiyu1999/MAFT-Plus) [SAM](https://github.com/facebookresearch/segment-anything) # MaskAdapter