CountGD / README.md
nikigoli's picture
Update README.md
6b989ad verified
metadata
language:
  - en
library_name: CountGD
license: mit
tags:
  - computer-vision
  - counting
  - grounding-dino
  - model_hub_mixin
  - multi-modal
  - open-vocabulary
  - pytorch_model_hub_mixin
  - transformers

CountGD

A Multi-Modal Open-World Counting Model for counting objects in an image with text and image prompts. For more details, please check out the following links

Sample prediction

Architecture

CountGD Architecture

Citation

@inproceedings{AminiNaieni24,
    author       = "Amini-Naieni, N. and Han, T. and Zisserman, A.",
    title        = "CountGD: Multi-Modal Open-World Counting",
    booktitle    = "NeurIPS",
    year         = "2024",
}