|
--- |
|
library_name: monai |
|
tags: |
|
- crowd-counting |
|
- cnn |
|
- detection |
|
license: mit |
|
metrics: |
|
- mae |
|
pipeline_tag: object-detection |
|
datasets: |
|
- ShanghaiTechDataset |
|
--- |
|
--- |
|
|
|
### Model Description |
|
A machine learning model for crowd counting |
|
|
|
- **Model type:** image-classifier |
|
- **License:** mit |
|
|
|
## Crowd Counting Model |
|
The aim is to build a model that can estimate the amount of people in a crowd from an image- |
|
|
|
The model was built using **CSRNet** a crowd counting neural network designed by Yuhong Li, Xiaofan Zhang and Deming Chen ([https://github.com/leeyeehoo/CSRNet-pytorch](https://github.com/leeyeehoo/CSRNet-pytorch)) |
|
|
|
### Model Sources |
|
|
|
- **Repository:** [https://github.com/leeyeehoo/CSRNet-pytorch](https://github.com/leeyeehoo/CSRNet-pytorch) |
|
|
|
## Uses |
|
|
|
This model was created in the spirit of creating a model capable of counting the amount of people in a crowd using images. |
|
|
|
### Direct Use |
|
|
|
```bash |
|
model = CSRNet() |
|
checkpoint = torch.load("weights.pth") |
|
model.load_state_dict(checkpoint) |
|
model.predict() |
|
|
|
``` |
|
|
|
## Bias, Risks, and Limitations |
|
|
|
Although the model can be very accurate its not exact, it has a 2%-6% error in the prediction. |
|
|
|
## Training Details |
|
|
|
### Training Data |
|
|
|
The model was trained using the ShanghaiTech Dataset, specifically the Shanghai B Dataset. |
|
|
|
### Training Procedure |
|
|
|
The info on training procedure can be found in this repository [https://github.com/leeyeehoo/CSRNet-pytorch](https://github.com/leeyeehoo/CSRNet-pytorch) |
|
|
|
## Evaluation and Results |
|
|
|
The model reached a MAE of 10.6 |
|
|
|
## Citation |
|
|
|
### Model creation and training |
|
|
|
@inproceedings{li2018csrnet, |
|
title={CSRNet: Dilated convolutional neural networks for understanding the highly congested scenes}, |
|
author={Li, Yuhong and Zhang, Xiaofan and Chen, Deming}, |
|
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition}, |
|
pages={1091--1100}, |
|
year={2018} |
|
} |
|
|
|
### Dataset |
|
|
|
@inproceedings{zhang2016single, |
|
title={Single-image crowd counting via multi-column convolutional neural network}, |
|
author={Zhang, Yingying and Zhou, Desen and Chen, Siqin and Gao, Shenghua and Ma, Yi}, |
|
booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition}, |
|
pages={589--597}, |
|
year={2016} |
|
} |