Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
## Use Cases | |
Image classification models can be used when we are not interested in specific instances of objects with location information or their shape. | |
### Keyword Classification | |
Image classification models are used widely in stock photography to assign each image a keyword. | |
### Image Search | |
Models trained in image classification can improve user experience by organizing and categorizing photo galleries on the phone or in the cloud, on multiple keywords or tags. | |
## Inference | |
With the `transformers` library, you can use the `image-classification` pipeline to infer with image classification models. You can initialize the pipeline with a model id from the Hub. If you do not provide a model id it will initialize with [google/vit-base-patch16-224](https://huggingface.co/google/vit-base-patch16-224) by default. When calling the pipeline you just need to specify a path, http link or an image loaded in PIL. You can also provide a `top_k` parameter which determines how many results it should return. | |
```python | |
from transformers import pipeline | |
clf = pipeline("image-classification") | |
clf("path_to_a_cat_image") | |
[{'label': 'tabby cat', 'score': 0.731}, | |
... | |
] | |
``` | |
You can use [huggingface.js](https://github.com/huggingface/huggingface.js) to classify images using models on Hugging Face Hub. | |
```javascript | |
import { HfInference } from "@huggingface/inference"; | |
const inference = new HfInference(HF_ACCESS_TOKEN); | |
await inference.imageClassification({ | |
data: await (await fetch("https://picsum.photos/300/300")).blob(), | |
model: "microsoft/resnet-50", | |
}); | |
``` | |
## Useful Resources | |
- [Let's Play Pictionary with Machine Learning!](https://www.youtube.com/watch?v=LS9Y2wDVI0k) | |
- [Fine-Tune ViT for Image Classification with 🤗Transformers](https://huggingface.co/blog/fine-tune-vit) | |
- [Walkthrough of Computer Vision Ecosystem in Hugging Face - CV Study Group](https://www.youtube.com/watch?v=oL-xmufhZM8) | |
- [Computer Vision Study Group: Swin Transformer](https://www.youtube.com/watch?v=Ngikt-K1Ecc) | |
- [Computer Vision Study Group: Masked Autoencoders Paper Walkthrough](https://www.youtube.com/watch?v=Ngikt-K1Ecc) | |
- [Image classification task guide](https://huggingface.co/docs/transformers/tasks/image_classification) | |
### Creating your own image classifier in just a few minutes | |
With [HuggingPics](https://github.com/nateraw/huggingpics), you can fine-tune Vision Transformers for anything using images found on the web. This project downloads images of classes defined by you, trains a model, and pushes it to the Hub. You even get to try out the model directly with a working widget in the browser, ready to be shared with all your friends! | |