miguelcarv
/

resnet-152-text-detector

Image Classification

Model card Files Files and versions Community

miguelcarv commited on Jan 20, 2024

Commit

010be03

·

verified ·

1 Parent(s): a1a8cc6

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 # Model Card for ResNet-152 Text Detector
-This model was trained with the intent to quickly classify whether or not an image contains legible text or not. It was trained as a binary classification problem on the COCO-Text dataset together with some images from LLaVAR. This came out to a total of ~70k images, where 50% of them had text and 50% of them had no legible text.
 # Model Details
 ## How to Get Started with the Model
@@ -16,7 +16,7 @@ model = AutoModelForImageClassification.from_pretrained(
 processor = AutoImageProcessor.from_pretrained("microsoft/resnet-50", do_resize=False)
 url = "http://images.cocodataset.org/train2017/000000044520.jpg"
-image = Image.open(requests.get(url, stream=True).raw).convert('RGB').resize((256,256))
 inputs = processor(image, return_tensors="pt").pixel_values

 # Model Card for ResNet-152 Text Detector
+This model was trained with the intent to quickly classify whether or not an image contains legible text or not. It was trained as a binary classification problem on the COCO-Text dataset together with some images from LLaVAR. This came out to a total of ~140k images, where 50% of them had text and 50% of them had no legible text.
 # Model Details
 ## How to Get Started with the Model
 processor = AutoImageProcessor.from_pretrained("microsoft/resnet-50", do_resize=False)
 url = "http://images.cocodataset.org/train2017/000000044520.jpg"
+image = Image.open(requests.get(url, stream=True).raw).convert('RGB').resize((300,300))
 inputs = processor(image, return_tensors="pt").pixel_values