madhurjindal commited on
Commit
cb436ee
·
1 Parent(s): cf90d3d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +81 -1
README.md CHANGED
@@ -10,4 +10,84 @@ pinned: false
10
  license: apache-2.0
11
  ---
12
 
13
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  license: apache-2.0
11
  ---
12
 
13
+ # YOLOv3 Object Detection Explorer
14
+
15
+ Welcome to the YOLOv3 Object Detection Explorer! 🕵️‍♀️🔍
16
+
17
+ Uncover the magic of object detection through our interactive Gradio app, powered by the cutting-edge YOLOv3 model. With a focus on flexibility and precision, this app lets you explore the world of computer vision like never before.
18
+
19
+ ## Key Features
20
+
21
+ - **YOLOv3 at Your Fingertips**: Immerse yourself in the realm of object detection with our YOLOv3 model. Crafted meticulously from scratch using the Pascal VOC dataset's 20 distinct classes, our model guarantees accurate and robust object detection.
22
+
23
+ - **Insights with GradCAM**: Experience the brilliance of GradCAM (Gradient-weighted Class Activation Mapping), a revolutionary technique that peers into the inner workings of the model. By harnessing gradients, it uncovers the precise regions within an image that heavily influence the classification score, offering a deeper understanding of the model's decisions.
24
+
25
+ - **Tailored Detection Streams**: With three distinct output streams, each providing sizes of 13x13, 26x26, and 52x52, you can adapt your approach based on the size of the objects in focus. Opt for smaller outputs to capture larger objects or opt for larger outputs for finer details.
26
+
27
+ ## How It Works
28
+
29
+ 1. Upload an image you'd like to analyze for object detection.
30
+ 2. Select the output stream that aligns with your object detection needs.
31
+ 3. Watch as our YOLOv3 model efficiently identifies and annotates objects within the image.
32
+ 4. Dive deeper into the process by exploring the GradCAM visualization, shedding light on the pivotal regions driving the model's classification.
33
+
34
+ ## Explore and Learn
35
+
36
+ Visit the "Examples" tab to embark on a visual journey. Explore pre-loaded images of varying complexity to witness YOLOv3's prowess firsthand. Delve into the GradCAM outputs to gain insights into how different output streams impact the model's attention.
37
+
38
+ ## Get Started
39
+
40
+ Ready to unveil the mysteries of object detection and GradCAM? Launch the YOLOv3 Object Detection Explorer and unravel the captivating world of computer vision today!
41
+
42
+ >Please refer to the training repo - https://github.com/Madhur-1/ERA-v1/edit/master/S13 for more details on the training.
43
+
44
+ ## Model Structure
45
+
46
+ ![image](https://github.com/Madhur-1/ERA-v1/assets/64495917/367fbb1a-c284-4c8a-84f4-629d4a64e025)
47
+
48
+ ## PASCAL VOC Dataset
49
+ The Pascal VOC (Visual Object Classes) dataset is a widely used benchmark in computer vision. It consists of 20 classes covering a wide range of objects commonly found in everyday scenes. The dataset is a valuable resource for training and evaluating object detection models like YOLOv3.
50
+
51
+ #### Classes in the Pascal VOC Dataset
52
+ 1. Aeroplane
53
+ 2. Bicycle
54
+ 3. Bird
55
+ 4. Boat
56
+ 5. Bottle
57
+ 6. Bus
58
+ 7. Car
59
+ 8. Cat
60
+ 9. Chair
61
+ 10. Cow
62
+ 11. Dining Table
63
+ 12. Dog
64
+ 13. Horse
65
+ 14. Motorbike
66
+ 15. Person
67
+ 16. Potted Plant
68
+ 17. Sheep
69
+ 18. Sofa
70
+ 19. Train
71
+ 20. TV/Monitor
72
+
73
+ ![Alt text](Store/image.png)
74
+
75
+ ## Data Exploration
76
+
77
+ <img width="451" alt="image" src="https://github.com/Madhur-1/ERA-v1/assets/64495917/865607f0-1640-4adb-aa28-79ef90e833c7">
78
+ <img width="437" alt="image" src="https://github.com/Madhur-1/ERA-v1/assets/64495917/662c852b-d821-4e44-b809-68b6fc797318">
79
+
80
+ ## Model Metrics
81
+ | Class Acc | No Obj Acc | Obj Acc | MAP | Train Loss | Test Loss |
82
+ |-----------|------------|------------|-----------|------------|-----------|
83
+ | 88.99 | 98.19 | 77.58 | 0.43 | 3.19 | 2.73 |
84
+
85
+ Note: The above loss values use lambda_class = 1, lambda_noobj = 5, lambda_obj = 1, lambda_box = 5.
86
+
87
+ ## Grad-CAM
88
+ Note: The following has been taken from https://towardsdatascience.com/understand-your-algorithm-with-grad-cam-d3b62fce353
89
+
90
+ Gradient-weighted Class Activation Mapping (Grad-CAM), uses the gradients of any target concept (say ‘dog’ in a classification network or a sequence of words in captioning network) flowing into the final convolutional layer to produce a coarse localization map highlighting the important regions in the image for predicting the concept.
91
+
92
+ ![Alt text](Store/image-2.png)
93
+ ---