shreyajn commited on
Commit
1614c79
·
verified ·
1 Parent(s): 653916a

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +34 -48
README.md CHANGED
@@ -36,38 +36,38 @@ More details on model performance across various devices, can be found
36
 
37
  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  |---|---|---|---|---|---|---|---|---|
39
- | ResNet101Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 1.154 ms | 0 - 185 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
40
- | ResNet101Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 1.255 ms | 0 - 2 MB | INT8 | NPU | [ResNet101Quantized.so](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.so) |
41
- | ResNet101Quantized | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 40.919 ms | 0 - 208 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.onnx) |
42
- | ResNet101Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 0.885 ms | 0 - 90 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
43
- | ResNet101Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 0.971 ms | 0 - 21 MB | INT8 | NPU | [ResNet101Quantized.so](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.so) |
44
- | ResNet101Quantized | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 32.577 ms | 10 - 629 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.onnx) |
45
- | ResNet101Quantized | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 0.798 ms | 0 - 49 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
46
- | ResNet101Quantized | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 0.893 ms | 0 - 45 MB | INT8 | NPU | Use Export Script |
47
- | ResNet101Quantized | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 28.348 ms | 12 - 596 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.onnx) |
48
- | ResNet101Quantized | SA7255P ADP | SA7255P | TFLITE | 16.139 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
49
- | ResNet101Quantized | SA7255P ADP | SA7255P | QNN | 16.606 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
50
- | ResNet101Quantized | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 1.148 ms | 0 - 193 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
51
- | ResNet101Quantized | SA8255 (Proxy) | SA8255P Proxy | QNN | 1.274 ms | 0 - 2 MB | INT8 | NPU | Use Export Script |
52
- | ResNet101Quantized | SA8295P ADP | SA8295P | TFLITE | 1.822 ms | 0 - 45 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
53
- | ResNet101Quantized | SA8295P ADP | SA8295P | QNN | 1.896 ms | 0 - 18 MB | INT8 | NPU | Use Export Script |
54
- | ResNet101Quantized | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 1.153 ms | 0 - 186 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
55
- | ResNet101Quantized | SA8650 (Proxy) | SA8650P Proxy | QNN | 1.272 ms | 0 - 2 MB | INT8 | NPU | Use Export Script |
56
- | ResNet101Quantized | SA8775P ADP | SA8775P | TFLITE | 1.436 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
57
- | ResNet101Quantized | SA8775P ADP | SA8775P | QNN | 1.583 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
58
- | ResNet101Quantized | RB3 Gen 2 (Proxy) | QCS6490 Proxy | TFLITE | 4.482 ms | 0 - 76 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
59
- | ResNet101Quantized | RB3 Gen 2 (Proxy) | QCS6490 Proxy | QNN | 6.281 ms | 0 - 14 MB | INT8 | NPU | Use Export Script |
60
- | ResNet101Quantized | RB5 (Proxy) | QCS8250 Proxy | TFLITE | 17.539 ms | 0 - 2 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
61
- | ResNet101Quantized | QCS8275 (Proxy) | QCS8275 Proxy | TFLITE | 16.139 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
62
- | ResNet101Quantized | QCS8275 (Proxy) | QCS8275 Proxy | QNN | 16.606 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
63
- | ResNet101Quantized | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 1.146 ms | 0 - 188 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
64
- | ResNet101Quantized | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 1.264 ms | 0 - 3 MB | INT8 | NPU | Use Export Script |
65
- | ResNet101Quantized | QCS9075 (Proxy) | QCS9075 Proxy | TFLITE | 1.436 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
66
- | ResNet101Quantized | QCS9075 (Proxy) | QCS9075 Proxy | QNN | 1.583 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
67
- | ResNet101Quantized | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 1.56 ms | 0 - 90 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.tflite) |
68
- | ResNet101Quantized | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 1.641 ms | 0 - 80 MB | INT8 | NPU | Use Export Script |
69
- | ResNet101Quantized | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 1.24 ms | 0 - 0 MB | INT8 | NPU | Use Export Script |
70
- | ResNet101Quantized | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 38.486 ms | 54 - 54 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101Quantized.onnx) |
71
 
72
 
73
 
@@ -128,11 +128,11 @@ python -m qai_hub_models.models.resnet101_quantized.export
128
  ```
129
  Profiling Results
130
  ------------------------------------------------------------
131
- ResNet101Quantized
132
  Device : Samsung Galaxy S23 (13)
133
  Runtime : TFLITE
134
  Estimated inference time (ms) : 1.2
135
- Estimated peak memory usage (MB): [0, 185]
136
  Total # Ops : 150
137
  Compute Unit(s) : NPU (150 ops)
138
  ```
@@ -216,20 +216,6 @@ AI Hub. [Sign up for access](https://myaccount.qualcomm.com/signup).
216
 
217
 
218
 
219
- ## Run demo on a cloud-hosted device
220
-
221
- You can also run the demo on-device.
222
-
223
- ```bash
224
- python -m qai_hub_models.models.resnet101_quantized.demo --on-device
225
- ```
226
-
227
- **NOTE**: If you want running in a Jupyter Notebook or Google Colab like
228
- environment, please add the following to your cell (instead of the above).
229
- ```
230
- %run -m qai_hub_models.models.resnet101_quantized.demo -- --on-device
231
- ```
232
-
233
 
234
  ## Deploying compiled model to Android
235
 
 
36
 
37
  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
38
  |---|---|---|---|---|---|---|---|---|
39
+ | ResNet101 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 1.156 ms | 0 - 184 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
40
+ | ResNet101 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 1.274 ms | 0 - 2 MB | INT8 | NPU | [ResNet101Quantized.so](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.so) |
41
+ | ResNet101 | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 37.17 ms | 4 - 219 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.onnx) |
42
+ | ResNet101 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 0.896 ms | 0 - 86 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
43
+ | ResNet101 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 0.975 ms | 0 - 18 MB | INT8 | NPU | [ResNet101Quantized.so](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.so) |
44
+ | ResNet101 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 28.876 ms | 3 - 678 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.onnx) |
45
+ | ResNet101 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 0.796 ms | 0 - 48 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
46
+ | ResNet101 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 0.827 ms | 0 - 45 MB | INT8 | NPU | Use Export Script |
47
+ | ResNet101 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 26.92 ms | 15 - 637 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.onnx) |
48
+ | ResNet101 | SA7255P ADP | SA7255P | TFLITE | 16.106 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
49
+ | ResNet101 | SA7255P ADP | SA7255P | QNN | 16.638 ms | 0 - 9 MB | INT8 | NPU | Use Export Script |
50
+ | ResNet101 | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 1.157 ms | 0 - 186 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
51
+ | ResNet101 | SA8255 (Proxy) | SA8255P Proxy | QNN | 1.276 ms | 0 - 2 MB | INT8 | NPU | Use Export Script |
52
+ | ResNet101 | SA8295P ADP | SA8295P | TFLITE | 1.821 ms | 0 - 45 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
53
+ | ResNet101 | SA8295P ADP | SA8295P | QNN | 1.885 ms | 0 - 18 MB | INT8 | NPU | Use Export Script |
54
+ | ResNet101 | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 1.155 ms | 0 - 188 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
55
+ | ResNet101 | SA8650 (Proxy) | SA8650P Proxy | QNN | 1.271 ms | 0 - 2 MB | INT8 | NPU | Use Export Script |
56
+ | ResNet101 | SA8775P ADP | SA8775P | TFLITE | 1.435 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
57
+ | ResNet101 | SA8775P ADP | SA8775P | QNN | 1.573 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
58
+ | ResNet101 | RB3 Gen 2 (Proxy) | QCS6490 Proxy | TFLITE | 4.456 ms | 0 - 74 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
59
+ | ResNet101 | RB3 Gen 2 (Proxy) | QCS6490 Proxy | QNN | 6.286 ms | 0 - 14 MB | INT8 | NPU | Use Export Script |
60
+ | ResNet101 | RB5 (Proxy) | QCS8250 Proxy | TFLITE | 17.525 ms | 0 - 2 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
61
+ | ResNet101 | QCS8275 (Proxy) | QCS8275 Proxy | TFLITE | 16.106 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
62
+ | ResNet101 | QCS8275 (Proxy) | QCS8275 Proxy | QNN | 16.638 ms | 0 - 9 MB | INT8 | NPU | Use Export Script |
63
+ | ResNet101 | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 1.153 ms | 0 - 187 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
64
+ | ResNet101 | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 1.267 ms | 0 - 2 MB | INT8 | NPU | Use Export Script |
65
+ | ResNet101 | QCS9075 (Proxy) | QCS9075 Proxy | TFLITE | 1.435 ms | 0 - 43 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
66
+ | ResNet101 | QCS9075 (Proxy) | QCS9075 Proxy | QNN | 1.573 ms | 0 - 10 MB | INT8 | NPU | Use Export Script |
67
+ | ResNet101 | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 1.503 ms | 0 - 91 MB | INT8 | NPU | [ResNet101Quantized.tflite](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.tflite) |
68
+ | ResNet101 | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 1.601 ms | 0 - 79 MB | INT8 | NPU | Use Export Script |
69
+ | ResNet101 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 1.236 ms | 0 - 0 MB | INT8 | NPU | Use Export Script |
70
+ | ResNet101 | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 37.433 ms | 53 - 53 MB | INT8 | NPU | [ResNet101Quantized.onnx](https://huggingface.co/qualcomm/ResNet101Quantized/blob/main/ResNet101.onnx) |
71
 
72
 
73
 
 
128
  ```
129
  Profiling Results
130
  ------------------------------------------------------------
131
+ ResNet101
132
  Device : Samsung Galaxy S23 (13)
133
  Runtime : TFLITE
134
  Estimated inference time (ms) : 1.2
135
+ Estimated peak memory usage (MB): [0, 184]
136
  Total # Ops : 150
137
  Compute Unit(s) : NPU (150 ops)
138
  ```
 
216
 
217
 
218
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
219
 
220
  ## Deploying compiled model to Android
221