v0.31.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.31.0 for changelog.
README.md
CHANGED
@@ -36,10 +36,10 @@ More details on model performance across various devices, can be found
|
|
36 |
|
37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
38 |
|---|---|---|---|---|---|---|---|---|
|
39 |
-
| EfficientViT-l2-seg | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX |
|
40 |
-
| EfficientViT-l2-seg | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX |
|
41 |
-
| EfficientViT-l2-seg | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX |
|
42 |
-
| EfficientViT-l2-seg | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX |
|
43 |
|
44 |
|
45 |
|
@@ -103,8 +103,8 @@ Profiling Results
|
|
103 |
EfficientViT-l2-seg
|
104 |
Device : cs_8_gen_2 (ANDROID 13)
|
105 |
Runtime : ONNX
|
106 |
-
Estimated inference time (ms) :
|
107 |
-
Estimated peak memory usage (MB): [
|
108 |
Total # Ops : 464
|
109 |
Compute Unit(s) : npu (462 ops) gpu (0 ops) cpu (2 ops)
|
110 |
```
|
|
|
36 |
|
37 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
38 |
|---|---|---|---|---|---|---|---|---|
|
39 |
+
| EfficientViT-l2-seg | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 1900.82 ms | 6 - 304 MB | NPU | [EfficientViT-l2-seg.onnx](https://huggingface.co/qualcomm/EfficientViT-l2-seg/blob/main/EfficientViT-l2-seg.onnx) |
|
40 |
+
| EfficientViT-l2-seg | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 1729.898 ms | 126 - 1004 MB | NPU | [EfficientViT-l2-seg.onnx](https://huggingface.co/qualcomm/EfficientViT-l2-seg/blob/main/EfficientViT-l2-seg.onnx) |
|
41 |
+
| EfficientViT-l2-seg | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 1230.575 ms | 103 - 1058 MB | NPU | [EfficientViT-l2-seg.onnx](https://huggingface.co/qualcomm/EfficientViT-l2-seg/blob/main/EfficientViT-l2-seg.onnx) |
|
42 |
+
| EfficientViT-l2-seg | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 2416.342 ms | 144 - 144 MB | NPU | [EfficientViT-l2-seg.onnx](https://huggingface.co/qualcomm/EfficientViT-l2-seg/blob/main/EfficientViT-l2-seg.onnx) |
|
43 |
|
44 |
|
45 |
|
|
|
103 |
EfficientViT-l2-seg
|
104 |
Device : cs_8_gen_2 (ANDROID 13)
|
105 |
Runtime : ONNX
|
106 |
+
Estimated inference time (ms) : 1900.8
|
107 |
+
Estimated peak memory usage (MB): [6, 304]
|
108 |
Total # Ops : 464
|
109 |
Compute Unit(s) : npu (462 ops) gpu (0 ops) cpu (2 ops)
|
110 |
```
|