STMicroelectronics
/

miniresnetv2

Audio Classification

Model card Files Files and versions

xet

Community

FBAGSTM commited on Aug 1

Commit

14031a1

verified ·

1 Parent(s): dd6ede8

Update Readme ST Model Zoo

Browse files

Files changed (1) hide show

README.md +7 -11

README.md CHANGED Viewed

@@ -1,7 +1,3 @@
----
-license: apache-2.0
-pipeline_tag: audio-classification
----
 # Quantized miniresnetv2
 ## **Use case** : `AED`
@@ -57,8 +53,9 @@ It outputs embedding vectors of size 2048 for the 2 stacks version, and 3548 for
 ## Metrics
-Measures are done with default STM32Cube.AI configuration with enabled input / output allocated option.
 ### Reference MCU memory footprint based on ESC-10 dataset
@@ -66,8 +63,8 @@ Measures are done with default STM32Cube.AI configuration with enabled input / o
 | Model             | Format | Resolution | Series  | Activation RAM (KiB) | Runtime RAM (KiB) | Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB)  | Total Flash (kB) | STM32Cube.AI version  |
 |-------------------|--------|------------|---------|----------------|-------------|---------------|------------|-------------|-------------|-----------------------|
-| [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A    | 59.89               |   7.09       |   123.98            |   61.57      | 66.98 | 185.55 | 10.0.0                 |
-| [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A    | 59.89                |   11.28        |   431.98            |   69.86      | 71.17 | 501.84 | 10.0.0                 |
 ### Reference inference time based on ESC-10 dataset
@@ -75,8 +72,8 @@ Measures are done with default STM32Cube.AI configuration with enabled input / o
 | Model             | Format | Resolution | Board            | Execution Engine |  Frequency   | Inference time (ms) | STM32Cube.AI version  |
 |-------------------|--------|------------|------------------|------------------|--------------|-------|-----------------------|
-| [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A |  1 CPU | 160 | 188.36 | 10.0.0                |
-| [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A |  1 CPU | 160 | 308.81 | 10.0.0                 |
 ### Accuracy with ESC-10 dataset
@@ -96,5 +93,4 @@ The reason this metric is used instead of patch-level accuracy is because patch-
 ## Retraining and Integration in a simple example:
-Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)

 # Quantized miniresnetv2
 ## **Use case** : `AED`
 ## Metrics
+* Measures are done with default STM32Cube.AI configuration with enabled input / output allocated option.
+* `tl` stands for "transfer learning", meaning that the model backbone weights were initialized from a pre-trained model, then only the last layer was unfrozen during the training.
 ### Reference MCU memory footprint based on ESC-10 dataset
 | Model             | Format | Resolution | Series  | Activation RAM (KiB) | Runtime RAM (KiB) | Weights Flash (KiB) | Code Flash (KiB) | Total RAM (KiB)  | Total Flash (kB) | STM32Cube.AI version  |
 |-------------------|--------|------------|---------|----------------|-------------|---------------|------------|-------------|-------------|-----------------------|
+| [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A    | 59.89               |   7.09       |   	123.98           |   60.56	9      | 66.98| 184.54 | 10.2.0                 |
+| [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A    | 59.89                |   	11.29        |   431.98           |   68.85      | 71.18 | 	500.83 | 10.2.0                 |
 ### Reference inference time based on ESC-10 dataset
 | Model             | Format | Resolution | Board            | Execution Engine |  Frequency   | Inference time (ms) | STM32Cube.AI version  |
 |-------------------|--------|------------|------------------|------------------|--------------|-------|-----------------------|
+| [miniresnet v2 1stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_1stacks_64x50_tl/miniresnetv2_1stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A |  1 CPU | 160 | 	187.21 | 10.2.0                |
+| [miniresnet v2 2stacks ](https://github.com/STMicroelectronics/stm32ai-modelzoo/tree/main/audio_event_detection/miniresnetv2/ST_pretrainedmodel_public_dataset/esc10/miniresnetv2_2stacks_64x50_tl/miniresnetv2_2stacks_64x50_tl_int8.tflite) | int8 | 64x50x1 | B-U585I-IOT02A |  1 CPU | 160 | 307.22 | 10.2.0                 |
 ### Accuracy with ESC-10 dataset
 ## Retraining and Integration in a simple example:
+Please refer to the stm32ai-modelzoo-services GitHub [here](https://github.com/STMicroelectronics/stm32ai-modelzoo-services)