qualcomm
/

Segment-Anything-Model

@@ -15,7 +15,7 @@ tags:
 Transformer based encoder-decoder where prompts specify what to segment in an image thereby allowing segmentation without the need for additional training. The image encoder generates embeddings and the lightweight decoder operates on the embeddings for point and mask based image segmentation.
-This model is an implementation of Segment-Anything-Model found [here](https://github.com/facebookresearch/segment-anything).
 This repository provides scripts to run Segment-Anything-Model on Qualcomm® devices.
 More details on model performance across various devices, can be found
 [here](https://aihub.qualcomm.com/models/sam).
@@ -30,15 +30,27 @@ More details on model performance across various devices, can be found
   - Number of parameters (SAMDecoder): 5.11M
   - Model size (SAMDecoder): 19.6 MB
-| Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
-| ---|---|---|---|---|---|---|---|
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 29.972 ms | 4 - 12 MB | FP16 | NPU |  [SAMDecoder.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite)
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 11293.293 ms | 38 - 215 MB | FP32 | CPU |  [SAMEncoder.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite)
 ## Installation
@@ -94,23 +106,25 @@ device. This script does the following:
 ```bash
 python -m qai_hub_models.models.sam.export
 ```
 ```
-Profile Job summary of SAMDecoder
---------------------------------------------------
-Device: SA8255 (Proxy) (13)
-Estimated Inference Time: 29.91 ms
-Estimated Peak Memory Range: 3.82-11.20 MB
-Compute Units: NPU (337) | Total (337)
-Profile Job summary of SAMEncoder
---------------------------------------------------
-Device: SA8255 (Proxy) (13)
-Estimated Inference Time: 11339.80 ms
-Estimated Peak Memory Range: 123.86-127.24 MB
-Compute Units: GPU (36),CPU (782) | Total (818)
 ```
@@ -252,15 +266,19 @@ provides instructions on how to use the `.so` shared library  in an Android appl
 Get more details on Segment-Anything-Model's performance across various devices [here](https://aihub.qualcomm.com/models/sam).
 Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
 ## License
-- The license for the original implementation of Segment-Anything-Model can be found
-  [here](https://github.com/facebookresearch/segment-anything/blob/main/LICENSE).
-- The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf)
 ## References
 * [Segment Anything](https://arxiv.org/abs/2304.02643)
 * [Source Model Implementation](https://github.com/facebookresearch/segment-anything)
 ## Community
 * Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
 * For questions or feedback please [reach out to us](mailto:[email protected]).

 Transformer based encoder-decoder where prompts specify what to segment in an image thereby allowing segmentation without the need for additional training. The image encoder generates embeddings and the lightweight decoder operates on the embeddings for point and mask based image segmentation.
+This model is an implementation of Segment-Anything-Model found [here]({source_repo}).
 This repository provides scripts to run Segment-Anything-Model on Qualcomm® devices.
 More details on model performance across various devices, can be found
 [here](https://aihub.qualcomm.com/models/sam).
   - Number of parameters (SAMDecoder): 5.11M
   - Model size (SAMDecoder): 19.6 MB
+| Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
+|---|---|---|---|---|---|---|---|---|
+| SAMDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 29.098 ms | 2 - 20 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 20.232 ms | 2 - 227 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 28.959 ms | 4 - 12 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 29.061 ms | 4 - 25 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 28.99 ms | 4 - 47 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 29.004 ms | 4 - 7 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 32.396 ms | 4 - 222 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 20.466 ms | 2 - 157 MB | FP16 | NPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMDecoder.tflite) |
+| SAMEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 11323.51 ms | 0 - 272 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 8300.484 ms | 123 - 1639 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 10870.158 ms | 124 - 286 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 10178.345 ms | 121 - 124 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 11283.428 ms | 120 - 125 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 10102.843 ms | 121 - 125 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 13526.091 ms | 131 - 1692 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
+| SAMEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 6334.196 ms | 98 - 1573 MB | FP32 | CPU | [Segment-Anything-Model.tflite](https://huggingface.co/qualcomm/Segment-Anything-Model/blob/main/SAMEncoder.tflite) |
 ## Installation
 ```bash
 python -m qai_hub_models.models.sam.export
 ```
 ```
+Profiling Results
+------------------------------------------------------------
+SAMDecoder
+Device                          : Samsung Galaxy S23 (13)
+Runtime                         : TFLITE
+Estimated inference time (ms)   : 29.1
+Estimated peak memory usage (MB): [2, 20]
+Total # Ops                     : 337
+Compute Unit(s)                 : NPU (337 ops)
+------------------------------------------------------------
+SAMEncoder
+Device                          : Samsung Galaxy S23 (13)
+Runtime                         : TFLITE
+Estimated inference time (ms)   : 11323.5
+Estimated peak memory usage (MB): [0, 272]
+Total # Ops                     : 818
+Compute Unit(s)                 : GPU (36 ops) CPU (782 ops)
 ```
 Get more details on Segment-Anything-Model's performance across various devices [here](https://aihub.qualcomm.com/models/sam).
 Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
 ## License
+* The license for the original implementation of Segment-Anything-Model can be found [here](https://github.com/facebookresearch/segment-anything/blob/main/LICENSE).
+* The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf)
 ## References
 * [Segment Anything](https://arxiv.org/abs/2304.02643)
 * [Source Model Implementation](https://github.com/facebookresearch/segment-anything)
 ## Community
 * Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
 * For questions or feedback please [reach out to us](mailto:[email protected]).