v0.31.0
See https://github.com/quic/ai-hub-models/releases/v0.31.0 for changelog.
README.md
CHANGED
@@ -42,9 +42,9 @@ This model is an implementation of Phi-3.5-mini-instruct found [here](https://hu
 
 | Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
 |---|---|---|---|---|---|
-| Phi-3.5-Mini-Instruct | w4a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile |
-| Phi-3.5-Mini-Instruct | w4a16 | Snapdragon X Elite CRD | Snapdragon® X Elite |
-| Phi-3.5-Mini-Instruct | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile |
+| Phi-3.5-Mini-Instruct | w4a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 13.01 | 0.1469056 - 4.7009792 | -- | Use Export Script |
+| Phi-3.5-Mini-Instruct | w4a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 6.2 | 0.185833 - 5.946656 | -- | Use Export Script |
+| Phi-3.5-Mini-Instruct | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_CONTEXT_BINARY | 14.73 | 0.1195948 - 3.8270336 | -- | Use Export Script |
 
 ## Deploying Phi-3.5-mini-instruct on-device
 