qaihm-bot commited on
Commit
4cab813
·
verified ·
1 Parent(s): b529f53

See https://github.com/quic/ai-hub-models/releases/v0.31.0 for changelog.

Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -42,9 +42,9 @@ This model is an implementation of Phi-3.5-mini-instruct found [here](https://hu
42
 
43
  | Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
44
  |---|---|---|---|---|---|
45
- | Phi-3.5-Mini-Instruct | w4a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN | 13.01 | 0.1469056 - 4.7009792 | -- | Use Export Script |
46
- | Phi-3.5-Mini-Instruct | w4a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 6.2 | 0.185833 - 5.946656 | -- | Use Export Script |
47
- | Phi-3.5-Mini-Instruct | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN | 14.73 | 0.1195948 - 3.8270336 | -- | Use Export Script |
48
 
49
  ## Deploying Phi-3.5-mini-instruct on-device
50
 
 
42
 
43
  | Model | Precision | Device | Chipset | Target Runtime | Response Rate (tokens per second) | Time To First Token (range, seconds)
44
  |---|---|---|---|---|---|
45
+ | Phi-3.5-Mini-Instruct | w4a16 | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_CONTEXT_BINARY | 13.01 | 0.1469056 - 4.7009792 | -- | Use Export Script |
46
+ | Phi-3.5-Mini-Instruct | w4a16 | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_CONTEXT_BINARY | 6.2 | 0.185833 - 5.946656 | -- | Use Export Script |
47
+ | Phi-3.5-Mini-Instruct | w4a16 | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_CONTEXT_BINARY | 14.73 | 0.1195948 - 3.8270336 | -- | Use Export Script |
48
 
49
  ## Deploying Phi-3.5-mini-instruct on-device
50