niuchl helloxm committed on
Commit 145af7c · verified · 1 Parent(s): ced2712

Update performance dashboard (#2)


- Update performance dashboard (bd55be6d4447db6dcf0db8f2eeed73af443c1749)


Co-authored-by: Xiaoming Hu <[email protected]>

Files changed (1)
  1. README.md +22 -4
README.md CHANGED
@@ -22,7 +22,25 @@ Please follow the [instructions](https://github.com/google-ai-edge/mediapipe-sam
 
  Note that all benchmark stats are from a Samsung S24 Ultra.
 
- | Model | Params | | GGML tk/s (CPU, 4 threads) | GGML tk/s (CPU, 8 threads) | LiteRT tk/s (XNNPACK, 4 threads) | LiteRT tk/s (XNNPACK, 8 threads) |
- | ---------------------------------------------- | ------ | ------------------ | -------------------------- | -------------------------- | -------------------------------- | -------------------------------- |
- | DeepSeek-R1-Distill-Qwen-1.5B (Int8 quantized) | 1.78 B | Prefill 512 tokens | 64.66 | 87.18 | 260.95 | 299.15 |
- | | | Decode 128 tokens | 23.85 | 15.37 | 23.126 | 10.486 |
+ <table border="1">
+ <tr>
+ <th>Model</th>
+ <td colspan="2">DeepSeek-R1-Distill-Qwen-1.5B (Int8 quantized)</td>
+ </tr>
+ <tr>
+ <th>Params</th>
+ <td colspan="2">1.78 B</td>
+ </tr>
+ <tr>
+ <th></th>
+ <td><b>Prefill 512 tokens</b></td><td><b>Decode 128 tokens</b></td>
+ </tr>
+ <tr>
+ <th>LiteRT tk/s (XNNPACK, 4 threads)</th>
+ <td>260.95</td><td>23.126</td>
+ </tr>
+ <tr>
+ <th>GGML tk/s (CPU, 4 threads)</th>
+ <td>64.66</td><td>23.85</td>
+ </tr>
+ </table>