⚡ WebGPU Benchmark Results (34.37x speedup)
#11, opened by osanseviero
| Batch Size | WASM (ms) | WebGPU (ms) |
| --- | --- | --- |
| 1 | 460.00 | 10.20 |
| 2 | 931.40 | 76.10 |
| 4 | 1863.90 | 226.00 |
| 8 | 3714.10 | 227.60 |
| 16 | 7516.40 | 298.80 |
| 32 | 15190.60 | 442.00 |
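
The 34.37x figure in the title matches the batch size 32 row (15190.60 ms / 442.00 ms). Below is a minimal Python sketch that recomputes the WASM-to-WebGPU ratio for every row, using only the timings from the table above. The names `timings_ms` and `speedups` are illustrative and are not taken from the original benchmark script.

```python
# Timings copied from the table above: batch size -> (WASM ms, WebGPU ms).
timings_ms = {
    1: (460.00, 10.20),
    2: (931.40, 76.10),
    4: (1863.90, 226.00),
    8: (3714.10, 227.60),
    16: (7516.40, 298.80),
    32: (15190.60, 442.00),
}

# Speedup is defined here as WASM time divided by WebGPU time.
speedups = {
    batch_size: wasm_ms / webgpu_ms
    for batch_size, (wasm_ms, webgpu_ms) in timings_ms.items()
}

for batch_size, speedup in speedups.items():
    print(f"batch {batch_size:>2}: {speedup:.2f}x")
```

The ratio is not constant across rows, so the headline number describes batch size 32 only and should not be read as an average over all batch sizes.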
- Model: Xenova/all-MiniLM-L6-v2
- Quantized: false
- Sequence length: 512
- Browser: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/124.0.0.0 Safari/537.36
- GPU: vendor=nvidia, architecture=ampere, device=, description=