⚡ WebGPU Benchmark Results (20.95x speedup)

#99
by fantos - opened
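| Batch Size | WASM (int8) | WASM (fp16) | WASM (fp32) | WebGPU (fp16) | WebGPU (fp32) |
|---|---|---|---|---|---|
| 1 | 417.80 | 501.00 | 552.20 | 45.70 | 65.90 |
| 2 | 864.90 | 1331.00 | 1244.30 | 123.40 | 117.50 |
| 4 | 1766.80 | 2569.30 | 2098.90 | 157.10 | 232.30 |
| 8 | 3411.10 | 4781.60 | 4399.30 | 327.10 | 409.90 |
| 16 | 6772.20 | 11739.10 | 10643.90 | 613.70 | 842.50 |
| 32 | 15859.40 | 23870.10 | 18189.10 | 1139.20 | 1247.60 |
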
  • Model: Xenova/all-MiniLM-L6-v2
  • Tests run: WASM (int8), WASM (fp16), WASM (fp32), WebGPU (fp16), WebGPU (fp32)
  • Sequence length: 512
  • Browser: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/129.0.0.0 Safari/537.36
  • GPU: vendor=intel, architecture=gen-12lp, device=, description=
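
For anyone who wants to check where the 20.95x in the title comes from, here is a minimal sketch that recomputes the WASM-to-WebGPU ratios from the numbers posted above. It assumes the headline figure is the largest ratio between any WASM column and any WebGPU column at the same batch size, which the post does not state explicitly. The names `TIMINGS`, `speedups`, and `best_speedup` are illustrative only and are not part of the benchmark tool.

```python
# Timings copied from the results table in this post
# (units are whatever the benchmark reported; the post does not say).
BATCH_SIZES = [1, 2, 4, 8, 16, 32]
TIMINGS = {
    "WASM (int8)": [417.80, 864.90, 1766.80, 3411.10, 6772.20, 15859.40],
    "WASM (fp16)": [501.00, 1331.00, 2569.30, 4781.60, 11739.10, 23870.10],
    "WASM (fp32)": [552.20, 1244.30, 2098.90, 4399.30, 10643.90, 18189.10],
    "WebGPU (fp16)": [45.70, 123.40, 157.10, 327.10, 613.70, 1139.20],
    "WebGPU (fp32)": [65.90, 117.50, 232.30, 409.90, 842.50, 1247.60],
}


def speedups(baseline, candidate):
    """Per-batch-size ratio baseline/candidate (higher means candidate is faster)."""
    return [b / c for b, c in zip(TIMINGS[baseline], TIMINGS[candidate])]


def best_speedup():
    """Largest WASM/WebGPU ratio as (ratio, baseline, candidate, batch_size).

    Assumption: this is how the headline number was derived.
    """
    best = None
    for baseline in (name for name in TIMINGS if name.startswith("WASM")):
        for candidate in (name for name in TIMINGS if name.startswith("WebGPU")):
            for size, ratio in zip(BATCH_SIZES, speedups(baseline, candidate)):
                if best is None or ratio > best[0]:
                    best = (ratio, baseline, candidate, size)
    return best


if __name__ == "__main__":
    ratio, baseline, candidate, size = best_speedup()
    print(f"{ratio:.2f}x: {baseline} vs {candidate} at batch size {size}")
    # prints "20.95x: WASM (fp16) vs WebGPU (fp16) at batch size 32"
```

Under that assumption the headline matches the fp16 comparison at batch size 32. The same `speedups` helper gives the per-batch ratios for any other pair of columns, for example `speedups("WASM (fp32)", "WebGPU (fp32)")`.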
