|
Benchmark with wasmtime:
wasmtime checkasm --test=hevc_idct --benchmark
hevc_idct_4x4_8_c: 20.8 ( 1.00x)
hevc_idct_4x4_8_simd128: 10.2 ( 2.02x)
hevc_idct_4x4_10_c: 21.0 ( 1.00x)
hevc_idct_4x4_10_simd128: 10.2 ( 2.05x)
hevc_idct_8x8_8_c: 198.0 ( 1.00x)
hevc_idct_8x8_8_simd128: 41.5 ( 4.77x)
hevc_idct_8x8_10_c: 197.8 ( 1.00x)
hevc_idct_8x8_10_simd128: 41.8 ( 4.74x)
hevc_idct_16x16_8_c: 1385.2 ( 1.00x)
hevc_idct_16x16_8_simd128: 291.8 ( 4.75x)
hevc_idct_16x16_10_c: 1375.0 ( 1.00x)
hevc_idct_16x16_10_simd128: 281.2 ( 4.89x)
hevc_idct_32x32_8_c: 13479.0 ( 1.00x)
hevc_idct_32x32_8_simd128: 1541.5 ( 8.74x)
hevc_idct_32x32_10_c: 13448.0 ( 1.00x)
hevc_idct_32x32_10_simd128: 1541.5 ( 8.72x)
Decoding test:
1. Without the optimization:
$ wasmtime --wasi threads --dir=/ ./ffmpeg -cpuflags 0 -i 265.mp4 -an -t 120 -f null - -benchmark
frame= 2880 fps= 94 q=-0.0 Lsize=N/A time=00:02:00.00 bitrate=N/A speed=3.92x
2. With the optimization:
$ wasmtime --wasi threads --dir=/ ./ffmpeg -i 265.mp4 -an -t 120 -f null - -benchmark
frame= 2880 fps=110 q=-0.0 Lsize=N/A time=00:02:00.00 bitrate=N/A speed=4.58x
Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>
|