opengl:vfpbenchlog
差分
このページの2つのバージョン間の差分を表示します。
両方とも前のリビジョン前のリビジョン次のリビジョン | 前のリビジョン次のリビジョン両方とも次のリビジョン | ||
opengl:vfpbenchlog [2019/06/16 01:04] – [Qualcomm Kryo 385 (Cortex-A55) (ARMv8.2A AArch64 arm64) FPU+ASIMD+HALFFP] oga | opengl:vfpbenchlog [2020/08/13 22:49] – [AMD Ryzen 7 1800X (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3] oga | ||
---|---|---|---|
行 12: | 行 12: | ||
^ Device | ^ Device | ||
- | | PC Intel Core i7-6700K | + | | PC AMD Ryzen 9 3950X |
- | | PC AMD Ryzen 7 1800X | + | | PC Intel Core i7-6700K |
- | | PC Intel Core i7-4790K | + | | PC Intel Core i7-4790K |
+ | | PC AMD Ryzen 7 1800X | Win10 | AMD Ryzen 7 1800X | Zen | x64 | SSE4.2/ | ||
| Apple Mac mini Late 2012 | OSX.10 | | Apple Mac mini Late 2012 | OSX.10 | ||
| Apple MacBook Pro Late 2011 | OSX.10 | | Apple MacBook Pro Late 2011 | OSX.10 | ||
| Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | | Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | ||
+ | | Essential Phone PH-1 | A9.0 | Snapdragon 835 | Kryo (A73/53) | ARMv8A | AArch64 | ||
+ | | Amazon Fire HD 10 2019 | A9.0 | Mediatek MT8183 | ||
+ | | PC AMD A10-7870K | ||
| Apple MacBook Pro Late 2013 | OSX.10 | | Apple MacBook Pro Late 2013 | OSX.10 | ||
| iPhone SE | iOS9.3 | | iPhone SE | iOS9.3 | ||
行 29: | 行 33: | ||
| NVIDIA Tegra Note 7 | A4.4 | NVIDIA Tegra 4 | Cortex-A15 | | NVIDIA Tegra Note 7 | A4.4 | NVIDIA Tegra 4 | Cortex-A15 | ||
| PC Intel N3150 Braswell | | PC Intel N3150 Braswell | ||
+ | | Raspberry Pi 4 | Ubuntu | ||
| ASUS Nexus 7 2013 | A4.4 | Qualcomm S4 APQ8064 | | ASUS Nexus 7 2013 | A4.4 | Qualcomm S4 APQ8064 | ||
| HTC J butterfly HTL21 | A4.1 | Qualcomm S4 APQ8064 | | HTC J butterfly HTL21 | A4.1 | Qualcomm S4 APQ8064 | ||
+ | | NVIDIA Jetson nano | Ubuntu | ||
| Apple TV (2015) | | Apple TV (2015) | ||
| Apple iPhone 5s | i8.0 | Apple A7 | Cyclone | | Apple iPhone 5s | i8.0 | Apple A7 | Cyclone | ||
行 9224: | 行 9230: | ||
- | ==== Qualcomm Kryo 280 (Cortex-A73) (ARMv8A AArch64 arm64) FPU+ASIMD ==== | + | ==== Qualcomm Kryo 280 (Cortex-A73 |
- | ++++Essential Phone PH-1 Snapdragon 835 big core Kryo 280 2.32GHz x4 ARM64 (AArch64) Android 9.0| | + | ++++Essential Phone PH-1 Snapdragon 835 Kryo 280 2.45GHz x4 + 1.9GHz |
< | < | ||
- | ARCH: ARMv8A | + | ARCH: ARMv8A |
- | FPU: AArch64 NEON | + | FPU : ASIMD(AArch64 NEON) |
- | SingleT SP max: 19.546 GFLOPS | + | Name: Qualcomm Technologies, |
- | SingleT DP max: 9.769 GFLOPS | + | |
- | MultiT | + | CPU Thread: 8 |
- | MultiT | + | CPU Core : |
- | CPU core: 4 | + | CPU Group : 2 |
+ | | ||
+ | Group 1: Thread= | ||
+ | NEON : yes | ||
+ | FMA : yes | ||
FPHP : no | FPHP : no | ||
SIMDHP: no | SIMDHP: no | ||
- | * FPU/NEON (single | + | Total: |
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | Group 0: Thread=4 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: | ||
+ | SingleThread DP max: 7.406 GFLOPS | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | Group 1: Thread=4 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: | ||
+ | SingleThread DP max: 9.772 GFLOPS | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * FPU/NEON (SP fp) | ||
TIME(s) | TIME(s) | ||
- | FPU fmul (32bit x1) n8 : 0.288 | + | FPU fmul (32bit x1) n8 : 0.319 |
- | FPU fadd (32bit x1) n8 : 0.258 | + | FPU fadd (32bit x1) n8 : 0.334 |
- | FPU fmadd (32bit x1) n8 : | + | FPU fmadd (32bit x1) n8 : |
- | NEON fmul.2s (32bit x2) n8 : 0.258 | + | NEON fmul.2s (32bit x2) n8 : 0.334 |
- | NEON fadd.2s (32bit x2) n8 : 0.258 | + | NEON fadd.2s (32bit x2) n8 : 0.335 |
- | NEON fmla.2s (32bit x2) n8 : 0.269 17816.9 | + | NEON fmla.2s (32bit x2) n8 : 0.334 13676.0 |
- | NEON fmul.4s (32bit x4) n8 : 0.491 | + | NEON fmul.4s (32bit x4) n8 : 0.623 |
- | NEON fadd.4s (32bit x4) n8 : 0.491 | + | NEON fadd.4s (32bit x4) n8 : 0.628 |
- | NEON fmla.4s (32bit x4) n8 : 0.491 19546.0 | + | NEON fmla.4s (32bit x4) n8 : 0.622 14677.4 |
- | FPU fmul (32bit x1) ns4 : | + | FPU fmul (32bit x1) ns4 : |
- | FPU fadd (32bit x1) ns4 : | + | FPU fadd (32bit x1) ns4 : |
- | FPU fmadd (32bit x1) ns4 : 0.614 | + | FPU fmadd (32bit x1) ns4 : 0.685 |
- | NEON fmul.2s (32bit x2) ns4 : | + | NEON fmul.2s (32bit x2) ns4 : |
- | NEON fadd.2s (32bit x2) ns4 : | + | NEON fadd.2s (32bit x2) ns4 : |
- | NEON fmla.2s (32bit x2) ns4 : | + | NEON fmla.2s (32bit x2) ns4 : |
- | NEON fmul.4s (32bit x4) ns4 : | + | NEON fmul.4s (32bit x4) ns4 : |
- | NEON fadd.4s (32bit x4) ns4 : | + | NEON fadd.4s (32bit x4) ns4 : |
- | NEON fmla.4s (32bit x4) ns4 : | + | NEON fmla.4s (32bit x4) ns4 : |
- | FPU fmul (32bit x1) n1 : 0.258 | + | FPU fmul (32bit x1) n1 : 0.607 |
- | FPU fadd (32bit x1) n1 : 0.258 | + | FPU fadd (32bit x1) n1 : 0.606 |
- | FPU fmadd (32bit x1) n1 : | + | FPU fmadd (32bit x1) n1 : |
- | NEON fmul.2s (32bit x2) n1 : 0.258 | + | NEON fmul.2s (32bit x2) n1 : 0.606 |
- | NEON fadd.2s (32bit x2) n1 : 0.258 | + | NEON fadd.2s (32bit x2) n1 : 0.606 |
- | NEON fmla.2s (32bit x2) n1 : | + | NEON fmla.2s (32bit x2) n1 : |
- | NEON fmul.4s (32bit x4) n1 : 0.491 | + | NEON fmul.4s (32bit x4) n1 : 0.621 |
- | NEON fadd.4s (32bit x4) n1 : 0.491 | + | NEON fadd.4s (32bit x4) n1 : 0.623 |
- | NEON fmla.4s (32bit x4) n1 : | + | NEON fmla.4s (32bit x4) n1 : |
- | NEON fmul.4s (32bit x4) n12 : | + | NEON fmul.4s (32bit x4) n12 : |
- | NEON fadd.4s (32bit x4) n12 : | + | NEON fadd.4s (32bit x4) n12 : |
- | NEON fmla.4s (32bit x4) n12 : | + | NEON fmla.4s (32bit x4) n12 : |
- | Average | + | Average |
- | Highest | + | Highest |
- | * FPU/NEON (double | + | * Group 0: Thread=1 |
+ | * FPU/NEON (DP fp) | ||
TIME(s) | TIME(s) | ||
- | FPU fmul (64bit x1) n8 : 0.280 | + | FPU fmul (64bit x1) n8 : 0.336 |
- | FPU fadd (64bit x1) n8 : 0.258 | + | FPU fadd (64bit x1) n8 : 0.323 |
- | FPU fmadd (64bit x1) n8 : | + | FPU fmadd (64bit x1) n8 : |
- | NEON fmul.2d (64bit x2) n8 : 0.491 | + | NEON fmul.2d (64bit x2) n8 : 0.622 |
- | NEON fadd.2d (64bit x2) n8 : 0.491 | + | NEON fadd.2d (64bit x2) n8 : 0.626 |
- | NEON fmla.2d (64bit x2) n8 : 0.492 | + | NEON fmla.2d (64bit x2) n8 : 0.622 |
- | FPU fmul (64bit x1) ns4 : | + | FPU fmul (64bit x1) ns4 : |
- | FPU fadd (64bit x1) ns4 : | + | FPU fadd (64bit x1) ns4 : |
- | FPU fmadd (64bit x1) ns4 : 0.431 | + | FPU fmadd (64bit x1) ns4 : 0.608 |
- | NEON fmul.2d (64bit x2) ns4 : | + | NEON fmul.2d (64bit x2) ns4 : |
- | NEON fadd.2d (64bit x2) ns4 : | + | NEON fadd.2d (64bit x2) ns4 : |
- | NEON fmla.2d (64bit x2) ns4 : | + | NEON fmla.2d (64bit x2) ns4 : |
- | FPU fmul (64bit x1) n1 : 0.258 | + | FPU fmul (64bit x1) n1 : 0.606 |
- | FPU fadd (64bit x1) n1 : 0.258 | + | FPU fadd (64bit x1) n1 : 0.606 |
- | FPU fmadd (64bit x1) n1 : | + | FPU fmadd (64bit x1) n1 : |
- | NEON fmul.2d (64bit x2) n1 : 0.492 | + | NEON fmul.2d (64bit x2) n1 : 0.632 |
- | NEON fadd.2d (64bit x2) n1 : 0.491 | + | NEON fadd.2d (64bit x2) n1 : 0.620 |
- | NEON fmla.2d (64bit x2) n1 : | + | NEON fmla.2d (64bit x2) n1 : |
- | NEON fmul.2d (64bit x2) n12 : | + | NEON fmul.2d (64bit x2) n12 : |
- | NEON fadd.2d (64bit x2) n12 : | + | NEON fadd.2d (64bit x2) n12 : |
- | NEON fmla.2d (64bit x2) n12 : | + | NEON fmla.2d (64bit x2) n12 : |
- | Average | + | Average |
- | Highest | + | Highest |
+ | * Group 0: Thread=1 | ||
* Matrix 4x4 | * Matrix 4x4 | ||
TIME(s) | TIME(s) | ||
- | C++ code : 0.267 | + | C++ code : 0.426 |
- | NEON fmla.4s 128bit A : | + | NEON fmla.4s 128bit A : |
- | NEON fmla.4s 128bit B : | + | NEON fmla.4s 128bit B : |
- | Average | + | Average |
- | Highest | + | Highest |
- | * FPU/NEON (single | + | * Group 0: Thread=4 |
+ | * FPU/NEON (SP fp) multi-thread | ||
TIME(s) | TIME(s) | ||
- | FPU fmul (32bit x1) n8 : 0.560 8571.6 | + | FPU fmul (32bit x1) n8 : 0.318 14335.7 |
- | FPU fadd (32bit x1) n8 : 0.538 8917.4 | + | FPU fadd (32bit x1) n8 : 0.333 13704.3 |
- | FPU fmadd (32bit x1) n8 : | + | FPU fmadd (32bit x1) n8 : |
- | NEON fmul.2s (32bit x2) n8 : 0.536 17918.8 2239.9 ( 8 0.9) 17918.8 | + | NEON fmul.2s (32bit x2) n8 : 0.333 27405.8 3425.7 ( 8 1.8) 27405.8 |
- | NEON fadd.2s (32bit x2) n8 : 0.538 17834.8 | + | NEON fadd.2s (32bit x2) n8 : 0.334 27323.1 |
- | NEON fmla.2s (32bit x2) n8 : 0.564 34051.0 2128.2 ( 16 0.9) 34051.0 | + | NEON fmla.2s (32bit x2) n8 : 0.339 53820.0 3363.8 ( 16 1.8) 53820.0 |
- | NEON fmul.4s (32bit x4) n8 : | + | NEON fmul.4s (32bit x4) n8 : |
- | NEON fadd.4s (32bit x4) n8 : | + | NEON fadd.4s (32bit x4) n8 : |
- | NEON fmla.4s (32bit x4) n8 : | + | NEON fmla.4s (32bit x4) n8 : |
- | FPU fmul (32bit x1) ns4 : | + | FPU fmul (32bit x1) ns4 : |
- | FPU fadd (32bit x1) ns4 : | + | FPU fadd (32bit x1) ns4 : |
- | FPU fmadd (32bit x1) ns4 : | + | FPU fmadd (32bit x1) ns4 : |
- | NEON fmul.2s (32bit x2) ns4 : | + | NEON fmul.2s (32bit x2) ns4 : |
- | NEON fadd.2s (32bit x2) ns4 : | + | NEON fadd.2s (32bit x2) ns4 : |
- | NEON fmla.2s (32bit x2) ns4 : | + | NEON fmla.2s (32bit x2) ns4 : |
- | NEON fmul.4s (32bit x4) ns4 : | + | NEON fmul.4s (32bit x4) ns4 : |
- | NEON fadd.4s (32bit x4) ns4 : | + | NEON fadd.4s (32bit x4) ns4 : |
- | NEON fmla.4s (32bit x4) ns4 : | + | NEON fmla.4s (32bit x4) ns4 : |
- | FPU fmul (32bit x1) n1 : 0.268 17912.2 | + | FPU fmul (32bit x1) n1 : 0.608 7505.8 |
- | FPU fadd (32bit x1) n1 : 0.268 17911.6 | + | FPU fadd (32bit x1) n1 : 0.606 7528.7 |
- | FPU fmadd (32bit x1) n1 : | + | FPU fmadd (32bit x1) n1 : |
- | NEON fmul.2s (32bit x2) n1 : 0.268 35833.5 | + | NEON fmul.2s (32bit x2) n1 : 0.616 14812.3 |
- | NEON fadd.2s (32bit x2) n1 : 0.268 35834.0 4479.2 ( 8 1.8) 35834.0 | + | NEON fadd.2s (32bit x2) n1 : 0.605 15074.0 1884.2 ( 8 1.0) 15074.0 |
- | NEON fmla.2s (32bit x2) n1 : | + | NEON fmla.2s (32bit x2) n1 : |
- | NEON fmul.4s (32bit x4) n1 : 0.510 37624.7 | + | NEON fmul.4s (32bit x4) n1 : 0.630 28977.6 |
- | NEON fadd.4s (32bit x4) n1 : 0.510 37627.6 | + | NEON fadd.4s (32bit x4) n1 : 0.622 29345.1 |
- | NEON fmla.4s (32bit x4) n1 : | + | NEON fmla.4s (32bit x4) n1 : |
- | NEON fmul.4s (32bit x4) n12 : | + | NEON fmul.4s (32bit x4) n12 : |
- | NEON fadd.4s (32bit x4) n12 : | + | NEON fadd.4s (32bit x4) n12 : |
- | NEON fmla.4s (32bit x4) n12 : | + | NEON fmla.4s (32bit x4) n12 : |
- | Average | + | Average |
- | Highest | + | Highest |
- | * FPU/NEON (double | + | * Group 0: Thread=4 |
+ | * FPU/NEON (DP fp) multi-thread | ||
TIME(s) | TIME(s) | ||
- | FPU fmul (64bit x1) n8 : 0.417 11502.8 | + | FPU fmul (64bit x1) n8 : 0.336 13593.7 |
- | FPU fadd (64bit x1) n8 : 0.365 13164.7 | + | FPU fadd (64bit x1) n8 : 0.323 14139.0 |
- | FPU fmadd (64bit x1) n8 : | + | FPU fmadd (64bit x1) n8 : |
- | NEON fmul.2d (64bit x2) n8 : 0.692 13877.0 1734.6 ( 8 0.7) 13877.0 | + | NEON fmul.2d (64bit x2) n8 : 0.621 14688.0 1836.0 ( 8 1.0) 14688.0 |
- | NEON fadd.2d (64bit x2) n8 : 0.617 15560.5 | + | NEON fadd.2d (64bit x2) n8 : 0.621 14690.9 |
- | NEON fmla.2d (64bit x2) n8 : 0.540 35539.9 | + | NEON fmla.2d (64bit x2) n8 : 0.621 29407.3 |
- | FPU fmul (64bit x1) ns4 : | + | FPU fmul (64bit x1) ns4 : |
- | FPU fadd (64bit x1) ns4 : | + | FPU fadd (64bit x1) ns4 : |
- | FPU fmadd (64bit x1) ns4 : 0.448 21419.5 | + | FPU fmadd (64bit x1) ns4 : 0.606 15057.1 |
- | NEON fmul.2d (64bit x2) ns4 : | + | NEON fmul.2d (64bit x2) ns4 : |
- | NEON fadd.2d (64bit x2) ns4 : | + | NEON fadd.2d (64bit x2) ns4 : |
- | NEON fmla.2d (64bit x2) ns4 : | + | NEON fmla.2d (64bit x2) ns4 : |
- | FPU fmul (64bit x1) n1 : 0.268 17919.8 | + | FPU fmul (64bit x1) n1 : 0.605 7534.9 |
- | FPU fadd (64bit x1) n1 : 0.268 17920.0 | + | FPU fadd (64bit x1) n1 : 0.605 7535.5 |
- | FPU fmadd (64bit x1) n1 : | + | FPU fmadd (64bit x1) n1 : |
- | NEON fmul.2d (64bit x2) n1 : 0.510 18812.5 | + | NEON fmul.2d (64bit x2) n1 : 0.621 14702.7 |
- | NEON fadd.2d (64bit x2) n1 : 0.510 18812.1 2351.5 ( 8 1.0) 18812.1 | + | NEON fadd.2d (64bit x2) n1 : 0.621 14698.1 1837.3 ( 8 1.0) 14698.1 |
- | NEON fmla.2d (64bit x2) n1 : | + | NEON fmla.2d (64bit x2) n1 : |
- | NEON fmul.2d (64bit x2) n12 : | + | NEON fmul.2d (64bit x2) n12 : |
- | NEON fadd.2d (64bit x2) n12 : | + | NEON fadd.2d (64bit x2) n12 : |
- | NEON fmla.2d (64bit x2) n12 : | + | NEON fmla.2d (64bit x2) n12 : |
- | Average | + | Average |
- | Highest | + | Highest |
+ | * Group 0: Thread=4 | ||
* Matrix 4x4 multi-thread | * Matrix 4x4 multi-thread | ||
TIME(s) | TIME(s) | ||
- | C++ code : 0.569 12591.7 | + | C++ code : 0.436 15639.0 |
- | NEON fmla.4s 128bit A : | + | NEON fmla.4s 128bit A : |
- | NEON fmla.4s 128bit B : | + | NEON fmla.4s 128bit B : |
- | Average | + | Average |
- | Highest | + | Highest |
- | cpu0 1900800 300000 | + | * Group 1: Thread=1 |
- | cpu1 1900800 300000 | + | * FPU/NEON (SP fp) |
- | cpu2 1900800 300000 | + | TIME(s) |
- | cpu3 1900800 300000 | + | FPU fmul (32bit x1) n8 : 0.317 |
- | cpu4 2457600 300000 | + | FPU fadd (32bit x1) n8 : 0.317 |
- | cpu5 2457600 300000 | + | FPU fmadd (32bit x1) n8 : |
- | cpu6 2457600 300000 | + | NEON fmul.2s (32bit x2) n8 : 0.317 |
- | cpu7 2457600 300000 | + | NEON fadd.2s (32bit x2) n8 : 0.317 |
+ | NEON fmla.2s (32bit x2) n8 : 0.329 17927.5 | ||
+ | NEON fmul.4s (32bit x4) n8 : 0.603 | ||
+ | NEON fadd.4s (32bit x4) n8 : 0.603 | ||
+ | NEON fmla.4s (32bit x4) n8 : 0.603 19546.8 | ||
+ | FPU fmul (32bit x1) ns4 : | ||
+ | FPU fadd (32bit x1) ns4 : | ||
+ | FPU fmadd (32bit x1) ns4 : 0.755 | ||
+ | NEON fmul.2s (32bit x2) ns4 : | ||
+ | NEON fadd.2s (32bit x2) ns4 : | ||
+ | NEON fmla.2s (32bit x2) ns4 : | ||
+ | NEON fmul.4s (32bit x4) ns4 : | ||
+ | NEON fadd.4s (32bit x4) ns4 : | ||
+ | NEON fmla.4s (32bit x4) ns4 : | ||
+ | FPU fmul (32bit x1) n1 : 0.317 | ||
+ | FPU fadd (32bit x1) n1 : 0.317 | ||
+ | FPU fmadd (32bit x1) n1 : | ||
+ | NEON fmul.2s (32bit x2) n1 : 0.317 | ||
+ | NEON fadd.2s (32bit x2) n1 : 0.317 | ||
+ | NEON fmla.2s (32bit x2) n1 : 1.810 | ||
+ | NEON fmul.4s (32bit x4) n1 : 0.604 | ||
+ | NEON fadd.4s (32bit x4) n1 : 0.603 | ||
+ | NEON fmla.4s (32bit x4) n1 : 1.811 | ||
+ | NEON fmul.4s (32bit x4) n12 : | ||
+ | NEON fadd.4s (32bit x4) n12 : | ||
+ | NEON fmla.4s (32bit x4) n12 : | ||
+ | Average | ||
+ | Highest | ||
- | Processor : AArch64 Processor rev 1 (aarch64) | ||
- | processor : 0 | ||
- | BogoMIPS : 38.00 | ||
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0xa | ||
- | CPU part : 0x801 | ||
- | CPU revision : 4 | ||
- | processor : 1 | + | * Group 1: |
- | BogoMIPS : 38.00 | + | * FPU/NEON (DP fp) |
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | + | TIME(s) |
- | CPU implementer : 0x51 | + | FPU fmul (64bit x1) n8 |
- | CPU architecture: 8 | + | FPU fadd (64bit x1) n8 |
- | CPU variant : 0xa | + | FPU fmadd (64bit x1) n8 : |
- | CPU part : 0x801 | + | NEON fmul.2d (64bit x2) n8 |
- | CPU revision : 4 | + | NEON fadd.2d (64bit x2) n8 : 0.604 |
+ | NEON fmla.2d (64bit x2) n8 : 0.604 | ||
+ | FPU fmul (64bit x1) ns4 : | ||
+ | FPU fadd (64bit x1) ns4 : | ||
+ | FPU fmadd (64bit x1) ns4 | ||
+ | NEON fmul.2d (64bit x2) ns4 : | ||
+ | NEON fadd.2d (64bit x2) ns4 : | ||
+ | NEON fmla.2d (64bit x2) ns4 : | ||
+ | FPU fmul (64bit x1) n1 : 0.317 | ||
+ | FPU fadd (64bit x1) n1 : 0.317 | ||
+ | FPU fmadd (64bit x1) n1 : | ||
+ | NEON fmul.2d (64bit x2) n1 : 0.604 | ||
+ | NEON fadd.2d (64bit x2) n1 : 0.604 | ||
+ | NEON fmla.2d (64bit x2) n1 : 1.811 | ||
+ | NEON fmul.2d (64bit x2) n12 : | ||
+ | NEON fadd.2d (64bit x2) n12 : | ||
+ | NEON fmla.2d (64bit x2) n12 : | ||
+ | Average | ||
+ | Highest | ||
- | processor : 2 | ||
- | BogoMIPS : 38.00 | ||
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0xa | ||
- | CPU part : 0x801 | ||
- | CPU revision : 4 | ||
- | processor : 3 | + | * Group 1: |
- | BogoMIPS : 38.00 | + | * Matrix 4x4 |
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | + | |
- | CPU implementer : | + | C++ code |
- | CPU architecture: 8 | + | NEON fmla.4s 128bit A : |
- | CPU variant : 0xa | + | NEON fmla.4s 128bit B : |
- | CPU part : 0x801 | + | Average |
- | CPU revision : 4 | + | Highest |
- | processor : 4 | ||
- | BogoMIPS : 38.00 | ||
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0xa | ||
- | CPU part : 0x800 | ||
- | CPU revision : 1 | ||
- | processor : 5 | + | * Group 1: |
- | BogoMIPS : 38.00 | + | * FPU/NEON (SP fp) multi-thread |
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | + | TIME(s) |
- | CPU implementer : 0x51 | + | FPU fmul (32bit x1) n8 : 0.455 12977.3 |
- | CPU architecture: 8 | + | FPU fadd (32bit x1) n8 : 0.494 11946.6 |
- | CPU variant : 0xa | + | FPU fmadd (32bit x1) n8 : |
- | CPU part : 0x800 | + | NEON fmul.2s (32bit x2) n8 : 0.479 24612.2 |
- | CPU revision : 1 | + | NEON fadd.2s (32bit x2) n8 |
+ | NEON fmla.2s (32bit x2) n8 | ||
+ | NEON fmul.4s (32bit x4) n8 | ||
+ | NEON fadd.4s (32bit x4) n8 | ||
+ | NEON fmla.4s (32bit x4) n8 : 0.925 51022.2 | ||
+ | FPU fmul (32bit x1) ns4 : | ||
+ | FPU fadd (32bit x1) ns4 : | ||
+ | FPU fmadd (32bit x1) ns4 | ||
+ | NEON fmul.2s (32bit x2) ns4 : | ||
+ | NEON fadd.2s (32bit x2) ns4 : | ||
+ | NEON fmla.2s (32bit x2) ns4 : | ||
+ | NEON fmul.4s (32bit x4) ns4 : | ||
+ | NEON fadd.4s (32bit x4) ns4 : | ||
+ | NEON fmla.4s (32bit x4) ns4 : | ||
+ | FPU fmul (32bit x1) n1 : 0.403 14651.9 | ||
+ | FPU fadd (32bit x1) n1 : 0.434 13582.6 | ||
+ | FPU fmadd (32bit x1) n1 : | ||
+ | NEON fmul.2s (32bit x2) n1 : 0.412 28620.6 | ||
+ | NEON fadd.2s (32bit x2) n1 : 0.384 30747.3 | ||
+ | NEON fmla.2s (32bit x2) n1 : 2.123 11112.1 | ||
+ | NEON fmul.4s (32bit x4) n1 : 0.684 34490.5 | ||
+ | NEON fadd.4s (32bit x4) n1 : 0.636 37116.4 | ||
+ | NEON fmla.4s (32bit x4) n1 : 1.902 24810.3 | ||
+ | NEON fmul.4s (32bit x4) n12 : | ||
+ | NEON fadd.4s (32bit x4) n12 : | ||
+ | NEON fmla.4s (32bit x4) n12 : | ||
+ | Average | ||
+ | Highest | ||
- | processor : 6 | ||
- | BogoMIPS : 38.00 | ||
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0xa | ||
- | CPU part : 0x800 | ||
- | CPU revision : 1 | ||
- | processor : 7 | + | * Group 1: |
- | BogoMIPS : 38.00 | + | * FPU/NEON (DP fp) multi-thread |
- | Features : fp asimd evtstrm aes pmull sha1 sha2 crc32 | + | TIME(s) |
- | CPU implementer : 0x51 | + | FPU fmul (64bit x1) n8 : 1.267 |
- | CPU architecture: 8 | + | FPU fadd (64bit x1) n8 : 1.271 |
- | CPU variant : 0xa | + | FPU fmadd (64bit x1) n8 : |
- | CPU part : 0x800 | + | NEON fmul.2d (64bit x2) n8 : 2.414 |
- | CPU revision : 1 | + | NEON fadd.2d (64bit x2) n8 |
+ | NEON fmla.2d (64bit x2) n8 | ||
+ | FPU fmul (64bit x1) ns4 : | ||
+ | FPU fadd (64bit x1) ns4 : | ||
+ | FPU fmadd (64bit x1) ns4 : 2.116 | ||
+ | NEON fmul.2d (64bit x2) ns4 : | ||
+ | NEON fadd.2d (64bit x2) ns4 : | ||
+ | NEON fmla.2d (64bit x2) ns4 : | ||
+ | FPU fmul (64bit x1) n1 : 1.274 | ||
+ | FPU fadd (64bit x1) n1 : 1.268 | ||
+ | FPU fmadd (64bit x1) n1 : | ||
+ | NEON fmul.2d (64bit x2) n1 : 0.821 14364.5 | ||
+ | NEON fadd.2d (64bit x2) n1 : 0.937 12586.2 | ||
+ | NEON fmla.2d (64bit x2) n1 : 2.334 10110.2 | ||
+ | NEON fmul.2d (64bit x2) n12 : | ||
+ | NEON fadd.2d (64bit x2) n12 : | ||
+ | NEON fmla.2d (64bit x2) n12 : | ||
+ | Average | ||
+ | Highest | ||
- | Hardware : Qualcomm Technologies, | ||
- | Qualcomm Technologies, | + | * Group 1: Thread=4 |
+ | * Matrix 4x4 multi-thread | ||
+ | TIME(s) | ||
+ | C++ code : 0.540 16309.5 | ||
+ | NEON fmla.4s 128bit A : | ||
+ | NEON fmla.4s 128bit B : | ||
+ | Average | ||
+ | Highest | ||
- | 2019/01/19 16: | ||
</ | </ | ||
行 13747: | 行 13851: | ||
++++ | ++++ | ||
+ | ==== Intel Core i5-1030NG7 (AMD64 x86_64 x64) SSE4.2/ | ||
+ | |||
+ | |||
+ | ++++Intel Core i5-1030NG7 1.1GHz (3.5GHz) 4 core 8 thread Windows 10| | ||
+ | |||
+ | < | ||
+ | Date: 20200810 185418 | ||
+ | ARCH: x64 (x86_64) | ||
+ | FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C AVX512F/ | ||
+ | Name: | ||
+ | CPU Thread: | ||
+ | CPU Core : 4 | ||
+ | CPU Group : 1 | ||
+ | Group 0: Thread= 8 Clock=1.100000 GHz (mask:0) | ||
+ | SSE : yes | ||
+ | AVX : yes | ||
+ | FMA : yes | ||
+ | F16C : yes | ||
+ | AVX512: yes | ||
+ | |||
+ | Total: | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | Group 0: Thread=8 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (SP fp) | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.101 13027.6 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.143 10399.8 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.142 10437.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.102 51919.4 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.142 41781.8 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.143 41652.9 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.108 27519.6 | ||
+ | SSE mulss (32bit x1) ns4 : 0.190 | ||
+ | SSE addss (32bit x1) ns4 : 0.190 | ||
+ | SSE mulps (32bit x4) ns4 : 0.190 13906.4 | ||
+ | SSE addps (32bit x4) ns4 : 0.190 13867.9 | ||
+ | AVX vmulps (32bit x8) n8 : 0.095 55597.1 | ||
+ | AVX vaddps (32bit x8) n8 : 0.095 55388.9 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.095 55612.9 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.122 86880.7 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.142 83413.5 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.144 82441.6 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.144 55154.4 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.142 55624.6 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.142 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (DP fp) | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.143 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.102 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.142 13910.1 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.102 12983.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.102 12988.4 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.101 13026.5 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.143 27767.1 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.108 13686.9 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.096 27464.0 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.095 27868.4 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.095 27776.9 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.101 52105.9 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.143 41631.3 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.142 41748.7 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.294 26935.4 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.294 26918.9 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.294 53835.4 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.293 40495.9 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.293 40512.9 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (SP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.207 51050.5 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.310 38279.6 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.310 38294.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.310 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.310 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.233 | ||
+ | SSE mulss (32bit x1) ns4 : 0.231 22819.0 | ||
+ | SSE addss (32bit x1) ns4 : 0.232 22796.0 | ||
+ | SSE mulps (32bit x4) ns4 : 0.232 90991.3 | ||
+ | SSE addps (32bit x4) ns4 : 0.232 91226.8 | ||
+ | AVX vmulps (32bit x8) n8 : 0.207 | ||
+ | AVX vaddps (32bit x8) n8 : 0.207 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.311 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.310 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.316 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.310 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.306 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (DP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.244 21634.6 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.207 25508.6 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.311 50924.6 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.207 51029.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.207 51025.8 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.207 51019.7 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.311 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.233 51085.6 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.207 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.207 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.314 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.318 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.682 92879.9 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.682 92855.7 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.682 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | </ | ||
opengl/vfpbenchlog.txt · 最終更新: 2020/12/30 23:46 by oga