opengl:vfpbenchlog
Diff
Shows the differences between two versions of this page.
opengl:vfpbenchlog [2019/06/16 01:07] – [Qualcomm Kryo 280 (Cortex-A73) (ARMv8A AArch64 arm64) FPU+ASIMD] oga | opengl:vfpbenchlog [2020/08/13 22:50] – [Intel Core i5-1030NG7 (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3/AVX512F,CD,VL,BW,DQ,VNNI] oga | ||
---|---|---|---|
Line 12: | Line 12: | ||
^ Device | ^ Device | ||
- | | PC Intel Core i7-6700K | + | | PC AMD Ryzen 9 3950X |
- | | PC AMD Ryzen 7 1800X | + | | PC Intel Core i7-6700K |
- | | PC Intel Core i7-4790K | + | | PC Intel Core i7-4790K |
+ | | PC AMD Ryzen 7 1800X | Win10 | AMD Ryzen 7 1800X | Zen | x64 | SSE4.2/ | ||
| Apple Mac mini Late 2012 | OSX.10 | | Apple Mac mini Late 2012 | OSX.10 | ||
| Apple MacBook Pro Late 2011 | OSX.10 | | Apple MacBook Pro Late 2011 | OSX.10 | ||
| Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | | Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | ||
+ | | Essential Phone PH-1 | A9.0 | Snapdragon 835 | Kryo (A73/53) | ARMv8A | AArch64 | ||
+ | | Amazon Fire HD 10 2019 | A9.0 | Mediatek MT8183 | ||
+ | | PC AMD A10-7870K | ||
| Apple MacBook Pro Late 2013 | OSX.10 | | Apple MacBook Pro Late 2013 | OSX.10 | ||
| iPhone SE | iOS9.3 | | iPhone SE | iOS9.3 | ||
Line 29: | Line 33: | ||
| NVIDIA Tegra Note 7 | A4.4 | NVIDIA Tegra 4 | Cortex-A15 | | NVIDIA Tegra Note 7 | A4.4 | NVIDIA Tegra 4 | Cortex-A15 | ||
| PC Intel N3150 Braswell | | PC Intel N3150 Braswell | ||
+ | | Raspberry Pi 4 | Ubuntu | ||
| ASUS Nexus 7 2013 | A4.4 | Qualcomm S4 APQ8064 | | ASUS Nexus 7 2013 | A4.4 | Qualcomm S4 APQ8064 | ||
| HTC J butterfly HTL21 | A4.1 | Qualcomm S4 APQ8064 | | HTC J butterfly HTL21 | A4.1 | Qualcomm S4 APQ8064 | ||
+ | | NVIDIA Jetson nano | Ubuntu | ||
| Apple TV (2015) | | Apple TV (2015) | ||
| Apple iPhone 5s | i8.0 | Apple A7 | Cyclone | | Apple iPhone 5s | i8.0 | Apple A7 | Cyclone | ||
Line 13845: | Line 13851: | ||
++++ | ++++ | ||
+ | ==== Intel Core i5-1030NG7 (AMD64 x86_64 x64) SSE4.2/ | ||
+ | ++++Intel Core i5-1030NG7 1.1GHz (3.5GHz) 4 core 8 thread Windows 10| | ||
+ | |||
+ | < | ||
+ | Date: 20200810 185418 | ||
+ | ARCH: x64 (x86_64) | ||
+ | FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C AVX512F/ | ||
+ | Name: | ||
+ | CPU Thread: | ||
+ | CPU Core : 4 | ||
+ | CPU Group : 1 | ||
+ | Group 0: Thread= 8 Clock=1.100000 GHz (mask:0) | ||
+ | SSE : yes | ||
+ | AVX : yes | ||
+ | FMA : yes | ||
+ | F16C : yes | ||
+ | AVX512: yes | ||
+ | |||
+ | Total: | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | Group 0: Thread=8 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (SP fp) | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.101 13027.6 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.143 10399.8 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.142 10437.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.102 51919.4 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.142 41781.8 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.143 41652.9 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.108 27519.6 | ||
+ | SSE mulss (32bit x1) ns4 : 0.190 | ||
+ | SSE addss (32bit x1) ns4 : 0.190 | ||
+ | SSE mulps (32bit x4) ns4 : 0.190 13906.4 | ||
+ | SSE addps (32bit x4) ns4 : 0.190 13867.9 | ||
+ | AVX vmulps (32bit x8) n8 : 0.095 55597.1 | ||
+ | AVX vaddps (32bit x8) n8 : 0.095 55388.9 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.095 55612.9 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.122 86880.7 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.142 83413.5 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.144 82441.6 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.144 55154.4 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.142 55624.6 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.142 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (DP fp) | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.143 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.102 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.142 13910.1 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.102 12983.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.102 12988.4 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.101 13026.5 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.143 27767.1 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.108 13686.9 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.096 27464.0 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.095 27868.4 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.095 27776.9 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.101 52105.9 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.143 41631.3 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.142 41748.7 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.294 26935.4 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.294 26918.9 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.294 53835.4 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.293 40495.9 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.293 40512.9 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (SP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.207 51050.5 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.310 38279.6 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.310 38294.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.310 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.310 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.233 | ||
+ | SSE mulss (32bit x1) ns4 : 0.231 22819.0 | ||
+ | SSE addss (32bit x1) ns4 : 0.232 22796.0 | ||
+ | SSE mulps (32bit x4) ns4 : 0.232 90991.3 | ||
+ | SSE addps (32bit x4) ns4 : 0.232 91226.8 | ||
+ | AVX vmulps (32bit x8) n8 : 0.207 | ||
+ | AVX vaddps (32bit x8) n8 : 0.207 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.311 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.310 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.316 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.310 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.306 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (DP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.244 21634.6 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.207 25508.6 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.311 50924.6 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.207 51029.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.207 51025.8 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.207 51019.7 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.311 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.233 51085.6 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.207 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.207 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.314 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.318 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.682 92879.9 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.682 92855.7 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.682 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | </ | ||
+ | |||
+ | ++++ | ||
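As a rough sanity check on the single-thread SP maximum reported in the log above (111.310 GFLOPS), the sketch below computes a theoretical peak from clock speed and vector width. The helper `peak_gflops` and its parameter names are hypothetical and not part of vfpbench. The 3.5 GHz clock is the boost figure from the section heading. The assumptions that the core held that clock during the run and that it issues two 256-bit FMA instructions per cycle are not stated in the log.

```python
def peak_gflops(clock_ghz, simd_bits, element_bits, flops_per_lane, instrs_per_cycle):
    """Theoretical peak GFLOPS for one core (hypothetical helper, not from vfpbench)."""
    lanes = simd_bits // element_bits  # e.g. 256-bit / 32-bit = 8 SP lanes
    return clock_ghz * lanes * flops_per_lane * instrs_per_cycle


# Assumptions (not stated in the log): the core runs at the 3.5 GHz boost clock
# and issues two 256-bit FMA instructions per cycle. An FMA counts as 2 FLOPs per lane.
sp_peak = peak_gflops(clock_ghz=3.5, simd_bits=256, element_bits=32,
                      flops_per_lane=2, instrs_per_cycle=2)

measured_sp_max = 111.310  # "SingleThread SP max" value from the log
efficiency = measured_sp_max / sp_peak

print(sp_peak)  # 112.0
print(f"measured / theoretical = {efficiency:.3f}")
```

Under these assumptions the measured value sits just below the computed peak, which is consistent with the run holding roughly the boost clock for the duration of the test.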
opengl/vfpbenchlog.txt · Last modified: 2020/12/30 23:46 by oga