opengl:vfpbenchlog
Diff
This page shows the differences between two revisions of the page.
opengl:vfpbenchlog [2020/01/05 01:37] – [Results list] oga | opengl:vfpbenchlog [2020/08/13 22:49] – [AMD Ryzen 7 1800X (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3] oga | ||
---|---|---|---|
Line 19: | Line 19: | ||
| Apple MacBook Pro Late 2011 | OSX.10 | | Apple MacBook Pro Late 2011 | OSX.10 | ||
| Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | | Pixel 3 | A9.0 | Snapdragon 845 | Kryo 385(A75/55) | ARMv8A | AArch64 | ||
+ | | Essential Phone PH-1 | A9.0 | Snapdragon 835 | Kryo (A73/53) | ARMv8A | AArch64 | ||
+ | | Amazon Fire HD 10 2019 | A9.0 | Mediatek MT8183 | ||
| PC AMD A10-7870K | | PC AMD A10-7870K | ||
| Apple MacBook Pro Late 2013 | OSX.10 | | Apple MacBook Pro Late 2013 | OSX.10 | ||
Line 13849: | Line 13851: | ||
++++ | ++++ | ||
+ | ==== Intel Core i5-1030NG7 (AMD64 x86_64 x64) SSE4.2/ | ||
+ | |||
+ | |||
+ | ++++Intel Core i5-1030NG7 1.1GHz (3.5GHz) 4 core 8 thread Windows 10| | ||
+ | |||
+ | < | ||
+ | Date: 20200810 185418 | ||
+ | ARCH: x64 (x86_64) | ||
+ | FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C AVX512F/ | ||
+ | Name: | ||
+ | CPU Thread: | ||
+ | CPU Core : 4 | ||
+ | CPU Group : 1 | ||
+ | Group 0: Thread= 8 Clock=1.100000 GHz (mask:0) | ||
+ | SSE : yes | ||
+ | AVX : yes | ||
+ | FMA : yes | ||
+ | F16C : yes | ||
+ | AVX512: yes | ||
+ | |||
+ | Total: | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | Group 0: Thread=8 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: 111.310 GFLOPS | ||
+ | SingleThread DP max: | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (SP fp) | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.101 13027.6 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.143 10399.8 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.142 10437.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.102 51919.4 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.142 41781.8 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.143 41652.9 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.108 27519.6 | ||
+ | SSE mulss (32bit x1) ns4 : 0.190 | ||
+ | SSE addss (32bit x1) ns4 : 0.190 | ||
+ | SSE mulps (32bit x4) ns4 : 0.190 13906.4 | ||
+ | SSE addps (32bit x4) ns4 : 0.190 13867.9 | ||
+ | AVX vmulps (32bit x8) n8 : 0.095 55597.1 | ||
+ | AVX vaddps (32bit x8) n8 : 0.095 55388.9 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.095 55612.9 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.122 86880.7 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.142 83413.5 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.144 82441.6 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.144 55154.4 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.142 55624.6 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.142 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=1 | ||
+ | * SSE/AVX (DP fp) | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.143 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.102 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.142 13910.1 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.102 12983.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.102 12988.4 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.101 13026.5 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.143 27767.1 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.108 13686.9 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.096 27464.0 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.095 27868.4 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.095 27776.9 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.101 52105.9 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.143 41631.3 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.142 41748.7 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.294 26935.4 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.294 26918.9 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.294 53835.4 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.293 40495.9 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.293 40512.9 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (SP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE mulss (32bit x1) n8 : | ||
+ | SSE addss (32bit x1) n8 : | ||
+ | FMA vfmaddss (32bit x1) n8 : 0.207 51050.5 | ||
+ | FMA vfmaddss (32bit x1) n12 : | ||
+ | FMA vfma+mlss (32bit x1) n12 : 0.310 38279.6 | ||
+ | FMA vfma+adss (32bit x1) n12 : 0.310 38294.5 | ||
+ | SSE mulps (32bit x4) n8 : | ||
+ | SSE addps (32bit x4) n8 : | ||
+ | SSE mul+addps (32bit x4) n8 : | ||
+ | FMA vfmaddps (32bit x4) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x4) n12 : | ||
+ | FMA vfma+mlps (32bit x4) n12 : 0.310 | ||
+ | FMA vfma+adps (32bit x4) n12 : 0.310 | ||
+ | SSE ml+ad+adps (32bit x4) n9 : 0.233 | ||
+ | SSE mulss (32bit x1) ns4 : 0.231 22819.0 | ||
+ | SSE addss (32bit x1) ns4 : 0.232 22796.0 | ||
+ | SSE mulps (32bit x4) ns4 : 0.232 90991.3 | ||
+ | SSE addps (32bit x4) ns4 : 0.232 91226.8 | ||
+ | AVX vmulps (32bit x8) n8 : 0.207 | ||
+ | AVX vaddps (32bit x8) n8 : 0.207 | ||
+ | AVX vmul+addps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n8 : 0.207 | ||
+ | FMA vfmaddps (32bit x8) n12 : | ||
+ | FMA vfma+mlps (32bit x8) n12 : 0.311 | ||
+ | FMA vfma+adps (32bit x8) n12 : 0.310 | ||
+ | AVX vml+ad+adps (32bit x8) n9 : | ||
+ | AVX512 vmulps (32bit x16) n12 : | ||
+ | AVX512 vaddps (32bit x16) n12 : | ||
+ | AVX512 vfmaddps (32bit x16) n12 : | ||
+ | AVX512 vfma+mps (32bit x16) n12 : | ||
+ | AVX512 vfma+aps (32bit x16) n12 : | ||
+ | AVX512 vmulps (32bit x8) n12 : 0.316 | ||
+ | AVX512 vaddps (32bit x8) n12 : 0.310 | ||
+ | AVX512 vfmaddps (32bit x8) n12 : 0.306 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | |||
+ | * Group 0: Thread=8 | ||
+ | * SSE/AVX (DP fp) multi-thread | ||
+ | TIME(s) | ||
+ | SSE2 mulsd (64bit x1) n8 : 0.244 21634.6 | ||
+ | SSE2 addsd (64bit x1) n8 : 0.207 25508.6 | ||
+ | FMA vfmaddsd (64bit x1) n8 : | ||
+ | FMA vfmaddsd (64bit x1) n12 : 0.311 50924.6 | ||
+ | FMA vfma+mlsd (64bit x1) n12 : | ||
+ | FMA vfma+adsd (64bit x1) n12 : | ||
+ | SSE2 mulpd (64bit x2) n8 : 0.207 51029.3 | ||
+ | SSE2 addpd (64bit x2) n8 : 0.207 51025.8 | ||
+ | SSE2 mul+addpd (64bit x2) n8 : 0.207 51019.7 | ||
+ | FMA vfmaddpd (64bit x2) n8 : | ||
+ | FMA vfmaddpd (64bit x2) n12 : 0.311 | ||
+ | FMA vfma+mlpd (64bit x2) n12 : | ||
+ | FMA vfma+adpd (64bit x2) n12 : | ||
+ | SSE2 ml+ad+dpd (64bit x2) n9 : 0.233 51085.6 | ||
+ | SSE2 mulsd (64bit x1) ns4 : | ||
+ | SSE2 addsd (64bit x1) ns4 : | ||
+ | SSE2 mulpd (64bit x2) ns4 : | ||
+ | SSE2 addpd (64bit x2) ns4 : | ||
+ | AVX vmulpd (64bit x4) n8 : 0.207 | ||
+ | AVX vaddpd (64bit x4) n8 : 0.207 | ||
+ | AVX vmul+addpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n8 : 0.207 | ||
+ | FMA vfmaddpd (64bit x4) n12 : | ||
+ | FMA vfma+mlpd (64bit x4) n12 : 0.314 | ||
+ | FMA vfma+adpd (64bit x4) n12 : 0.318 | ||
+ | AVX vml_ad_adpd (64bit x4) n9 : | ||
+ | AVX512 vmulpd (64bit x8) n12 : 0.682 92879.9 | ||
+ | AVX512 vaddpd (64bit x8) n12 : 0.682 92855.7 | ||
+ | AVX512 vfmaddpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+mpd (64bit x8) n12 : 0.682 | ||
+ | AVX512 vfma+apd (64bit x8) n12 : 0.682 | ||
+ | Average | ||
+ | Highest | ||
+ | |||
+ | </ | ||
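The single-thread figures in the log above can be related to the clock speed with a little arithmetic. The following is a minimal sketch, not part of the benchmark itself. It assumes that the second column of each row is MFLOPS, that one FMA counts as 2 floating-point operations per lane, and that the core ran at the 3.5 GHz boost clock named in the section title. The helper `fma_per_cycle` and the constant names are hypothetical and used only for illustration.

```python
# Rough sanity check: how many FMA instructions per clock cycle does a
# measured throughput figure imply?
# Assumptions (not stated in the log): the second column is MFLOPS, one FMA
# counts as 2 floating-point operations per lane, and the core ran at the
# 3.5 GHz boost clock during the single-thread test.

FLOPS_PER_FMA = 2  # one multiply plus one add per lane


def fma_per_cycle(mflops: float, lanes: int, clock_ghz: float) -> float:
    """Return the implied number of FMA instructions issued per cycle."""
    gflops = mflops / 1000.0
    # Billions of FMA instructions per second.
    ginstr_per_s = gflops / (lanes * FLOPS_PER_FMA)
    return ginstr_per_s / clock_ghz


# Single-thread SP rows copied from the log: (label, lanes, MFLOPS)
ROWS = [
    ("FMA vfmaddss (32bit x1) n8", 1, 13027.6),
    ("FMA vfmaddps (32bit x4) n8", 4, 51919.4),
    ("FMA vfmaddps (32bit x8) n8", 8, 86880.7),
]

ASSUMED_CLOCK_GHZ = 3.5  # boost clock from the section title; an assumption

if __name__ == "__main__":
    for label, lanes, mflops in ROWS:
        ipc = fma_per_cycle(mflops, lanes, ASSUMED_CLOCK_GHZ)
        print(f"{label}: {ipc:.2f} FMA/cycle")
```

Under these assumptions the scalar and 128-bit rows work out to roughly 1.9 FMA instructions per cycle and the 256-bit n8 row to about 1.6. The log does not record the actual clock during the run, so these ratios are only indicative.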
opengl/vfpbenchlog.txt · Last modified: 2020/12/30 23:46 by oga