opengl:vfpbenchlog
差分
このページの2つのバージョン間の差分を表示します。
両方とも前のリビジョン前のリビジョン次のリビジョン | 前のリビジョン最新のリビジョン両方とも次のリビジョン | ||
opengl:vfpbenchlog [2020/08/13 23:00] – [結果一覧] oga | opengl:vfpbenchlog [2020/12/30 14:44] – [旧リスト] oga | ||
---|---|---|---|
行 7: | 行 7: | ||
~~NOTOC~~ | ~~NOTOC~~ | ||
+ | |||
===== 結果一覧 ===== | ===== 結果一覧 ===== | ||
+ | ^ Device | ||
+ | ^ ::: ^ ::: ^ ::: ^ Half-p | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | | [[https:// | ||
+ | |||
+ | |||
+ | * Half-p, Single-p, Dobule-p の単位は GFLOPS | ||
+ | * Multi Thread Single-p が大きい順に並んでいます。 | ||
+ | * Apple M1/A10~ など Apple OS 上の big.LITTLE CPU では LITTLE core の正しい計測ができていません。Multi Thread のスコアは不正確で実際より大きな値が出ています。Single Thread の値だけ見るようにしてください。 | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | ---- | ||
+ | |||
+ | ==== 旧リスト ==== | ||
^ Device | ^ Device | ||
行 18: | 行 78: | ||
| Apple Mac mini Late 2012 | OSX.10 | | Apple Mac mini Late 2012 | OSX.10 | ||
| Apple MacBook Pro Late 2011 | OSX.10 | | Apple MacBook Pro Late 2011 | OSX.10 | ||
- | | Pixel 3 | + | | Google |
- | | Essential Phone PH-1 | A9.0 | + | | Essential Phone PH-1 | A10 | Snapdragon 835 | Kryo (A73/53) | ARMv8A | AArch64 |
- | | Amazon Fire HD 10 2019 | A9.0 | Mediatek MT8183 | + | | Amazon Fire HD 10 2019 | A9.0 | Mediatek MT8183 |
| PC AMD A10-7870K | | PC AMD A10-7870K | ||
| Apple MacBook Pro Late 2013 | OSX.10 | | Apple MacBook Pro Late 2013 | OSX.10 | ||
| iPhone SE | iOS9.3 | | iPhone SE | iOS9.3 | ||
+ | | Chromebook Flip C101PA | ||
| NVIDIA SHIELD Tablet | | NVIDIA SHIELD Tablet | ||
| Apple iPad A8X | i8.0 | Apple A8X | Typhoon | | Apple iPad A8X | i8.0 | Apple A8X | Typhoon | ||
行 9236: | 行 9297: | ||
< | < | ||
+ | Date: 20200810 123729 | ||
ARCH: ARMv8A AArch64 | ARCH: ARMv8A AArch64 | ||
FPU : ASIMD(AArch64 NEON) | FPU : ASIMD(AArch64 NEON) | ||
- | Name: Qualcomm Technologies, | + | Name: Qualcomm Technologies, |
CPU Thread: | CPU Thread: | ||
行 9245: | 行 9307: | ||
Group 0: Thread= 4 Clock=1.900800 GHz (mask:f) | Group 0: Thread= 4 Clock=1.900800 GHz (mask:f) | ||
Group 1: Thread= 4 Clock=2.457600 GHz (mask:f0) | Group 1: Thread= 4 Clock=2.457600 GHz (mask:f0) | ||
- | NEON : yes | + | NEON |
- | FMA | + | FMA : yes |
- | FPHP : no | + | FPHP |
- | SIMDHP: no | + | SIMDHP |
+ | DotProd: no | ||
Total: | Total: | ||
SingleThread HP max: - | SingleThread HP max: - | ||
- | SingleThread SP max: 34.353 GFLOPS | + | SingleThread SP max: 19.547 GFLOPS |
- | SingleThread DP max: 17.178 GFLOPS | + | SingleThread DP max: 9.776 GFLOPS |
MultiThread | MultiThread | ||
- | MultiThread | + | MultiThread |
- | MultiThread | + | MultiThread |
Group 0: Thread=4 | Group 0: Thread=4 | ||
SingleThread HP max: - | SingleThread HP max: - | ||
- | SingleThread SP max: 14.806 GFLOPS | + | SingleThread SP max: 14.783 GFLOPS |
- | SingleThread DP max: 7.406 GFLOPS | + | SingleThread DP max: 7.401 GFLOPS |
MultiThread | MultiThread | ||
- | MultiThread | + | MultiThread |
- | MultiThread | + | MultiThread |
Group 1: Thread=4 | Group 1: Thread=4 | ||
SingleThread HP max: - | SingleThread HP max: - | ||
SingleThread SP max: | SingleThread SP max: | ||
- | SingleThread DP max: 9.772 GFLOPS | + | SingleThread DP max: 9.776 GFLOPS |
MultiThread | MultiThread | ||
- | MultiThread | + | MultiThread |
- | MultiThread | + | MultiThread |
* Group 0: Thread=1 | * Group 0: Thread=1 | ||
* FPU/NEON (SP fp) | * FPU/NEON (SP fp) | ||
- | | + | |
- | FPU fmul (32bit x1) n8 : 0.319 | + | FPU fmul (32bit x1) n8 : 0.335 |
- | FPU fadd (32bit x1) n8 : 0.334 | + | FPU fadd (32bit x1) n8 : 0.319 |
- | FPU fmadd (32bit x1) n8 | + | FPU fmadd (32bit x1) n8 |
- | NEON fmul.2s (32bit x2) n8 : 0.334 | + | NEON fmul.2s (32bit x2) n8 : 0.318 |
- | NEON fadd.2s (32bit x2) n8 : 0.335 | + | NEON fadd.2s (32bit x2) n8 : 0.318 |
- | NEON fmla.2s (32bit x2) n8 : 0.334 13676.0 | + | NEON fmla.2s (32bit x2) n8 : 0.318 14338.8 |
- | NEON fmul.4s (32bit x4) n8 : 0.623 | + | NEON fmul.4s (32bit x4) n8 : 0.622 |
- | NEON fadd.4s (32bit x4) n8 : 0.628 | + | NEON fadd.4s (32bit x4) n8 : 0.623 |
- | NEON fmla.4s (32bit x4) n8 : 0.622 14677.4 | + | NEON fmla.4s (32bit x4) n8 : 0.621 14685.5 |
- | FPU fmul (32bit x1) ns4 | + | FPU fmul (32bit x1) ns4 |
- | FPU fadd (32bit x1) ns4 | + | FPU fadd (32bit x1) ns4 |
- | FPU fmadd (32bit x1) ns4 : 0.685 | + | FPU fmadd (32bit x1) ns4 : 0.607 |
- | NEON fmul.2s (32bit x2) ns4 | + | NEON fmul.2s (32bit x2) ns4 |
- | NEON fadd.2s (32bit x2) ns4 | + | NEON fadd.2s (32bit x2) ns4 |
- | NEON fmla.2s (32bit x2) ns4 | + | NEON fmla.2s (32bit x2) ns4 |
- | NEON fmul.4s (32bit x4) ns4 | + | NEON fmul.4s (32bit x4) ns4 |
- | NEON fadd.4s (32bit x4) ns4 | + | NEON fadd.4s (32bit x4) ns4 |
- | NEON fmla.4s (32bit x4) ns4 | + | NEON fmla.4s (32bit x4) ns4 |
- | FPU fmul (32bit x1) n1 : 0.607 1878.9 1878.9 ( 1 1.0) | + | FPU fmul (32bit x1) n1 : 0.607 1877.9 1877.9 ( 1.0 1.0) |
- | FPU fadd (32bit x1) n1 : 0.606 | + | FPU fadd (32bit x1) n1 : 0.607 |
- | FPU fmadd (32bit x1) n1 | + | FPU fmadd (32bit x1) n1 |
- | NEON fmul.2s (32bit x2) n1 : 0.606 | + | NEON fmul.2s (32bit x2) n1 : 0.607 |
- | NEON fadd.2s (32bit x2) n1 : 0.606 | + | NEON fadd.2s (32bit x2) n1 : 0.607 |
- | NEON fmla.2s (32bit x2) n1 : 2.424 | + | NEON fmla.2s (32bit x2) n1 : 2.428 |
- | NEON fmul.4s (32bit x4) n1 : 0.621 | + | NEON fmul.4s (32bit x4) n1 : 0.623 |
- | NEON fadd.4s (32bit x4) n1 : 0.623 7321.4 | + | NEON fadd.4s (32bit x4) n1 : 0.623 7327.1 |
- | NEON fmla.4s (32bit x4) n1 : 2.428 | + | NEON fmla.4s (32bit x4) n1 : 2.429 |
- | NEON fmul.4s (32bit x4) n12 | + | NEON fmul.4s (32bit x4) n12 |
- | NEON fadd.4s (32bit x4) n12 | + | NEON fadd.4s (32bit x4) n12 |
- | NEON fmla.4s (32bit x4) n12 | + | NEON fmla.4s (32bit x4) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
* Group 0: Thread=1 | * Group 0: Thread=1 | ||
* FPU/NEON (DP fp) | * FPU/NEON (DP fp) | ||
- | | + | |
- | FPU fmul (64bit x1) n8 : 0.336 | + | FPU fmul (64bit x1) n8 : 0.318 |
- | FPU fadd (64bit x1) n8 : 0.323 | + | FPU fadd (64bit x1) n8 : 0.333 |
- | FPU fmadd (64bit x1) n8 | + | FPU fmadd (64bit x1) n8 |
- | NEON fmul.2d (64bit x2) n8 : 0.622 3664.8 | + | NEON fmul.2d (64bit x2) n8 : 0.622 3668.2 |
- | NEON fadd.2d (64bit x2) n8 : 0.626 | + | NEON fadd.2d (64bit x2) n8 : 0.622 |
- | NEON fmla.2d (64bit x2) n8 : 0.622 | + | NEON fmla.2d (64bit x2) n8 : 0.623 |
- | FPU fmul (64bit x1) ns4 | + | FPU fmul (64bit x1) ns4 |
- | FPU fadd (64bit x1) ns4 | + | FPU fadd (64bit x1) ns4 |
- | FPU fmadd (64bit x1) ns4 : 0.608 | + | FPU fmadd (64bit x1) ns4 : 0.684 |
- | NEON fmul.2d (64bit x2) ns4 | + | NEON fmul.2d (64bit x2) ns4 |
- | NEON fadd.2d (64bit x2) ns4 | + | NEON fadd.2d (64bit x2) ns4 |
- | NEON fmla.2d (64bit x2) ns4 | + | NEON fmla.2d (64bit x2) ns4 |
- | FPU fmul (64bit x1) n1 : 0.606 1881.6 | + | FPU fmul (64bit x1) n1 : 0.606 1880.8 |
- | FPU fadd (64bit x1) n1 : 0.606 | + | FPU fadd (64bit x1) n1 : 0.607 |
- | FPU fmadd (64bit x1) n1 | + | FPU fmadd (64bit x1) n1 |
- | NEON fmul.2d (64bit x2) n1 : 0.632 | + | NEON fmul.2d (64bit x2) n1 : 0.621 |
- | NEON fadd.2d (64bit x2) n1 : 0.620 | + | NEON fadd.2d (64bit x2) n1 : 0.621 |
- | NEON fmla.2d (64bit x2) n1 : 2.429 | + | NEON fmla.2d (64bit x2) n1 : 2.425 |
- | NEON fmul.2d (64bit x2) n12 | + | NEON fmul.2d (64bit x2) n12 |
- | NEON fadd.2d (64bit x2) n12 | + | NEON fadd.2d (64bit x2) n12 |
- | NEON fmla.2d (64bit x2) n12 | + | NEON fmla.2d (64bit x2) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
- | + | ||
- | + | ||
- | * Group 0: Thread=1 | + | |
- | * Matrix 4x4 | + | |
- | TIME(s) | + | |
- | C++ code : | + | |
- | NEON fmla.4s 128bit A : 0.714 | + | |
- | NEON fmla.4s 128bit B : | + | |
- | Average | + | |
- | Highest | + | |
* Group 0: Thread=4 | * Group 0: Thread=4 | ||
* FPU/NEON (SP fp) multi-thread | * FPU/NEON (SP fp) multi-thread | ||
- | | + | |
- | FPU fmul (32bit x1) n8 : 0.318 14335.7 | + | FPU fmul (32bit x1) n8 : 0.334 13666.0 |
- | FPU fadd (32bit x1) n8 : 0.333 13704.3 | + | FPU fadd (32bit x1) n8 : 0.320 14246.0 |
- | FPU fmadd (32bit x1) n8 | + | FPU fmadd (32bit x1) n8 |
- | NEON fmul.2s (32bit x2) n8 : 0.333 27405.8 | + | NEON fmul.2s (32bit x2) n8 : 0.319 28609.4 |
- | NEON fadd.2s (32bit x2) n8 : 0.334 27323.1 | + | NEON fadd.2s (32bit x2) n8 : 0.318 28688.6 |
- | NEON fmla.2s (32bit x2) n8 : 0.339 53820.0 | + | NEON fmla.2s (32bit x2) n8 : 0.318 57306.2 |
- | NEON fmul.4s (32bit x4) n8 : 0.622 29351.1 | + | NEON fmul.4s (32bit x4) n8 : 0.623 29292.0 |
- | NEON fadd.4s (32bit x4) n8 : 0.622 29337.7 | + | NEON fadd.4s (32bit x4) n8 : 0.623 29296.3 |
- | NEON fmla.4s (32bit x4) n8 : 0.621 58795.1 | + | NEON fmla.4s (32bit x4) n8 : 0.622 58721.2 |
- | FPU fmul (32bit x1) ns4 | + | FPU fmul (32bit x1) ns4 |
- | FPU fadd (32bit x1) ns4 | + | FPU fadd (32bit x1) ns4 |
- | FPU fmadd (32bit x1) ns4 : 0.681 13398.8 | + | FPU fmadd (32bit x1) ns4 : 0.609 14992.5 |
- | NEON fmul.2s (32bit x2) ns4 | + | NEON fmul.2s (32bit x2) ns4 |
- | NEON fadd.2s (32bit x2) ns4 | + | NEON fadd.2s (32bit x2) ns4 |
- | NEON fmla.2s (32bit x2) ns4 | + | NEON fmla.2s (32bit x2) ns4 |
- | NEON fmul.4s (32bit x4) ns4 | + | NEON fmul.4s (32bit x4) ns4 |
- | NEON fadd.4s (32bit x4) ns4 | + | NEON fadd.4s (32bit x4) ns4 |
- | NEON fmla.4s (32bit x4) ns4 | + | NEON fmla.4s (32bit x4) ns4 |
- | FPU fmul (32bit x1) n1 : 0.608 | + | FPU fmul (32bit x1) n1 : 0.609 |
- | FPU fadd (32bit x1) n1 : 0.606 | + | FPU fadd (32bit x1) n1 : 0.621 |
- | FPU fmadd (32bit x1) n1 | + | FPU fmadd (32bit x1) n1 |
- | NEON fmul.2s (32bit x2) n1 : 0.616 14812.3 | + | NEON fmul.2s (32bit x2) n1 : 0.608 15003.8 |
- | NEON fadd.2s (32bit x2) n1 : 0.605 15074.0 | + | NEON fadd.2s (32bit x2) n1 : 0.607 15024.3 |
- | NEON fmla.2s (32bit x2) n1 : 2.433 | + | NEON fmla.2s (32bit x2) n1 : 2.425 |
- | NEON fmul.4s (32bit x4) n1 : 0.630 28977.6 | + | NEON fmul.4s (32bit x4) n1 : 0.621 29364.2 |
- | NEON fadd.4s (32bit x4) n1 : 0.622 29345.1 | + | NEON fadd.4s (32bit x4) n1 : 0.623 29273.6 |
- | NEON fmla.4s (32bit x4) n1 : 2.434 14996.4 | + | NEON fmla.4s (32bit x4) n1 : 2.431 15015.4 |
- | NEON fmul.4s (32bit x4) n12 | + | NEON fmul.4s (32bit x4) n12 |
- | NEON fadd.4s (32bit x4) n12 | + | NEON fadd.4s (32bit x4) n12 |
- | NEON fmla.4s (32bit x4) n12 | + | NEON fmla.4s (32bit x4) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
* Group 0: Thread=4 | * Group 0: Thread=4 | ||
* FPU/NEON (DP fp) multi-thread | * FPU/NEON (DP fp) multi-thread | ||
- | | + | |
- | FPU fmul (64bit x1) n8 : 0.336 13593.7 | + | FPU fmul (64bit x1) n8 : 0.321 14232.9 |
- | FPU fadd (64bit x1) n8 : 0.323 14139.0 3534.7 ( 4 1.9) 14139.0 | + | FPU fadd (64bit x1) n8 : 0.333 13683.0 3420.8 |
- | FPU fmadd (64bit x1) n8 | + | FPU fmadd (64bit x1) n8 |
- | NEON fmul.2d (64bit x2) n8 : 0.621 14688.0 | + | NEON fmul.2d (64bit x2) n8 : 0.622 14665.8 |
- | NEON fadd.2d (64bit x2) n8 : 0.621 14690.9 | + | NEON fadd.2d (64bit x2) n8 : 0.622 14673.4 |
- | NEON fmla.2d (64bit x2) n8 : 0.621 29407.3 | + | NEON fmla.2d (64bit x2) n8 : 0.623 29311.8 |
- | FPU fmul (64bit x1) ns4 | + | FPU fmul (64bit x1) ns4 |
- | FPU fadd (64bit x1) ns4 | + | FPU fadd (64bit x1) ns4 |
- | FPU fmadd (64bit x1) ns4 : 0.606 15057.1 | + | FPU fmadd (64bit x1) ns4 : 0.685 13321.0 |
- | NEON fmul.2d (64bit x2) ns4 | + | NEON fmul.2d (64bit x2) ns4 |
- | NEON fadd.2d (64bit x2) ns4 | + | NEON fadd.2d (64bit x2) ns4 |
- | NEON fmla.2d (64bit x2) ns4 | + | NEON fmla.2d (64bit x2) ns4 |
- | FPU fmul (64bit x1) n1 : 0.605 | + | FPU fmul (64bit x1) n1 : 0.607 |
- | FPU fadd (64bit x1) n1 : 0.605 | + | FPU fadd (64bit x1) n1 : 0.608 |
- | FPU fmadd (64bit x1) n1 | + | FPU fmadd (64bit x1) n1 |
- | NEON fmul.2d (64bit x2) n1 : 0.621 14702.7 | + | NEON fmul.2d (64bit x2) n1 : 0.625 14589.9 |
- | NEON fadd.2d (64bit x2) n1 : 0.621 | + | NEON fadd.2d (64bit x2) n1 : 0.621 |
- | NEON fmla.2d (64bit x2) n1 : 2.424 | + | NEON fmla.2d (64bit x2) n1 : 2.427 |
- | NEON fmul.2d (64bit x2) n12 | + | NEON fmul.2d (64bit x2) n12 |
- | NEON fadd.2d (64bit x2) n12 | + | NEON fadd.2d (64bit x2) n12 |
- | NEON fmla.2d (64bit x2) n12 | + | NEON fmla.2d (64bit x2) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
- | + | ||
- | + | ||
- | * Group 0: Thread=4 | + | |
- | * Matrix 4x4 multi-thread | + | |
- | TIME(s) | + | |
- | C++ code : 0.436 15639.0 | + | |
- | NEON fmla.4s 128bit A : | + | |
- | NEON fmla.4s 128bit B : 0.348 19563.4 | + | |
- | Average | + | |
- | Highest | + | |
* Group 1: Thread=1 | * Group 1: Thread=1 | ||
* FPU/NEON (SP fp) | * FPU/NEON (SP fp) | ||
- | | + | |
- | FPU fmul (32bit x1) n8 : 0.317 4653.5 | + | FPU fmul (32bit x1) n8 : 0.317 4649.7 |
- | FPU fadd (32bit x1) n8 : 0.317 4654.9 4654.9 ( 1 1.9) | + | FPU fadd (32bit x1) n8 : 0.317 4650.9 4650.9 ( 1.0 1.9) |
- | FPU fmadd (32bit x1) n8 | + | FPU fmadd (32bit x1) n8 |
- | NEON fmul.2s (32bit x2) n8 : 0.317 | + | NEON fmul.2s (32bit x2) n8 : 0.318 |
- | NEON fadd.2s (32bit x2) n8 : 0.317 9309.8 4654.9 ( 2 1.9) | + | NEON fadd.2s (32bit x2) n8 : 0.317 9308.3 4654.2 |
- | NEON fmla.2s (32bit x2) n8 : 0.329 17927.5 | + | NEON fmla.2s (32bit x2) n8 : 0.317 18615.9 |
- | NEON fmul.4s (32bit x4) n8 : 0.603 9774.3 2443.6 ( 4 1.0) | + | NEON fmul.4s (32bit x4) n8 : 0.603 9773.7 2443.4 |
- | NEON fadd.4s (32bit x4) n8 : 0.603 | + | NEON fadd.4s (32bit x4) n8 : 0.604 |
- | NEON fmla.4s (32bit x4) n8 : 0.603 19546.8 2443.4 ( 8 1.0) 19546.8 | + | NEON fmla.4s (32bit x4) n8 : 0.604 19545.9 2443.2 |
- | FPU fmul (32bit x1) ns4 | + | FPU fmul (32bit x1) ns4 |
- | FPU fadd (32bit x1) ns4 | + | FPU fadd (32bit x1) ns4 |
- | FPU fmadd (32bit x1) ns4 : 0.755 | + | FPU fmadd (32bit x1) ns4 : 0.754 |
- | NEON fmul.2s (32bit x2) ns4 | + | NEON fmul.2s (32bit x2) ns4 |
- | NEON fadd.2s (32bit x2) ns4 | + | NEON fadd.2s (32bit x2) ns4 |
- | NEON fmla.2s (32bit x2) ns4 | + | NEON fmla.2s (32bit x2) ns4 |
- | NEON fmul.4s (32bit x4) ns4 | + | NEON fmul.4s (32bit x4) ns4 |
- | NEON fadd.4s (32bit x4) ns4 | + | NEON fadd.4s (32bit x4) ns4 |
- | NEON fmla.4s (32bit x4) ns4 | + | NEON fmla.4s (32bit x4) ns4 |
- | FPU fmul (32bit x1) n1 : 0.317 4648.8 | + | FPU fmul (32bit x1) n1 : 0.317 4653.7 |
- | FPU fadd (32bit x1) n1 : 0.317 4652.1 | + | FPU fadd (32bit x1) n1 : 0.317 4653.3 |
- | FPU fmadd (32bit x1) n1 | + | FPU fmadd (32bit x1) n1 |
- | NEON fmul.2s (32bit x2) n1 : 0.317 9305.3 | + | NEON fmul.2s (32bit x2) n1 : 0.317 9311.6 |
- | NEON fadd.2s (32bit x2) n1 : 0.317 9307.2 | + | NEON fadd.2s (32bit x2) n1 : 0.317 9309.8 |
- | NEON fmla.2s (32bit x2) n1 : 1.810 3258.0 814.5 ( 4 0.3) | + | NEON fmla.2s (32bit x2) n1 : 1.810 3258.4 814.6 |
- | NEON fmul.4s (32bit x4) n1 : 0.604 9771.8 | + | NEON fmul.4s (32bit x4) n1 : 0.604 9772.2 |
- | NEON fadd.4s (32bit x4) n1 : 0.603 | + | NEON fadd.4s (32bit x4) n1 : 0.604 |
- | NEON fmla.4s (32bit x4) n1 : 1.811 6513.9 814.2 ( 8 0.3) | + | NEON fmla.4s (32bit x4) n1 : 1.811 6514.2 814.3 |
- | NEON fmul.4s (32bit x4) n12 | + | NEON fmul.4s (32bit x4) n12 |
- | NEON fadd.4s (32bit x4) n12 | + | NEON fadd.4s (32bit x4) n12 |
- | NEON fmla.4s (32bit x4) n12 | + | NEON fmla.4s (32bit x4) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
* Group 1: Thread=1 | * Group 1: Thread=1 | ||
* FPU/NEON (DP fp) | * FPU/NEON (DP fp) | ||
- | | + | |
- | FPU fmul (64bit x1) n8 : 0.317 4652.4 | + | FPU fmul (64bit x1) n8 : 0.317 4650.1 |
- | FPU fadd (64bit x1) n8 : 0.317 4652.7 | + | FPU fadd (64bit x1) n8 : 0.317 4650.3 |
- | FPU fmadd (64bit x1) n8 | + | FPU fmadd (64bit x1) n8 |
- | NEON fmul.2d (64bit x2) n8 : 0.604 | + | NEON fmul.2d (64bit x2) n8 : 0.603 |
- | NEON fadd.2d (64bit x2) n8 : 0.604 | + | NEON fadd.2d (64bit x2) n8 : 0.603 |
- | NEON fmla.2d (64bit x2) n8 : 0.604 | + | NEON fmla.2d (64bit x2) n8 : 0.603 |
- | FPU fmul (64bit x1) ns4 | + | FPU fmul (64bit x1) ns4 |
- | FPU fadd (64bit x1) ns4 | + | FPU fadd (64bit x1) ns4 |
- | FPU fmadd (64bit x1) ns4 : 0.528 | + | FPU fmadd (64bit x1) ns4 : 0.531 |
- | NEON fmul.2d (64bit x2) ns4 | + | NEON fmul.2d (64bit x2) ns4 |
- | NEON fadd.2d (64bit x2) ns4 | + | NEON fadd.2d (64bit x2) ns4 |
- | NEON fmla.2d (64bit x2) ns4 | + | NEON fmla.2d (64bit x2) ns4 |
- | FPU fmul (64bit x1) n1 : 0.317 4653.7 | + | FPU fmul (64bit x1) n1 : 0.317 4654.4 |
- | FPU fadd (64bit x1) n1 : 0.317 | + | FPU fadd (64bit x1) n1 : 0.317 |
- | FPU fmadd (64bit x1) n1 | + | FPU fmadd (64bit x1) n1 |
- | NEON fmul.2d (64bit x2) n1 : 0.604 | + | NEON fmul.2d (64bit x2) n1 : 0.603 |
- | NEON fadd.2d (64bit x2) n1 : 0.604 | + | NEON fadd.2d (64bit x2) n1 : 0.603 |
- | NEON fmla.2d (64bit x2) n1 : 1.811 | + | NEON fmla.2d (64bit x2) n1 : 1.810 |
- | NEON fmul.2d (64bit x2) n12 | + | NEON fmul.2d (64bit x2) n12 |
- | NEON fadd.2d (64bit x2) n12 | + | NEON fadd.2d (64bit x2) n12 |
- | NEON fmla.2d (64bit x2) n12 | + | NEON fmla.2d (64bit x2) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
- | + | ||
- | + | ||
- | * Group 1: Thread=1 | + | |
- | * Matrix 4x4 | + | |
- | TIME(s) | + | |
- | C++ code : 0.360 | + | |
- | NEON fmla.4s 128bit A : | + | |
- | NEON fmla.4s 128bit B : | + | |
- | Average | + | |
- | Highest | + | |
* Group 1: Thread=4 | * Group 1: Thread=4 | ||
* FPU/NEON (SP fp) multi-thread | * FPU/NEON (SP fp) multi-thread | ||
- | | + | |
- | FPU fmul (32bit x1) n8 : 0.455 12977.3 | + | FPU fmul (32bit x1) n8 : 0.335 17600.7 |
- | FPU fadd (32bit x1) n8 : 0.494 11946.6 | + | FPU fadd (32bit x1) n8 : 0.329 17915.7 |
- | FPU fmadd (32bit x1) n8 | + | FPU fmadd (32bit x1) n8 |
- | NEON fmul.2s (32bit x2) n8 : 0.479 24612.2 | + | NEON fmul.2s (32bit x2) n8 : 0.329 35832.1 |
- | NEON fadd.2s (32bit x2) n8 : 0.476 24783.8 | + | NEON fadd.2s (32bit x2) n8 : 0.329 35831.5 |
- | NEON fmla.2s (32bit x2) n8 : 0.479 49204.8 | + | NEON fmla.2s (32bit x2) n8 : 0.329 71648.0 |
- | NEON fmul.4s (32bit x4) n8 : 0.942 25056.7 | + | NEON fmul.4s (32bit x4) n8 : 0.627 37622.6 |
- | NEON fadd.4s (32bit x4) n8 : 0.941 25058.9 | + | NEON fadd.4s (32bit x4) n8 : 0.627 37624.2 |
- | NEON fmla.4s (32bit x4) n8 : 0.925 51022.2 | + | NEON fmla.4s (32bit x4) n8 : 0.627 75249.0 |
- | FPU fmul (32bit x1) ns4 | + | FPU fmul (32bit x1) ns4 |
- | FPU fadd (32bit x1) ns4 | + | FPU fadd (32bit x1) ns4 |
- | FPU fmadd (32bit x1) ns4 : | + | FPU fmadd (32bit x1) ns4 : |
- | NEON fmul.2s (32bit x2) ns4 | + | NEON fmul.2s (32bit x2) ns4 |
- | NEON fadd.2s (32bit x2) ns4 | + | NEON fadd.2s (32bit x2) ns4 |
- | NEON fmla.2s (32bit x2) ns4 | + | NEON fmla.2s (32bit x2) ns4 |
- | NEON fmul.4s (32bit x4) ns4 | + | NEON fmul.4s (32bit x4) ns4 |
- | NEON fadd.4s (32bit x4) ns4 | + | NEON fadd.4s (32bit x4) ns4 |
- | NEON fmla.4s (32bit x4) ns4 | + | NEON fmla.4s (32bit x4) ns4 |
- | FPU fmul (32bit x1) n1 : 0.403 14651.9 | + | FPU fmul (32bit x1) n1 : 0.329 17914.3 |
- | FPU fadd (32bit x1) n1 : 0.434 13582.6 | + | FPU fadd (32bit x1) n1 : 0.329 17914.5 |
- | FPU fmadd (32bit x1) n1 | + | FPU fmadd (32bit x1) n1 |
- | NEON fmul.2s (32bit x2) n1 : 0.412 28620.6 | + | NEON fmul.2s (32bit x2) n1 : 0.329 35829.5 |
- | NEON fadd.2s (32bit x2) n1 : 0.384 30747.3 3843.4 ( 8 1.6) 30747.3 | + | NEON fadd.2s (32bit x2) n1 : 0.329 35830.3 4478.8 |
- | NEON fmla.2s (32bit x2) n1 : | + | NEON fmla.2s (32bit x2) n1 : |
- | NEON fmul.4s (32bit x4) n1 : 0.684 34490.5 | + | NEON fmul.4s (32bit x4) n1 : 0.627 37625.7 |
- | NEON fadd.4s (32bit x4) n1 : 0.636 37116.4 | + | NEON fadd.4s (32bit x4) n1 : 0.627 37623.9 |
- | NEON fmla.4s (32bit x4) n1 : 1.902 24810.3 775.3 ( 32 0.3) 24810.3 | + | NEON fmla.4s (32bit x4) n1 : 1.881 25082.9 783.8 |
- | NEON fmul.4s (32bit x4) n12 | + | NEON fmul.4s (32bit x4) n12 |
- | NEON fadd.4s (32bit x4) n12 | + | NEON fadd.4s (32bit x4) n12 |
- | NEON fmla.4s (32bit x4) n12 | + | NEON fmla.4s (32bit x4) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
* Group 1: Thread=4 | * Group 1: Thread=4 | ||
* FPU/NEON (DP fp) multi-thread | * FPU/NEON (DP fp) multi-thread | ||
- | | + | |
- | FPU fmul (64bit x1) n8 : | + | FPU fmul (64bit x1) n8 : |
- | FPU fadd (64bit x1) n8 : | + | FPU fadd (64bit x1) n8 : |
- | FPU fmadd (64bit x1) n8 | + | FPU fmadd (64bit x1) n8 |
- | NEON fmul.2d (64bit x2) n8 : | + | NEON fmul.2d (64bit x2) n8 : |
- | NEON fadd.2d (64bit x2) n8 : | + | NEON fadd.2d (64bit x2) n8 : |
- | NEON fmla.2d (64bit x2) n8 : | + | NEON fmla.2d (64bit x2) n8 : |
- | FPU fmul (64bit x1) ns4 | + | FPU fmul (64bit x1) ns4 |
- | FPU fadd (64bit x1) ns4 | + | FPU fadd (64bit x1) ns4 |
- | FPU fmadd (64bit x1) ns4 : | + | FPU fmadd (64bit x1) ns4 : |
- | NEON fmul.2d (64bit x2) ns4 | + | NEON fmul.2d (64bit x2) ns4 |
- | NEON fadd.2d (64bit x2) ns4 | + | NEON fadd.2d (64bit x2) ns4 |
- | NEON fmla.2d (64bit x2) ns4 | + | NEON fmla.2d (64bit x2) ns4 |
- | FPU fmul (64bit x1) n1 : | + | FPU fmul (64bit x1) n1 : |
- | FPU fadd (64bit x1) n1 : | + | FPU fadd (64bit x1) n1 : |
- | FPU fmadd (64bit x1) n1 | + | FPU fmadd (64bit x1) n1 |
- | NEON fmul.2d (64bit x2) n1 : 0.821 14364.5 | + | NEON fmul.2d (64bit x2) n1 : 0.627 18812.4 |
- | NEON fadd.2d (64bit x2) n1 : 0.937 12586.2 | + | NEON fadd.2d (64bit x2) n1 : 0.627 18813.0 |
- | NEON fmla.2d (64bit x2) n1 : | + | NEON fmla.2d (64bit x2) n1 : |
- | NEON fmul.2d (64bit x2) n12 | + | NEON fmul.2d (64bit x2) n12 |
- | NEON fadd.2d (64bit x2) n12 | + | NEON fadd.2d (64bit x2) n12 |
- | NEON fmla.2d (64bit x2) n12 | + | NEON fmla.2d (64bit x2) n12 |
- | Average | + | Average |
- | Highest | + | Highest |
- | + | ||
- | + | ||
- | * Group 1: Thread=4 | + | |
- | * Matrix 4x4 multi-thread | + | |
- | TIME(s) | + | |
- | C++ code | + | |
- | NEON fmla.4s 128bit A : | + | |
- | NEON fmla.4s 128bit B : | + | |
- | Average | + | |
- | Highest | + | |
行 11786: | 行 11808: | ||
< | < | ||
- | ARCH: ARMv7A | + | Date: 20200808 172338 |
- | FPU: VFPv3-D32 NEON | + | ARCH: ARMv7A |
- | SingleT SP max: 16.066 GFLOPS | + | FPU : VFPv4-D32 NEON |
- | SingleT DP max: 8.027 GFLOPS | + | Name: RK3399 ChromebookFlipC101PA |
- | MultiT | + | CPU Thread: 6 |
- | MultiT | + | CPU Core : |
- | CPU core: 2 | + | CPU Group : 2 |
- | NEON: yes | + | Group 0: Thread= 4 Clock=1.512000 GHz (mask:f) |
- | FMA : no | + | Group 1: Thread= |
+ | NEON | ||
+ | FMA : yes | ||
+ | FPHP : no | ||
+ | SIMDHP : no | ||
+ | DotProd: no | ||
- | * VFP/NEON (single fp) | + | Total: |
- | | + | SingleThread HP max: - |
- | VFP fmuls (32bit x1) n8 : | + | SingleThread SP max: 16.062 GFLOPS |
- | VFP fadds (32bit x1) n8 : | + | SingleThread DP max: 8.030 GFLOPS |
- | VFP fmacs (32bit x1) n8 : | + | MultiThread |
- | VFP vfma.f32 (32bit x1) n8 : - - - - | + | MultiThread |
- | NEON vmul.f32 (32bit x2) n8 : | + | MultiThread |
- | NEON vadd.f32 (32bit x2) n8 | + | |
- | NEON vmla.f32 (32bit x2) n8 : | + | |
- | NEON vfma.f32 (32bit x2) n8 : | + | |
- | NEON vmul.f32 (32bit x4) n8 : | + | |
- | NEON vadd.f32 (32bit x4) n8 : | + | |
- | NEON vmla.f32 (32bit x4) n8 : | + | |
- | NEON vfma.f32 (32bit x4) n8 : | + | |
- | VFP fmuls (32bit x1) ns4 : 0.598 | + | |
- | VFP fadds (32bit x1) ns4 | + | |
- | VFP fmacs (32bit x1) ns4 : 1.046 | + | |
- | VFP vfma.f32 (32bit x1) ns4 : | + | |
- | NEON vmul.f32 (32bit x2) ns4 : | + | |
- | NEON vadd.f32 (32bit x2) ns4 : 0.597 | + | |
- | NEON vmla.f32 (32bit x2) ns4 : 1.046 | + | |
- | NEON vfma.f32 (32bit x2) ns4 : - - - - - | + | |
- | NEON vmul.f32 (32bit x4) ns4 : 0.597 | + | |
- | NEON vadd.f32 (32bit x4) ns4 : 0.597 | + | |
- | NEON vmla.f32 (32bit x4) ns4 : 1.046 | + | |
- | NEON vfma.f32 (32bit x4) ns4 : - - - - - | + | |
- | VFP fmuls (32bit x1) n1 : | + | |
- | VFP fadds (32bit x1) n1 : | + | |
- | VFP fmacs (32bit x1) n1 : | + | |
- | VFP vfma.f32 (32bit x1) n1 : - - - - - | + | |
- | NEON vmul.f32 (32bit x2) n1 | + | |
- | NEON vadd.f32 (32bit x2) n1 : | + | |
- | NEON vmla.f32 (32bit x2) n1 : | + | |
- | NEON vfma.f32 (32bit x2) n1 : | + | |
- | NEON vmul.f32 (32bit x4) n1 : | + | |
- | NEON vadd.f32 (32bit x4) n1 : | + | |
- | NEON vmla.f32 (32bit x4) n1 : | + | |
- | NEON vfma.f32 (32bit x4) n1 : | + | |
- | NEON vmul.f32 (32bit x4) n12 : 0.896 | + | |
- | NEON vadd.f32 (32bit x4) n12 : 0.896 | + | |
- | NEON vmla.f32 (32bit x4) n12 : 0.896 16066.1 | + | |
- | NEON vfma.f32 (32bit x4) n12 : - - - - - | + | |
- | Average | + | |
- | Highest | + | |
+ | Group 0: Thread=4 | ||
+ | SingleThread HP max: - | ||
+ | SingleThread SP max: | ||
+ | SingleThread DP max: 5.459 GFLOPS | ||
+ | MultiThread | ||
+ | MultiThread | ||
+ | MultiThread | ||
- | * VFP/NEON (double fp) | + | Group 1: |
- | TIME(s) | + | |
- | VFP fmuld (64bit x1) n8 : | + | |
- | VFP faddd (64bit x1) n8 : | + | SingleThread DP max: |
- | VFP fmacd (64bit x1) n8 : | + | |
- | VFP vfma.f64 (64bit x1) n8 : - - - - - | + | |
- | VFP fmuld (64bit x1) ns4 : 0.598 | + | MultiThread |
- | VFP faddd (64bit x1) ns4 | + | |
- | VFP fmacd (64bit x1) ns4 : 1.046 | + | |
- | VFP vfma.f64 (64bit x1) ns4 : | + | |
- | VFP fmuld (64bit x1) n1 : 0.301 | + | |
- | VFP faddd (64bit x1) n1 : | + | |
- | VFP fmacd (64bit x1) n1 : | + | |
- | VFP vfma.f64 (64bit x1) n1 | + | |
- | Average | + | |
- | Highest | + | |
- | * Matrix 4x4 | + | * Group 0: Thread=1 |
- | TIME(s) | + | * VFP/NEON (SP fp) |
- | C++ code | + | TIME(s) |
- | NEON vmla 128bit A | + | VFP fmuls (32bit x1) n8 : 0.364 |
- | NEON vmla 64bit A | + | VFP fadds (32bit x1) n8 : |
- | NEON vfma 128bit A | + | VFP fmacs (32bit x1) n8 : |
- | NEON vmla 128bit B | + | VFP vfma.f32 (32bit x1) n8 : - - - |
- | NEON vmla | + | NEON vmul.f32 (32bit x2) n8 : 0.318 |
- | NEON vfma 128bit B | + | NEON vadd.f32 (32bit x2) n8 : |
- | NEON vfma 128bit C | + | NEON vmla.f32 (32bit x2) n8 : 0.580 |
- | Average | + | NEON vfma.f32 (32bit x2) n8 : |
- | Highest | + | NEON vmul.f32 (32bit x4) n8 : |
+ | NEON vadd.f32 (32bit x4) n8 : | ||
+ | NEON vmla.f32 (32bit x4) n8 : | ||
+ | NEON vfma.f32 (32bit x4) n8 : - - - - - | ||
+ | VFP fmuls (32bit x1) ns4 : 0.606 | ||
+ | VFP fadds (32bit x1) ns4 : 0.607 | ||
+ | VFP fmacs (32bit x1) ns4 : 1.210 | ||
+ | VFP vfma.f32 (32bit x1) ns4 : | ||
+ | NEON vmul.f32 (32bit x2) ns4 | ||
+ | NEON vadd.f32 (32bit x2) ns4 : 0.605 | ||
+ | NEON vmla.f32 (32bit x2) ns4 : | ||
+ | NEON vfma.f32 (32bit x2) ns4 : - - - | ||
+ | NEON vmul.f32 (32bit x4) ns4 : 0.620 | ||
+ | NEON vadd.f32 (32bit x4) ns4 : 0.619 | ||
+ | NEON vmla.f32 (32bit x4) ns4 : 1.209 | ||
+ | NEON vfma.f32 (32bit x4) ns4 | ||
+ | VFP fmuls (32bit x1) n1 : | ||
+ | VFP fadds (32bit x1) n1 : | ||
+ | VFP fmacs (32bit x1) n1 : | ||
+ | VFP vfma.f32 (32bit x1) n1 : | ||
+ | NEON vmul.f32 (32bit x2) n1 : | ||
+ | NEON vadd.f32 (32bit x2) n1 : | ||
+ | NEON vmla.f32 (32bit x2) n1 : | ||
+ | NEON vfma.f32 (32bit x2) n1 : - - - | ||
+ | NEON vmul.f32 (32bit x4) n1 : 0.619 | ||
+ | NEON vadd.f32 (32bit x4) n1 : | ||
+ | NEON vmla.f32 | ||
+ | NEON vfma.f32 (32bit x4) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n12 : 0.922 | ||
+ | NEON vadd.f32 (32bit x4) n12 : 0.922 5903.3 1475.8 | ||
+ | NEON vmla.f32 (32bit x4) n12 : 0.923 11800.8 | ||
+ | NEON vfma.f32 (32bit x4) n12 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | * VFP/NEON (single | + | * Group 0: Thread=1 |
- | TIME(s) | + | * VFP/NEON (DP fp) |
- | VFP fmuls (32bit x1) n8 : | + | TIME(s) |
- | VFP fadds (32bit x1) n8 : | + | VFP fmuld (64bit x1) n8 : |
- | VFP fmacs (32bit x1) n8 : | + | VFP faddd (64bit x1) n8 : |
- | VFP vfma.f32 | + | VFP fmacd (64bit x1) n8 |
- | NEON vmul.f32 (32bit x2) n8 : 0.300 15996.9 | + | VFP vfma.f64 (64bit x1) n8 : - - - |
- | NEON vadd.f32 (32bit x2) n8 : | + | VFP fmuld (64bit x1) ns4 : 0.604 |
- | NEON vmla.f32 (32bit x2) n8 : | + | VFP faddd (64bit x1) ns4 : 0.604 |
- | NEON vfma.f32 (32bit x2) n8 : | + | VFP fmacd (64bit x1) ns4 : 1.218 |
- | NEON vmul.f32 (32bit x4) n8 : | + | VFP vfma.f64 (64bit x1) ns4 |
- | NEON vadd.f32 (32bit x4) n8 : | + | VFP fmuld (64bit x1) n1 : |
- | NEON vmla.f32 (32bit x4) n8 : 0.601 31941.5 | + | VFP faddd (64bit x1) n1 : |
- | NEON vfma.f32 (32bit x4) n8 : - - - - - | + | VFP fmacd (64bit x1) n1 |
- | VFP fmuls (32bit x1) ns4 : 0.599 | + | VFP vfma.f64 (64bit x1) n1 : - - - |
- | VFP fadds (32bit x1) ns4 : 0.606 | + | Average |
- | VFP fmacs (32bit x1) ns4 : | + | Highest |
- | VFP vfma.f32 (32bit x1) ns4 : | + | |
- | NEON vmul.f32 (32bit x2) ns4 : 0.599 | + | |
- | NEON vadd.f32 (32bit x2) ns4 : 0.601 | + | |
- | NEON vmla.f32 | + | |
- | NEON vfma.f32 (32bit x2) ns4 : - - - - - | + | |
- | NEON vmul.f32 (32bit x4) ns4 : 0.599 16014.8 2001.9 ( 8 1.0) | + | |
- | NEON vadd.f32 | + | |
- | NEON vmla.f32 (32bit x4) ns4 : 1.049 18307.6 | + | |
- | NEON vfma.f32 (32bit x4) ns4 : - - - - - | + | |
- | VFP fmuls (32bit x1) n1 : | + | |
- | VFP fadds (32bit x1) n1 : | + | |
- | VFP fmacs (32bit x1) n1 : | + | |
- | VFP vfma.f32 | + | |
- | NEON vmul.f32 (32bit x2) n1 : | + | |
- | NEON vadd.f32 (32bit x2) n1 : | + | |
- | NEON vmla.f32 (32bit x2) n1 : | + | |
- | NEON vfma.f32 (32bit x2) n1 | + | |
- | NEON vmul.f32 (32bit x4) n1 : 0.602 15955.7 | + | |
- | NEON vadd.f32 (32bit x4) n1 : | + | |
- | NEON vmla.f32 (32bit x4) n1 : | + | |
- | NEON vfma.f32 (32bit x4) n1 : | + | |
- | NEON vmul.f32 (32bit x4) n12 | + | |
- | NEON vadd.f32 (32bit x4) n12 : 0.900 15994.6 | + | |
- | NEON vmla.f32 (32bit x4) n12 : 0.898 32053.6 | + | |
- | NEON vfma.f32 (32bit x4) n12 : - - - - - | + | |
- | Average | + | |
- | Highest | + | |
- | * VFP/NEON (double | + | * Group 0: Thread=4 |
- | TIME(s) | + | * VFP/NEON (SP fp) multi-thread |
- | VFP fmuld (64bit x1) n8 : | + | TIME(s) |
- | VFP faddd (64bit x1) n8 : | + | VFP fmuls (32bit x1) n8 : |
- | VFP fmacd (64bit x1) n8 : | + | VFP fadds (32bit x1) n8 : |
- | VFP vfma.f64 (64bit x1) n8 : - - - - - | + | VFP fmacs (32bit x1) n8 : |
- | VFP fmuld (64bit x1) ns4 : 0.600 | + | VFP vfma.f32 |
- | VFP faddd (64bit x1) ns4 : 0.605 | + | NEON vmul.f32 (32bit x2) n8 : 0.322 22549.4 |
- | VFP fmacd (64bit x1) ns4 : 1.051 | + | NEON vadd.f32 (32bit x2) n8 : |
- | VFP vfma.f64 (64bit x1) ns4 : | + | NEON vmla.f32 (32bit x2) n8 : |
- | VFP fmuld (64bit x1) n1 : 0.300 | + | NEON vfma.f32 |
- | VFP faddd (64bit x1) n1 : | + | NEON vmul.f32 (32bit x4) n8 : 0.626 23194.3 |
- | VFP fmacd (64bit x1) n1 : | + | NEON vadd.f32 (32bit x4) n8 : |
- | VFP vfma.f64 (64bit x1) n1 : - - - - - | + | NEON vmla.f32 (32bit x4) n8 |
- | Average | + | NEON vfma.f32 (32bit x4) n8 |
- | Highest | + | VFP fmuls (32bit x1) ns4 : 0.613 |
+ | VFP fadds (32bit x1) ns4 : 0.608 | ||
+ | VFP fmacs (32bit x1) ns4 : 1.219 | ||
+ | VFP vfma.f32 (32bit x1) ns4 : | ||
+ | NEON vmul.f32 | ||
+ | NEON vadd.f32 (32bit x2) ns4 : 0.607 11947.7 | ||
+ | NEON vmla.f32 | ||
+ | NEON vfma.f32 (32bit x2) ns4 : - - - | ||
+ | NEON vmul.f32 (32bit x4) ns4 : 0.623 23296.9 | ||
+ | NEON vadd.f32 (32bit x4) ns4 : 0.623 23288.7 | ||
+ | NEON vmla.f32 (32bit x4) ns4 : 1.214 23903.3 | ||
+ | NEON vfma.f32 (32bit x4) ns4 : - - - | ||
+ | VFP fmuls (32bit x1) n1 : | ||
+ | VFP fadds (32bit x1) n1 : | ||
+ | VFP fmacs (32bit x1) n1 : | ||
+ | VFP vfma.f32 (32bit x1) n1 : - - - | ||
+ | NEON vmul.f32 (32bit x2) n1 : 0.609 11916.6 | ||
+ | NEON vadd.f32 (32bit x2) n1 : | ||
+ | NEON vmla.f32 (32bit x2) n1 : | ||
+ | NEON vfma.f32 (32bit x2) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n1 : 0.626 23197.0 | ||
+ | NEON vadd.f32 (32bit x4) n1 : | ||
+ | NEON vmla.f32 (32bit x4) n1 : | ||
+ | NEON vfma.f32 (32bit x4) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n12 : 0.929 23441.0 | ||
+ | NEON vadd.f32 (32bit x4) n12 : 0.930 23401.3 | ||
+ | NEON vmla.f32 (32bit x4) n12 : 0.928 46918.9 | ||
+ | NEON vfma.f32 (32bit x4) n12 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | * Matrix 4x4 multi-thread | + | * Group 0: Thread=4 |
- | TIME(s) | + | * VFP/NEON (DP fp) multi-thread |
- | C++ code | + | TIME(s) |
- | NEON vmla 128bit A | + | VFP fmuld (64bit x1) n8 : 0.354 10241.0 |
- | NEON vmla | + | VFP faddd (64bit x1) n8 : 0.320 11325.3 |
- | NEON vfma 128bit A | + | VFP fmacd (64bit x1) n8 : 0.334 21746.4 |
- | NEON vmla 128bit B | + | VFP vfma.f64 (64bit x1) n8 |
- | NEON vmla | + | VFP fmuld (64bit x1) ns4 |
- | NEON vfma 128bit B | + | VFP faddd (64bit x1) ns4 |
- | NEON vfma 128bit C | + | VFP fmacd (64bit x1) ns4 : 1.224 |
- | Average | + | VFP vfma.f64 (64bit x1) ns4 : - - - |
- | Highest | + | VFP fmuld (64bit x1) n1 : |
+ | VFP faddd (64bit x1) n1 : | ||
+ | VFP fmacd (64bit x1) n1 : | ||
+ | VFP vfma.f64 (64bit x1) n1 | ||
+ | Average | ||
+ | Highest | ||
- | cpu0 1512000 408000 | + | * Group 1: Thread=1 |
- | cpu1 1512000 408000 | + | * VFP/NEON (SP fp) |
- | cpu2 1512000 408000 | + | TIME(s) |
- | cpu3 1512000 408000 | + | VFP fmuls (32bit x1) n8 : |
- | cpu4 2016000 408000 | + | VFP fadds (32bit x1) n8 : |
- | cpu5 2016000 408000 | + | VFP fmacs (32bit x1) n8 : |
+ | VFP vfma.f32 (32bit x1) n8 : - - - | ||
+ | NEON vmul.f32 (32bit x2) n8 : | ||
+ | NEON vadd.f32 (32bit x2) n8 : | ||
+ | NEON vmla.f32 (32bit x2) n8 : | ||
+ | NEON vfma.f32 (32bit x2) n8 : | ||
+ | NEON vmul.f32 (32bit x4) n8 : | ||
+ | NEON vadd.f32 (32bit x4) n8 : | ||
+ | NEON vmla.f32 (32bit x4) n8 : | ||
+ | NEON vfma.f32 (32bit x4) n8 : | ||
+ | VFP fmuls (32bit x1) ns4 : 0.602 | ||
+ | VFP fadds (32bit x1) ns4 : 0.602 | ||
+ | VFP fmacs (32bit x1) ns4 : 1.054 | ||
+ | VFP vfma.f32 (32bit x1) ns4 : | ||
+ | NEON vmul.f32 (32bit x2) ns4 : 0.602 | ||
+ | NEON vadd.f32 (32bit x2) ns4 : 0.602 | ||
+ | NEON vmla.f32 (32bit x2) ns4 : 1.054 | ||
+ | NEON vfma.f32 (32bit x2) ns4 : - - - | ||
+ | NEON vmul.f32 (32bit x4) ns4 : 0.602 | ||
+ | NEON vadd.f32 (32bit x4) ns4 : 0.602 | ||
+ | NEON vmla.f32 (32bit x4) ns4 : 1.055 | ||
+ | NEON vfma.f32 (32bit x4) ns4 : - - - | ||
+ | VFP fmuls (32bit x1) n1 : | ||
+ | VFP fadds (32bit x1) n1 : | ||
+ | VFP fmacs (32bit x1) n1 : | ||
+ | VFP vfma.f32 (32bit x1) n1 : - - - | ||
+ | NEON vmul.f32 (32bit x2) n1 : | ||
+ | NEON vadd.f32 (32bit x2) n1 : | ||
+ | NEON vmla.f32 (32bit x2) n1 : | ||
+ | NEON vfma.f32 (32bit x2) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n1 : | ||
+ | NEON vadd.f32 (32bit x4) n1 : | ||
+ | NEON vmla.f32 (32bit x4) n1 : | ||
+ | NEON vfma.f32 (32bit x4) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n12 : 0.904 | ||
+ | NEON vadd.f32 (32bit x4) n12 : 0.904 | ||
+ | NEON vmla.f32 (32bit x4) n12 : 0.904 16062.4 | ||
+ | NEON vfma.f32 (32bit x4) n12 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | processor : 0 | ||
- | model name : ARMv8 Processor rev 4 (v8l) | ||
- | BogoMIPS : 48.00 | ||
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0x0 | ||
- | CPU part : 0xd03 | ||
- | CPU revision : 4 | ||
- | processor : 1 | + | * Group 1: |
- | model name : ARMv8 Processor rev 4 (v8l) | + | * VFP/NEON (DP fp) |
- | BogoMIPS : 48.00 | + | TIME(s) |
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | + | VFP fmuld (64bit x1) n8 : |
- | CPU implementer : 0x41 | + | VFP faddd (64bit x1) n8 : 0.301 |
- | CPU architecture: 8 | + | VFP fmacd (64bit x1) n8 : |
- | CPU variant : 0x0 | + | VFP vfma.f64 (64bit x1) n8 |
- | CPU part : 0xd03 | + | VFP fmuld (64bit x1) ns4 |
- | CPU revision : 4 | + | VFP faddd (64bit x1) ns4 : 0.603 |
+ | VFP fmacd (64bit x1) ns4 : 1.054 2294.8 1147.4 | ||
+ | VFP vfma.f64 (64bit x1) ns4 : | ||
+ | VFP fmuld (64bit x1) n1 : | ||
+ | VFP faddd (64bit x1) n1 : | ||
+ | VFP fmacd (64bit x1) n1 : | ||
+ | VFP vfma.f64 (64bit x1) n1 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | processor : 2 | ||
- | model name : ARMv8 Processor rev 4 (v8l) | ||
- | BogoMIPS : 48.00 | ||
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0x0 | ||
- | CPU part : 0xd03 | ||
- | CPU revision : 4 | ||
- | processor : 3 | + | * Group 1: |
- | model name : ARMv8 Processor rev 4 (v8l) | + | * VFP/NEON (SP fp) multi-thread |
- | BogoMIPS : 48.00 | + | TIME(s) |
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | + | VFP fmuls (32bit x1) n8 : |
- | CPU implementer : 0x41 | + | VFP fadds (32bit x1) n8 : |
- | CPU architecture: 8 | + | VFP fmacs (32bit x1) n8 : |
- | CPU variant : 0x0 | + | VFP vfma.f32 (32bit x1) n8 |
- | CPU part : 0xd03 | + | NEON vmul.f32 (32bit x2) n8 : |
- | CPU revision : 4 | + | NEON vadd.f32 |
+ | NEON vmla.f32 (32bit x2) n8 : 0.302 32089.1 | ||
+ | NEON vfma.f32 (32bit x2) n8 : | ||
+ | NEON vmul.f32 (32bit x4) n8 : | ||
+ | NEON vadd.f32 (32bit x4) n8 : | ||
+ | NEON vmla.f32 (32bit x4) n8 : | ||
+ | NEON vfma.f32 (32bit x4) n8 : | ||
+ | VFP fmuls (32bit x1) ns4 | ||
+ | VFP fadds (32bit x1) ns4 : 0.603 | ||
+ | VFP fmacs (32bit x1) ns4 : 1.055 | ||
+ | VFP vfma.f32 (32bit x1) ns4 : | ||
+ | NEON vmul.f32 (32bit x2) ns4 : 0.602 | ||
+ | NEON vadd.f32 (32bit x2) ns4 : 0.603 | ||
+ | NEON vmla.f32 (32bit x2) ns4 : 1.055 | ||
+ | NEON vfma.f32 (32bit x2) ns4 : - - - | ||
+ | NEON vmul.f32 (32bit x4) ns4 : 0.603 16058.0 | ||
+ | NEON vadd.f32 (32bit x4) ns4 : 0.602 16066.4 | ||
+ | NEON vmla.f32 (32bit x4) ns4 : 1.054 18359.9 | ||
+ | NEON vfma.f32 (32bit x4) ns4 : - - - | ||
+ | VFP fmuls (32bit x1) n1 : | ||
+ | VFP fadds (32bit x1) n1 : | ||
+ | VFP fmacs (32bit x1) n1 : | ||
+ | VFP vfma.f32 (32bit x1) n1 : - - - | ||
+ | NEON vmul.f32 (32bit x2) n1 : | ||
+ | NEON vadd.f32 (32bit x2) n1 : | ||
+ | NEON vmla.f32 (32bit x2) n1 : | ||
+ | NEON vfma.f32 (32bit x2) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n1 : | ||
+ | NEON vadd.f32 (32bit x4) n1 : | ||
+ | NEON vmla.f32 (32bit x4) n1 : | ||
+ | NEON vfma.f32 (32bit x4) n1 : | ||
+ | NEON vmul.f32 (32bit x4) n12 : 0.904 16062.1 | ||
+ | NEON vadd.f32 (32bit x4) n12 : 0.904 16063.5 | ||
+ | NEON vmla.f32 (32bit x4) n12 : 0.904 32117.4 | ||
+ | NEON vfma.f32 (32bit x4) n12 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | processor : 4 | ||
- | model name : ARMv8 Processor rev 2 (v8l) | ||
- | BogoMIPS : 48.00 | ||
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | ||
- | CPU implementer : | ||
- | CPU architecture: | ||
- | CPU variant : 0x0 | ||
- | CPU part : 0xd08 | ||
- | CPU revision : 2 | ||
- | processor : 5 | + | * Group 1: |
- | model name : ARMv8 Processor rev 2 (v8l) | + | * VFP/NEON (DP fp) multi-thread |
- | BogoMIPS : 48.00 | + | TIME(s) |
- | Features : half thumb fastmult vfp edsp neon vfpv3 tls vfpv4 idiva idivt lpae evtstrm aes pmull sha1 sha2 crc32 | + | VFP fmuld (64bit x1) n8 : |
- | CPU implementer : 0x41 | + | VFP faddd (64bit x1) n8 : 0.301 |
- | CPU architecture: 8 | + | VFP fmacd (64bit x1) n8 : |
- | CPU variant : 0x0 | + | VFP vfma.f64 (64bit x1) n8 |
- | CPU part : 0xd08 | + | VFP fmuld (64bit x1) ns4 |
- | CPU revision : 2 | + | VFP faddd (64bit x1) ns4 |
- | + | VFP fmacd (64bit x1) ns4 | |
- | + | VFP vfma.f64 (64bit x1) ns4 : | |
- | ARMv8 Processor rev 4 (v8l) | + | VFP fmuld (64bit x1) n1 : |
+ | VFP faddd (64bit x1) n1 : | ||
+ | VFP fmacd (64bit x1) n1 : | ||
+ | VFP vfma.f64 (64bit x1) n1 : - - - | ||
+ | Average | ||
+ | Highest | ||
- | 2019/01/05 16: | ||
</ | </ | ||
opengl/vfpbenchlog.txt · 最終更新: 2020/12/30 23:46 by oga