


VFP Benchmark Log: Summary of Measurement Results

Results

Device OS SoC CPU Arch FPU Cores Clock Single-SP Single-DP Multi-SP Multi-DP (GFLOPS)
PC AMD Ryzen 9 3950X Win10 AMD Ryzen 9 3950X Zen2 x64 SSE4.2/AVX2/FMA3 16 3.5GHz 132.173 66.092 1904.671 949.919
PC Intel Core i7-6700K Win10 Intel Core i7-6700K Skylake x64 SSE4.2/AVX2/FMA3 4 4.0GHz 135.577 67.698 542.267 271.127
PC Intel Core i7-4790K Ubuntu Intel Core i7-4790K Haswell x64 SSE4.2/AVX2/FMA3 4 4.0GHz 140.339 46.722 537.865 268.264
PC AMD Ryzen 7 1800X Win10 AMD Ryzen 7 1800X Zen x64 SSE4.2/AVX2/FMA3 8 3.6GHz 62.467 30.860 474.832 237.482
Apple Mac mini Late 2012 OSX.10 Intel Core i7-3615QM Ivy Bridge x64 SSE4.2/AVX 4 2.3GHz 51.427 25.693 194.698 96.913
Apple MacBook Pro Late 2011 OSX.10 Intel Core i7-2720QM Sandy Bridge x64 SSE4.2/AVX 4 2.2GHz 52.260 26.137 162.316 74.049
Pixel 3 A9.0 Snapdragon 845 Kryo 385(A75/55) ARMv8A AArch64 8 2.8GHz 35.994 17.990 139.338 69.582
Essential Phone PH-1 A9.0 Snapdragon 835 Kryo (A73/53) ARMv8A AArch64 8 2.45GHz 34.353 17.178 129.511 67.329
Amazon Fire HD 10 2019 A9.0 MediaTek MT8183 A73/A53 ARMv8A AArch64 8 2.0GHz 31.038 11.671 125.468 46.937
PC AMD A10-7870K Win10 AMD A10-7870K Steamroller x64 SSE4.2/AVX/FMA3 2 3.9GHz 64.743 32.400 124.500 62.247
Apple MacBook Pro Late 2013 OSX.10 Intel Core i5-3210M Ivy Bridge x64 SSE4.2/AVX 2 2.5GHz 48.604 24.317 90.247 45.223
iPhone SE iOS9.3 Apple A9 Twister ARMv8A AArch64 2 1.85GHz 41.857 14.545 81.071 28.333
NVIDIA SHIELD Tablet A4.4 NVIDIA Tegra K1 Cortex-A15 ARMv7A VFPv4 NEON 4 2.2GHz 17.136 3.431 70.174 14.036
Apple iPad A8X i8.0 Apple A8X Typhoon ARMv8A AArch64 3 1.5GHz 23.568 11.751 68.591 33.968
NVIDIA SHIELD Android TV A5.1 NVIDIA Tegra X1 Cortex-A57 ARMv8A AArch64 4 2.1GHz 17.041 8.554 67.588 33.730
Amazon Fire HDX 7 2013 A4.4 Qualcomm 800 MSM8974 Krait 400 ARMv7A VFPv4 NEON 4 2.2GHz 17.128 4.289 67.539 16.874
Motorola Nexus 6 A5.0 Qualcomm 805 APQ8084 Krait 450 ARMv7A VFPv4 NEON 4 2.7GHz 15.575 4.547 64.316 20.393
PC AMD Athlon 5350 Kabini Ubuntu AMD Athlon 5350 Jaguar x64 SSE4.2/AVX 4 2.0GHz 15.943 6.127 63.737 24.504
PC Intel J1900 BayTrail-D Ubuntu Intel Celeron J1900 Silvermont x64 SSE4.2 4 2.0GHz 14.477 3.619 57.902 14.471
NVIDIA Tegra Note 7 A4.4 NVIDIA Tegra 4 Cortex-A15 ARMv7A VFPv4 NEON 4 1.8GHz 13.371 2.655 51.345 9.860
PC Intel N3150 Braswell Ubuntu Intel Celeron N3150 Airmont x64 SSE4.2 4 1.6GHz 12.468 3.117 49.679 12.469
Raspberry Pi 4 Ubuntu Broadcom BCM2711 Cortex-A72 ARMv8A AArch64 4 1.5GHz 11.973 5.987 47.925 23.962
ASUS Nexus 7 2013 A4.4 Qualcomm S4 APQ8064 Krait ARMv7A VFPv4 NEON 4 1.5GHz 11.947 3.005 47.808 11.751
HTC J butterfly HTL21 A4.1 Qualcomm S4 APQ8064 Krait ARMv7A VFPv4 NEON 4 1.5GHz 11.883 2.967 46.954 11.778
NVIDIA Jetson nano Ubuntu NVIDIA Tegra X1 Cortex-A57 ARMv8A AArch64 4 1.43GHz 11.404 5.702 45.454 22.727
Apple TV (2015) tv9.0 Apple A8 Typhoon ARMv8A AArch64 2 1.4GHz 22.197 11.105 44.331 22.084
Apple iPhone 5s i8.0 Apple A7 Cyclone ARMv8A AArch64 2 1.3GHz 20.621 10.313 40.871 20.480
Apple iPad mini 2 i8.0 Apple A7 Cyclone ARMv8A AArch64 2 1.3GHz 20.373 10.223 40.616 20.238
Dragonboard 410c Debian Snapdragon 410 MSM8916 Cortex-A53 ARMv8A AArch64 4 1.2GHz 9.498 4.749 37.965 18.603
Raspberry Pi 3 Debian Broadcom BCM2837 Cortex-A53 ARMv8A VFPv4 NEON 4 1.2GHz 9.431 2.477 37.442 9.994
Apple iPod touch 6 i8.4 Apple A8 Typhoon ARMv8A AArch64 2 1.1GHz 17.964 8.899 35.530 17.775
ASUS MeMO Pad 7 ME176 A5.0 Intel Atom Z3745 Silvermont x86 SSE4.2 4 1.83GHz 8.946 2.797 35.473 11.060
HTC Nexus 9 A5.0 NVIDIA Tegra K1 Denver ARMv8A AArch64 2 2.5GHz 17.906 8.762 34.888 17.601
ASUS Nexus Player A5.0 Intel Atom Z3560 Silvermont x86 SSE4.2 4 1.8GHz 8.733 2.733 33.852 10.655
Amazon Fire TV 2015 A5.1 MediaTek MT8173C Cortex-A72 ARMv8A AArch64 2 2.0GHz 15.864 7.934 31.771 15.885
Apple Mac mini Early 2009 OSX.10 Intel Core 2 Duo P7350 Penryn x64 SSE4.1 2 2.0GHz 15.916 6.365 31.662 12.724
Dragonboard 410c A5.1 Snapdragon 410 MSM8916 Cortex-A53 ARMv8A AArch64 4 1.2GHz 9.377 4.737 30.817 15.063
Samsung Nexus 10 A4.4 Samsung Exynos 5250 Cortex-A15 ARMv7A VFPv4 NEON 2 1.7GHz 13.483 2.686 26.724 5.314
ASUS MeMO Pad 7 ME176 (BT) A5.0 Intel Atom Z3745 Silvermont ARMv7A VFPv3/NEON 4 1.83GHz 6.144 1.476 24.329 5.905
Apple iPad 4 i8.0 Apple A6X Swift ARMv7A VFPv4 NEON 2 1.4GHz 10.855 1.818 21.502 3.573
Apple iPhone 5 i9.0 Apple A6 Swift ARMv7A VFPv4 NEON 2 1.3GHz 10.094 1.710 20.029 3.398
ASUS Nexus 7 2012 A4.4 NVIDIA Tegra 3 Cortex-A9 ARMv7A VFPv3 NEON 4 1.2GHz 4.783 1.196 18.905 4.724
ASUS Fonepad 7 ME372CL A4.4 Intel Atom Z2560 Saltwell x86 SSSE3 2 1.6GHz 7.540 1.523 18.630 3.504
Acer Chromebook c720 Ubuntu Intel Celeron 2955U Haswell x64 SSE4.2 2 1.4GHz 8.898 4.448 17.339 8.784
HTC EVO 3d A4.0 Qualcomm S3 MSM8660 Scorpion ARMv7A VFPv3 NEON 2 1.2GHz 8.898 1.112 16.560 1.266
Sony VAIO Type P Ubuntu Intel Atom Z540 Bonnell x86 SSSE3 1 1.86GHz 8.918 1.810 10.927 1.852
Lenovo Yoga Tablet 8 A4.2 MediaTek MT8125 Cortex-A7 ARMv7A VFPv4 NEON 4 1.2GHz 2.374 1.165 9.474 4.653
SHARP Mebius Note PCPJ1 Ubuntu Intel Atom N270 Bonnell x86 SSSE3 1 1.6GHz 5.597 1.548 9.277 1.570
NEC Medias N-06C A2.3 Qualcomm S2 MSM8255 Scorpion ARMv7A VFPv3 NEON 1 1.0GHz 7.786 0.977 7.835 0.981
Apple iPad 2 i8.0 Apple A5 Cortex-A9 ARMv7A VFPv3 NEON 2 1.0GHz 3.960 0.989 7.830 1.961
Apple iPad mini i8.0 Apple A5 Cortex-A9 ARMv7A VFPv3 NEON 2 1.0GHz 3.846 0.983 7.800 1.941
Fire TV Stick 2015 A5.1 Broadcom 28155 Cortex-A9 ARMv7A VFPv3 NEON 2 1.0GHz 3.968 0.992 7.761 1.946
Apple iPad 3 i8.0 Apple A5X Cortex-A9 ARMv7A VFPv3 NEON 2 1.0GHz 3.394 0.983 7.752 1.954
Sony Xperia IS11S A2.3 Qualcomm S2 MSM8255 Scorpion ARMv7A VFPv3 NEON 1 1.0GHz 7.681 0.960 7.623 0.960
Raspberry Pi 2 Debian Broadcom BCM2836 Cortex-A7 ARMv7A VFPv4 NEON 4 0.9GHz 1.791 0.877 7.087 3.472
HTC Desire A2.2 Qualcomm S1 QSD8250 Scorpion ARMv7A VFPv3 NEON 1 1.0GHz 7.098 0.886 7.058 0.886
Apple iPod touch 5 i8.0 Apple A5 Cortex-A9 ARMv7A VFPv3 NEON 2 0.8GHz 3.161 0.790 6.203 1.565
Sony SmartWatch 3 SWR50 A4.4W Qualcomm 400 MSM8226 Cortex-A7 ARMv7A VFPv4 NEON 4 1.2GHz 2.257 1.144 4.946 2.278
NEC LifeTouchNote A2.3 NVIDIA Tegra 2 Cortex-A9 ARMv7A VFPv3 2 1.0GHz 1.993 0.999 3.908 1.962
LG OptimusPad L-06C A3.1 NVIDIA Tegra 2 Cortex-A9 ARMv7A VFPv3 2 1.0GHz 1.983 0.997 3.853 1.965
Motorola Moto 360 A5.0 TI OMAP3 Cortex-A8 ARMv7A VFPv3 NEON 1 1.0GHz 3.739 0.126 3.376 0.125
Apple iPod touch 4 i6.0 Apple A4 Cortex-A8 ARMv7A VFPv3 NEON 1 0.8GHz 3.139 0.112 3.139 0.112
Creative Ziio 7 A2.2 Creative ZMS-08 Cortex-A8 ARMv7A VFPv3 NEON 1 1.0GHz 2.781 0.100 2.792 0.099
Apple Watch S2 W3.1 Apple S2 Cortex-A7 ARMv7A VFPv4 NEON 2 0.5GHz 0.986 0.483 1.807 0.879
LG G Watch A4.4W Qualcomm 400 MSM8226 Cortex-A7 ARMv7A VFPv4 NEON 4 1.2GHz 1.419 0.742 1.367 0.676
Apple Watch W2.0 Apple S1 Cortex-A7 ARMv7A VFPv4 NEON 1 0.5GHz 0.951 0.470 0.945 0.469
Raspberry Pi Debian Broadcom BCM2835 ARM1176JZF-S ARMv6 VFPv2 1 0.7GHz 0.674 0.674 0.674 0.674
SmartQ ZWatch A4.4 Ingenic JZ4775 XBurst MIPS32 FPU 1 1.0GHz 0.117 0.116 0.117 0.117
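One way to read the table is to normalize the multi-thread single-precision result by core count and clock, which gives a rough "FLOPs per cycle per core" figure. The sketch below is only an illustration: it assumes the core-count and clock values that precede the four result figures in each row, uses a handful of rows copied from the table, and the helper name `flops_per_cycle_per_core` is hypothetical. The listed clock may be a nominal value rather than the frequency sustained during the run, and on big.LITTLE parts the cores are not identical, so the figure is approximate at best.

```python
# Rows copied from the results table:
# (device, cores, clock in GHz, Multi-SP in GFLOPS).
ROWS = [
    ("Raspberry Pi 4", 4, 1.5, 47.925),
    ("Raspberry Pi 3", 4, 1.2, 37.442),
    ("Apple iPhone 5s", 2, 1.3, 40.871),
    ("PC Intel Core i7-6700K", 4, 4.0, 542.267),
]


def flops_per_cycle_per_core(cores, clock_ghz, multi_sp_gflops):
    """Multi-thread SP GFLOPS divided by (cores x listed clock in GHz)."""
    return multi_sp_gflops / (cores * clock_ghz)


for device, cores, clock_ghz, gflops in ROWS:
    value = flops_per_cycle_per_core(cores, clock_ghz, gflops)
    print(f"{device:24s} {value:6.2f}")
```

For the Raspberry Pi 4 row this comes out just under 8, which is what a 128-bit fused multiply-add issued every cycle would give (4 lanes x 2 operations).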

Mobile CPU 32bit

ARM ARM11 (ARMv6) VFPv2

Raspberry Pi ARM1176JZF-S 700MHz

IDEOS MSM7225 ARM11 528MHz

ARM Cortex-A8 (ARMv7A) VFPv3+NEON

Creative ZiiO7 ZMS-08 Cortex-A8 1.0GHz single core

iPod touch 4 Apple A4 Cortex-A8 0.8GHz

Qualcomm Scorpion (ARMv7A) VFPv3+NEON

HTC Desire Snapdragon QSD8250 Scorpion 1.0GHz single core

Sony Xperia IS11S Snapdragon MSM8655 Scorpion 1.0GHz single core

NEC Medias N-06C Snapdragon MSM8255 Scorpion 1.0GHz single core

HTC EVO 3D Snapdragon MSM8660 Scorpion 1.2GHz dual core

ARM Cortex-A7 (ARMv7A) VFPv4+NEON

Yoga Tablet 8 MT8125 Cortex-A7 1.2GHz Quad core

Raspberry Pi 2 BCM2836 Cortex-A7 0.9GHz quad core

ARM Cortex-A9 (ARMv7A) VFPv3 (+NEON)

OptimusPad L-06C Tegra2 Cortex-A9 1.0GHz dual core VFPv3-D16

NEC LifeTouchNote Tegra2 Cortex-A9 1.0GHz dual core VFPv3-D16

iPad 2 Apple A5 Cortex-A9 1.0GHz dual core

iPad mini Apple A5 Cortex-A9 1.0GHz dual core

iPad 3 Apple A5X Cortex-A9 1.0GHz dual core

iPod touch 5 Apple A5 Cortex-A9 0.8GHz dual core

Nexus 7 (2012) Tegra 3 1.2GHz Cortex-A9 Quad core

Amazon Fire TV Stick (2015) Broadcom 28155 Cortex-A9 1.0GHz Dual core

ARM Cortex-A15 (ARMv7A) VFPv4+NEON

Nexus 10 Exynos 5 Dual (5250) Cortex-A15 1.7GHz dual core

Tegra Note 7 Tegra4 Cortex-A15 1.8GHz Quad core

NVIDIA SHIELD Tablet Tegra K1 Cortex-A15 2.2GHz Quad core

ARM Cortex-A53 (AArch32) VFPv4+NEON

Dragonboard 410c Snapdragon 410 Cortex-A53 1.2GHz quad core

Raspberry Pi 3 BCM2837 Cortex-A53 1.2GHz debian 8.0

ARM Cortex-A72 (AArch32) VFPv4+NEON

Amazon Fire TV MT8173C Cortex-A72 2.0GHz dual core

Qualcomm Krait (ARMv7A) VFPv4+NEON

HTC J butterfly HTL21 Snapdragon S4 Pro APQ8064 Krait 1.5GHz Quad core

Nexus 7 (2013) Snapdragon S4 Pro APQ8064 Krait 1.5GHz Quad core

Kindle Fire HDX7 Snapdragon 800 MSM8974 Krait 400 2.2GHz Quad core

Nexus 6 Snapdragon 805 APQ8084 Krait 2.7GHz Quad core

Apple Swift (ARMv7A) VFPv4+NEON

iPad 4 Apple A6X Swift 1.4GHz dual core

iPhone 5 Apple A6 Swift 1.3GHz dual core

Apple Typhoon (AArch32) VFPv4+NEON

iPod touch 6 Apple A8 Typhoon 1.1GHz dual core

Intel Atom Saltwell (IA32 x86) SSSE3

ASUS Fonepad 7 LTE ME372CL Z2560 Dual core 1.6GHz (Clover Trail+ / Saltwell)

Intel Atom Silvermont (IA32 x86) SSE4.2

ASUS MeMO Pad 7 ME176 BayTrail-T Atom Z3745 Silvermont 1.83GHz Quad core (x86)

ASUS MeMO Pad 7 ME176 BayTrail-T Atom Z3745 Silvermont 1.83GHz Quad core (ARMv7A Binary Translator)

Nexus Player BayTrail-T Atom Z35xx Silvermont 1.8GHz Quad core (x86)


Smartwatch 32bit

ARM Cortex-A8 (ARMv7A) VFPv3+NEON

Motorola moto 360 Android Wear TI OMAP3 Cortex-A8 1.0GHz single core

ARM Cortex-A7 (ARMv7A) VFPv4+NEON

LG G Watch Android Wear Snapdragon 400 0.8GHz quad core (single core)

Sony SmartWatch 3 SWR50 Android Wear Snapdragon 400 1.2GHz quad core

Fossil Q-Marshal Snapdragon 400 quad core (dual core)

Intel Atom (IA32 x86) SSE4.2

Fossil Q Founder Android Wear Atom T1000 1.0GHz dual core

Ingenic JZ4775 XBurst (MIPS32-R2) FPU

SmartQ ZWatch JZ4775 XBurst 1.0GHz (MIPS32-R2)

Apple S1 Apple Watch (ARMv7A)

Apple S1 Apple Watch (ARMv7A)

Apple S1 Apple Watch (ARMv7A) watchOS 3.1

Apple S2 Apple Watch (ARMv7A)

Apple S2 Apple Watch (ARMv7A)


Mobile CPU 64bit

Apple Cyclone (ARMv8A AArch64 arm64) FPU+NEON

iPhone 5s Apple A7 Cyclone 1.3GHz Dual core ARM64 (AArch64)

iPad mini retina (mini2) Apple A7 Cyclone 1.3GHz dual core

Apple Typhoon (ARMv8A AArch64 arm64) FPU+NEON

iPad Air 2 Apple A8X Typhoon 1.5GHz Triple core ARM64 (AArch64)

iPod touch 6 A8 Typhoon 1.1GHz Dual core ARM64 (AArch64)

Apple TV A8 Typhoon 1.4GHz Dual core ARM64 (AArch64)

Apple Twister (ARMv8A AArch64 arm64) FPU+NEON

iPhone SE A9 Twister 1.85GHz Dual core ARM64 (AArch64)

NVIDIA Denver (ARMv8A AArch64 arm64) FPU+NEON

Nexus 9 Tegra K1 64 Denver 2.3GHz Dual core ARM64 (AArch64)

Qualcomm Kryo (ARMv8A AArch64 arm64) FPU+NEON

ZenFone AR Snapdragon 821 big core Kryo 2.34GHz x2 ARM64 (AArch64) Android 7.0

ZenFone AR Snapdragon 821 little core Kryo 2.18GHz x2 ARM64 (AArch64) Android 7.0

Qualcomm Kryo 280 (Cortex-A73 + A53) (ARMv8A AArch64 arm64) FPU+ASIMD

Essential Phone PH-1 Snapdragon 835 Kryo 280 2.45GHz x4 + 1.9GHz x4 ARM64 (AArch64) Android 9.0

Qualcomm Kryo 385 (Cortex-A75 + A55) (ARMv8.2A AArch64 arm64) FPU+ASIMD+HALFFP

Pixel 3 Snapdragon 845 Kryo 385 2.8GHz x4 + 1.77GHz x4 ARM64 (AArch64) Android 9.0

ARM Cortex-A53 (ARMv8A AArch64 arm64) FPU+NEON

Dragonboard 410c Snapdragon 410 1.2GHz ARM64 (AArch64) android 5.1

Dragonboard 410c Snapdragon 410 1.2GHz ARM64 (AArch64) debian 8.0

Nexus 5X Snapdragon 808 MSM8992 little core A53 1.44GHz x4 ARM64 (AArch64) android 8.1

Galaxy S6 Edge Exynos 7420 little core 1.5GHz x4 ARM64 (AArch64) android 7.0

Chromebook Flip C101PA RK3399 little core 1.5GHz x4 ARM64 (AArch64) android 7.1

ARM Cortex-A57 (ARMv8A AArch64 arm64) FPU+NEON

SHIELD Android TV Tegra X1 2.1GHz ARM64 (AArch64) android 5.1

Nexus 5X Snapdragon 808 MSM8992 big core A57 1.82GHz x2 ARM64 (AArch64) android 8.1

Galaxy S6 Edge Exynos 7420 big core 2.1GHz x4 ARM64 (AArch64) android 7.0

ARM Cortex-A72 (ARMv8A AArch64 arm64) FPU+NEON

Amazon Fire TV 2015 MT8173C Cortex-A72 2.0GHz (big.LITTLE 2+2) ARM64 (AArch64)

Chromebook Flip C101PA RK3399 big core 2.0GHz x2 ARM64 (AArch64) android 7.1


Desktop CPU

Intel Atom Bonnell (IA32 x86) SSSE3

Atom N270 Single core 1.6GHz (Diamondville / Bonnell)

Atom Z540 Single core 1.86GHz (Menlow / Bonnell)

Intel Atom Silvermont (AMD64 x86_64 x64) SSE4.2

Atom J1900 Quad core 2.0GHz (BayTrail-D / Silvermont)

AMD Jaguar (AMD64 x86_64 x64) SSE4.2/AVX1

Athlon 5350 Quad core 2.0GHz (Kabini / Jaguar)

Intel Atom Airmont (AMD64 x86_64 x64) SSE4.2

Atom N3150 Quad core 1.6GHz (Braswell/Airmont)

Intel Core 2 Duo (AMD64 x86_64 x64) SSE4.1

Core 2 Duo P7350 2.0GHz

Intel Sandy Bridge (AMD64 x86_64 x64) SSE4.2/AVX1

Sandy Bridge Core i7-2720QM 2.2GHz

Intel Ivy Bridge (AMD64 x86_64 x64) SSE4.2/AVX1

Ivy Bridge Core i5-3210M 2.5GHz

Ivy Bridge Core i7-3615QM 2.3GHz

Intel Haswell (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3

Haswell Core i7-4790K 4.0GHz Linux

Haswell Celeron 2955U 1.4GHz (SSE4.2)

Haswell Core i7-4790K 4.0GHz (4.4GHz) 4 core 8 thread Windows 10

Intel Skylake (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3

Skylake Core i7-6700K 4.0GHz (4.2GHz) 4 core 8 thread Windows 10

AMD Ryzen 7 1800X (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3

Ryzen 7 1800X 3.6GHz (4.0GHz) 8 core 16 thread Windows 10

Intel Core i5-1030NG7 (AMD64 x86_64 x64) SSE4.2/AVX2/FMA3/AVX512F,CD,VL,BW,DQ,VNNI

Intel Core i5-1030NG7 1.1GHz (3.5GHz) 4 core 8 thread Windows 10

Date: 20200810 185418
ARCH: x64 (x86_64)
FPU : SSE SSE2 SSSE3 SSE4.1 SSE4.2 AVX AVX2 FMA3 F16C AVX512F/BW/DQ/VL/VNNI
Name: 
CPU Thread:  8
CPU Core  :  4
CPU Group :  1
  Group 0: Thread= 8  Clock=1.100000 GHz  (mask:0)
SSE   : yes
AVX   : yes
FMA   : yes
F16C  : yes
AVX512: yes

Total:
SingleThread HP max: -
SingleThread SP max:  111.310 GFLOPS
SingleThread DP max:   55.593 GFLOPS
MultiThread  HP max: -
MultiThread  SP max:  413.685 GFLOPS
MultiThread  DP max:  204.351 GFLOPS

Group 0:  Thread=8  Clock=1.100000 GHz  (mask:0)
  SingleThread HP max: -
  SingleThread SP max:  111.310 GFLOPS
  SingleThread DP max:   55.593 GFLOPS
  MultiThread  HP max: -
  MultiThread  SP max:  413.685 GFLOPS
  MultiThread  DP max:  204.351 GFLOPS


* Group 0:  Thread=1  Clock=1.100000 GHz  (mask:0)
* SSE/AVX (SP fp)
                                      TIME(s)   MFLOPS      MOPS     FOP   IPC
SSE mulss (32bit x1) n8           :    0.104     6337.3     6337.3  (  1.0 5.8)
SSE addss (32bit x1) n8           :    0.101     6505.5     6505.5  (  1.0 5.9)
FMA vfmaddss (32bit x1) n8        :    0.101    13027.6     6513.8  (  2.0 5.9)
FMA vfmaddss (32bit x1) n12       :    0.143    13885.0     6942.5  (  2.0 6.3)
FMA vfma+mlss (32bit x1) n12      :    0.143    10399.8     6933.2  (  1.5 6.3)
FMA vfma+adss (32bit x1) n12      :    0.142    10437.5     6958.3  (  1.5 6.3)
SSE mulps (32bit x4) n8           :    0.101    26090.0     6522.5  (  4.0 5.9)
SSE addps (32bit x4) n8           :    0.101    26027.5     6506.9  (  4.0 5.9)
SSE mul+addps (32bit x4) n8       :    0.102    25995.0     6498.7  (  4.0 5.9)
FMA vfmaddps (32bit x4) n8        :    0.102    51919.4     6489.9  (  8.0 5.9)
FMA vfmaddps (32bit x4) n12       :    0.143    55295.3     6911.9  (  8.0 6.3)
FMA vfma+mlps (32bit x4) n12      :    0.142    41781.8     6963.6  (  6.0 6.3)
FMA vfma+adps (32bit x4) n12      :    0.143    41652.9     6942.2  (  6.0 6.3)
SSE ml+ad+adps (32bit x4) n9      :    0.108    27519.6     6879.9  (  4.0 6.3)
SSE mulss (32bit x1) ns4          :    0.190     3467.6     3467.6  (  1.0 3.2)
SSE addss (32bit x1) ns4          :    0.190     3466.8     3466.8  (  1.0 3.2)
SSE mulps (32bit x4) ns4          :    0.190    13906.4     3476.6  (  4.0 3.2)
SSE addps (32bit x4) ns4          :    0.190    13867.9     3467.0  (  4.0 3.2)
AVX vmulps (32bit x8) n8          :    0.095    55597.1     6949.6  (  8.0 6.3)
AVX vaddps (32bit x8) n8          :    0.095    55388.9     6923.6  (  8.0 6.3)
AVX vmul+addps (32bit x8) n8      :    0.095    55612.9     6951.6  (  8.0 6.3)
FMA vfmaddps (32bit x8) n8        :    0.122    86880.7     5430.0  ( 16.0 4.9)
FMA vfmaddps (32bit x8) n12       :    0.143   110981.1     6936.3  ( 16.0 6.3)
FMA vfma+mlps (32bit x8) n12      :    0.142    83413.5     6951.1  ( 12.0 6.3)
FMA vfma+adps (32bit x8) n12      :    0.144    82441.6     6870.1  ( 12.0 6.2)
AVX vml+ad+adps (32bit x8) n9     :    0.202    29339.7     3667.5  (  8.0 3.3)
AVX512 vmulps (32bit x16) n12     :    0.295    53719.7     3357.5  ( 16.0 3.1)
AVX512 vaddps (32bit x16) n12     :    0.293    54028.2     3376.8  ( 16.0 3.1)
AVX512 vfmaddps (32bit x16) n12   :    0.293   108003.4     3375.1  ( 32.0 3.1)
AVX512 vfma+mps (32bit x16) n12   :    0.293    81034.9     3376.5  ( 24.0 3.1)
AVX512 vfma+aps (32bit x16) n12   :    0.293    81173.3     3382.2  ( 24.0 3.1)
AVX512 vmulps (32bit x8) n12      :    0.144    55154.4     6894.3  (  8.0 6.3)
AVX512 vaddps (32bit x8) n12      :    0.142    55624.6     6953.1  (  8.0 6.3)
AVX512 vfmaddps (32bit x8) n12    :    0.142   111310.2     6956.9  ( 16.0 6.3)
Average                           :    0.158    45626.1     5768.7  (  8.7 5.2)
Highest                           :    0.095   111310.2     6963.6  ( 32.0 6.3)


* Group 0:  Thread=1  Clock=1.100000 GHz  (mask:0)
* SSE/AVX (DP fp)
                                      TIME(s)   MFLOPS      MOPS     FOP   IPC
SSE2 mulsd (64bit x1) n8          :    0.143     4607.0     4607.0  (  1.0 4.2)
SSE2 addsd (64bit x1) n8          :    0.102     6494.8     6494.8  (  1.0 5.9)
FMA  vfmaddsd (64bit x1) n8       :    0.102    12997.0     6498.5  (  2.0 5.9)
FMA  vfmaddsd (64bit x1) n12      :    0.142    13910.1     6955.0  (  2.0 6.3)
FMA  vfma+mlsd (64bit x1) n12     :    0.143    10395.1     6930.1  (  1.5 6.3)
FMA  vfma+adsd (64bit x1) n12     :    0.143    10382.1     6921.4  (  1.5 6.3)
SSE2 mulpd (64bit x2) n8          :    0.102    12983.3     6491.7  (  2.0 5.9)
SSE2 addpd (64bit x2) n8          :    0.102    12988.4     6494.2  (  2.0 5.9)
SSE2 mul+addpd (64bit x2) n8      :    0.101    13026.5     6513.2  (  2.0 5.9)
FMA  vfmaddpd (64bit x2) n8       :    0.103    25747.8     6437.0  (  4.0 5.9)
FMA  vfmaddpd (64bit x2) n12      :    0.143    27767.1     6941.8  (  4.0 6.3)
FMA  vfma+mlpd (64bit x2) n12     :    0.142    20892.0     6964.0  (  3.0 6.3)
FMA  vfma+adpd (64bit x2) n12     :    0.144    20638.5     6879.5  (  3.0 6.3)
SSE2 ml+ad+dpd (64bit x2) n9      :    0.108    13686.9     6843.4  (  2.0 6.2)
SSE2 mulsd (64bit x1) ns4         :    0.190     3475.3     3475.3  (  1.0 3.2)
SSE2 addsd (64bit x1) ns4         :    0.191     3463.5     3463.5  (  1.0 3.1)
SSE2 mulpd (64bit x2) ns4         :    0.191     6928.2     3464.1  (  2.0 3.1)
SSE2 addpd (64bit x2) ns4         :    0.190     6957.4     3478.7  (  2.0 3.2)
AVX vmulpd (64bit x4) n8          :    0.096    27464.0     6866.0  (  4.0 6.2)
AVX vaddpd (64bit x4) n8          :    0.095    27868.4     6967.1  (  4.0 6.3)
AVX vmul+addpd (64bit x4) n8      :    0.095    27776.9     6944.2  (  4.0 6.3)
FMA vfmaddpd (64bit x4) n8        :    0.101    52105.9     6513.2  (  8.0 5.9)
FMA vfmaddpd (64bit x4) n12       :    0.143    55476.2     6934.5  (  8.0 6.3)
FMA vfma+mlpd (64bit x4) n12      :    0.143    41631.3     6938.6  (  6.0 6.3)
FMA vfma+adpd (64bit x4) n12      :    0.142    41748.7     6958.1  (  6.0 6.3)
AVX vml_ad_adpd (64bit x4) n9     :    0.107    27790.8     6947.7  (  4.0 6.3)
AVX512 vmulpd (64bit x8) n12      :    0.294    26935.4     3366.9  (  8.0 3.1)
AVX512 vaddpd (64bit x8) n12      :    0.294    26918.9     3364.9  (  8.0 3.1)
AVX512 vfmaddpd (64bit x8) n12    :    0.294    53835.4     3364.7  ( 16.0 3.1)
AVX512 vfma+mpd (64bit x8) n12    :    0.293    40495.9     3374.7  ( 12.0 3.1)
AVX512 vfma+apd (64bit x8) n12    :    0.293    40512.9     3376.1  ( 12.0 3.1)
Average                           :    0.157    23158.1     5734.5  (  4.4 5.2)
Highest                           :    0.095    55476.2     6967.1  ( 16.0 6.3)


* Group 0:  Thread=8  Clock=1.100000 GHz  (mask:0)
* SSE/AVX (SP fp) multi-thread
                                      TIME(s)   MFLOPS      MOPS     FOP   IPC
SSE mulss (32bit x1) n8           :    0.244    21628.1     2703.5  (  8.0 2.5)
SSE addss (32bit x1) n8           :    0.207    25501.0     3187.6  (  8.0 2.9)
FMA vfmaddss (32bit x1) n8        :    0.207    51050.5     3190.7  ( 16.0 2.9)
FMA vfmaddss (32bit x1) n12       :    0.310    51031.1     3189.4  ( 16.0 2.9)
FMA vfma+mlss (32bit x1) n12      :    0.310    38279.6     4785.0  (  8.0 4.3)
FMA vfma+adss (32bit x1) n12      :    0.310    38294.5     4786.8  (  8.0 4.4)
SSE mulps (32bit x4) n8           :    0.207   102060.0     3189.4  ( 32.0 2.9)
SSE addps (32bit x4) n8           :    0.207   101944.3     3185.8  ( 32.0 2.9)
SSE mul+addps (32bit x4) n8       :    0.207   101863.1     3183.2  ( 32.0 2.9)
FMA vfmaddps (32bit x4) n8        :    0.207   204040.2     3188.1  ( 64.0 2.9)
FMA vfmaddps (32bit x4) n12       :    0.310   204328.4     3192.6  ( 64.0 2.9)
FMA vfma+mlps (32bit x4) n12      :    0.310   153210.2     3191.9  ( 48.0 2.9)
FMA vfma+adps (32bit x4) n12      :    0.310   153202.8     3191.7  ( 48.0 2.9)
SSE ml+ad+adps (32bit x4) n9      :    0.233   102156.6     3192.4  ( 32.0 2.9)
SSE mulss (32bit x1) ns4          :    0.231    22819.0     2852.4  (  8.0 2.6)
SSE addss (32bit x1) ns4          :    0.232    22796.0     2849.5  (  8.0 2.6)
SSE mulps (32bit x4) ns4          :    0.232    90991.3     2843.5  ( 32.0 2.6)
SSE addps (32bit x4) ns4          :    0.232    91226.8     2850.8  ( 32.0 2.6)
AVX vmulps (32bit x8) n8          :    0.207   204198.0     3190.6  ( 64.0 2.9)
AVX vaddps (32bit x8) n8          :    0.207   204240.5     3191.3  ( 64.0 2.9)
AVX vmul+addps (32bit x8) n8      :    0.207   204291.9     3192.1  ( 64.0 2.9)
FMA vfmaddps (32bit x8) n8        :    0.207   407368.1     3182.6  (128.0 2.9)
FMA vfmaddps (32bit x8) n12       :    0.311   407750.8     3185.6  (128.0 2.9)
FMA vfma+mlps (32bit x8) n12      :    0.311   305974.6     3187.2  ( 96.0 2.9)
FMA vfma+adps (32bit x8) n12      :    0.310   306219.1     3189.8  ( 96.0 2.9)
AVX vml+ad+adps (32bit x8) n9     :    0.262   181174.2     2830.8  ( 64.0 2.6)
AVX512 vmulps (32bit x16) n12     :    0.680   186291.0     1455.4  (128.0 1.3)
AVX512 vaddps (32bit x16) n12     :    0.682   185795.5     1451.5  (128.0 1.3)
AVX512 vfmaddps (32bit x16) n12   :    0.682   371673.9     1451.9  (256.0 1.3)
AVX512 vfma+mps (32bit x16) n12   :    0.683   278186.7     1448.9  (192.0 1.3)
AVX512 vfma+aps (32bit x16) n12   :    0.683   278194.9     1448.9  (192.0 1.3)
AVX512 vmulps (32bit x8) n12      :    0.316   200275.6     3129.3  ( 64.0 2.8)
AVX512 vaddps (32bit x8) n12      :    0.310   204165.1     3190.1  ( 64.0 2.9)
AVX512 vfmaddps (32bit x8) n12    :    0.306   413685.0     3231.9  (128.0 2.9)
Average                           :    0.320   173997.3     2962.1  ( 69.2 2.7)
Highest                           :    0.207   413685.0     4786.8  (256.0 4.4)


* Group 0:  Thread=8  Clock=1.100000 GHz  (mask:0)
* SSE/AVX (DP fp) multi-thread
                                      TIME(s)   MFLOPS      MOPS     FOP   IPC
SSE2 mulsd (64bit x1) n8          :    0.244    21634.6     2704.3  (  8.0 2.5)
SSE2 addsd (64bit x1) n8          :    0.207    25508.6     3188.6  (  8.0 2.9)
FMA  vfmaddsd (64bit x1) n8       :    0.207    51001.7     3187.6  ( 16.0 2.9)
FMA  vfmaddsd (64bit x1) n12      :    0.311    50924.6     3182.8  ( 16.0 2.9)
FMA  vfma+mlsd (64bit x1) n12     :    0.310    38294.8     4786.8  (  8.0 4.4)
FMA  vfma+adsd (64bit x1) n12     :    0.310    38309.1     4788.6  (  8.0 4.4)
SSE2 mulpd (64bit x2) n8          :    0.207    51029.3     3189.3  ( 16.0 2.9)
SSE2 addpd (64bit x2) n8          :    0.207    51025.8     3189.1  ( 16.0 2.9)
SSE2 mul+addpd (64bit x2) n8      :    0.207    51019.7     3188.7  ( 16.0 2.9)
FMA  vfmaddpd (64bit x2) n8       :    0.207   101970.4     3186.6  ( 32.0 2.9)
FMA  vfmaddpd (64bit x2) n12      :    0.311   101845.6     3182.7  ( 32.0 2.9)
FMA  vfma+mlpd (64bit x2) n12     :    0.311    76450.3     3185.4  ( 24.0 2.9)
FMA  vfma+adpd (64bit x2) n12     :    0.310    76611.1     3192.1  ( 24.0 2.9)
SSE2 ml+ad+dpd (64bit x2) n9      :    0.233    51085.6     3192.8  ( 16.0 2.9)
SSE2 mulsd (64bit x1) ns4         :    0.232    22757.1     2844.6  (  8.0 2.6)
SSE2 addsd (64bit x1) ns4         :    0.235    22456.0     2807.0  (  8.0 2.6)
SSE2 mulpd (64bit x2) ns4         :    0.232    45565.8     2847.9  ( 16.0 2.6)
SSE2 addpd (64bit x2) ns4         :    0.231    45703.8     2856.5  ( 16.0 2.6)
AVX vmulpd (64bit x4) n8          :    0.207   102169.6     3192.8  ( 32.0 2.9)
AVX vaddpd (64bit x4) n8          :    0.207   101864.1     3183.3  ( 32.0 2.9)
AVX vmul+addpd (64bit x4) n8      :    0.207   102073.9     3189.8  ( 32.0 2.9)
FMA vfmaddpd (64bit x4) n8        :    0.207   203845.3     3185.1  ( 64.0 2.9)
FMA vfmaddpd (64bit x4) n12       :    0.314   201503.0     3148.5  ( 64.0 2.9)
FMA vfma+mlpd (64bit x4) n12      :    0.314   151182.5     3149.6  ( 48.0 2.9)
FMA vfma+adpd (64bit x4) n12      :    0.318   149330.6     3111.1  ( 48.0 2.8)
AVX vml_ad_adpd (64bit x4) n9     :    0.246    96515.9     3016.1  ( 32.0 2.7)
AVX512 vmulpd (64bit x8) n12      :    0.682    92879.9     1451.2  ( 64.0 1.3)
AVX512 vaddpd (64bit x8) n12      :    0.682    92855.7     1450.9  ( 64.0 1.3)
AVX512 vfmaddpd (64bit x8) n12    :    0.682   185899.7     1452.3  (128.0 1.3)
AVX512 vfma+mpd (64bit x8) n12    :    0.682   139318.3     1451.2  ( 96.0 1.3)
AVX512 vfma+apd (64bit x8) n12    :    0.682   139338.1     1451.4  ( 96.0 1.3)
Average                           :    0.321    86515.2     2939.8  ( 35.1 2.7)
Highest                           :    0.207   203845.3     4788.6  (128.0 4.4)
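
The per-instruction rows in the log above can be read mechanically. The following is a minimal sketch, not part of the benchmark itself: the regular expression, the helper names `parse_row` and `check`, and the tolerances are all assumptions made for illustration. It also assumes the relationships that the printed numbers appear to follow, namely MFLOPS ≈ MOPS × FOP and IPC ≈ MOPS divided by the reported clock in MHz (1.1 GHz in this log).

```python
import re

# Matches one result row such as
# "AVX512 vfmaddps (32bit x8) n12    :    0.142   111310.2     6956.9  ( 16.0 6.3)"
# The "Average" and "Highest" summary rows have the same shape and also match.
ROW = re.compile(
    r"^(?P<name>.+?)\s*:\s+(?P<time>[\d.]+)\s+(?P<mflops>[\d.]+)\s+"
    r"(?P<mops>[\d.]+)\s+\(\s*(?P<fop>[\d.]+)\s+(?P<ipc>[\d.]+)\)\s*$"
)


def parse_row(line):
    """Return the row as a dict of floats, or None for non-result lines."""
    m = ROW.match(line)
    if m is None:
        return None
    fields = m.groupdict()
    row = {key: float(fields[key]) for key in ("time", "mflops", "mops", "fop", "ipc")}
    row["name"] = fields["name"].strip()
    return row


def check(row, clock_ghz, rel_tol=0.01, ipc_tol=0.06):
    """Check the assumed relations MFLOPS ~ MOPS*FOP and IPC ~ MOPS/clock(MHz)."""
    flops_ok = abs(row["mops"] * row["fop"] - row["mflops"]) <= rel_tol * row["mflops"]
    ipc_ok = abs(row["mops"] / (clock_ghz * 1000.0) - row["ipc"]) <= ipc_tol
    return flops_ok and ipc_ok


line = "AVX512 vfmaddps (32bit x8) n12    :    0.142   111310.2     6956.9  ( 16.0 6.3)"
row = parse_row(line)
print(row["name"], row["mflops"] / 1000.0, "GFLOPS")
print("consistent:", check(row, clock_ghz=1.1))
```

The row used here is the highest single-thread SP entry; dividing its MFLOPS value by 1000 gives the 111.310 GFLOPS reported as "SingleThread SP max" in the summary.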
opengl/vfpbenchlog.1597326592.txt.gz · Last modified: 2020/08/13 22:49 by oga
