ユーザ用ツール

サイト用ツール


opengl:cpuflops

CPU FLOPS

CPU FLOPS

CPU FLOPS (32bit fp) Clock core op/clock op
PC Haswell Core i7-5960X 768.0 GFLOPS 3.0GHz 8 256 AVX: 2(mad) x8(simd) x2(unit) x8(core) x3.0(clock) = 768 GFLOPS
PC Haswell Core i7-5820K 633.6 GFLOPS 3.3GHz 6 192 AVX: 2(mad) x8(simd) x2(unit) x6(core) x3.3(clock) = 633.6 GFLOPS
PC Haswell Core i7-4790K 512.0 GFLOPS 4.0GHz 4 128 AVX: 2(mad) x8(simd) x2(unit) x4(core) x4.0(clock) = 512 GFLOSP
PC Haswell Core i7-4770K 448.0 GFLOPS 3.5GHz 4 128 AVX: 2(mad) x8(simd) x2(unit) x4(core) x3.5(clock) = 448 GFLOPS
PC Sandy Bridge Core i7-2770K 224.0 GFLOPS 3.5GHz 4 64 AVX: 8(simd) x2(unit) x4(core) x3.5(clock) = 224 GFLOPS
PS3 Cell 217.6 GFLOPS 3.2GHz 8 68 SPU: 2(mad) x4(simd) x7(core) x3.2(clock) = 179.2
VMX: 2(mad) x4(simd) x1(core) x3.2(clock) = 25.6
FPU: 2(mad) x2(unit) x1(core) x3.2(clock) = 12.8 = Total 217.6 GFLOPS
PC Sandy Bridge Core i7-2720QM 140.8 GFLOPS 2.2GHz 4 64 AVX: 8(simd) x2(unit) x4(core) x2.2(clock) = 140.8 GFLOPS
PC Ivy Bridge Core i7-3615QM 147.2 GFLOPS 2.3GHz 4 64 AVX: 8(simd) x2(unit) x4(core) x2.3(clock) = 147.2 GFLOPS
PC Kaveri A10-7850K 118.4 GFLOPS 3.7GHz 2 32 AVX: 2(mad) x8(simd) x2(module) x3.7(clock) = 118.4 GFLOPS
Xbox 360 Xenon 115.2 GFLOPS 3.2GHz 3 36 VMX: 2(mad) x4(simd) x3(core) x3.2(clock) = 76.8
FPU: 2(mad) x2(unit) x3(core) x3.2(clock) = 38.4 = Total 115.2 GFLOPS
Xbox One Jaguar 112.0 GFLOPS 1.75GHz 8 64 AVX: 4(simd) x2(unit) x8(core) x1.75(clock) = 112.0 GFLOPS
Apple A9 Twister 108.0 GFLOPS 2.25GHz 2 48 NEON: 2(mad) x4(simd) x3(unit) x2(core) x2.25(clock) = 108.0 GFLOPS
Apple A9 Twister 88.8 GFLOPS 1.85GHz 2 48 NEON: 2(mad) x4(simd) x3(unit) x2(core) x1.85(clock) = 88.8 GFLOPS
APQ8084 Krait 450 86.4 GFLOPS 2.7GHz 4 32 NEON: 2(mad) x4(simd) x4(core) x2.7(clock) = 86.4 GFLOPS
PC Nehalem Core i7-920 85.1 GFLOPS 2.66GHz 4 32 SSE: 4(simd) x2(unit) x4(core) x2.66(clock) = 85.12 GFLOPS
PC Ivy Bridge Core i5-3210M 80.0 GFLOPS 2.5GHz 2 32 AVX: 8(simd) x2(unit) x2(core) x2.5(clock) = 80.0 GFLOPS
MSM8994 Cortex-A15
Cortex-A7
76.0 GFLOPS 2.0GHz
1.5GHz
4+4 32+8 NEON: 2(mad) x2(simd) x2(unit) x4(core) x2.0(clock) = 64
NEON: 2(mad) x1(simd) x4(core) x1.5(clock) = 12 Total 76
PC Sandy Bridge Core i5-2415M 73.6 GFLOPS 2.3GHz 2 32 AVX: 8(simd) x2(unit) x2(core) x2.3(clock) = 73.6 GFLOPS
MSN8974 Krait 400 73.6 GFLOPS 2.3GHz 4 32 NEON: 2(mad) x4(simd) x4(core) x2.3(clock) = 73.6 GFLOPS
Apple A8X Typhoon 72.0 GFLOPS 1.5GHz 3 48 NEON: 2(mad) x4(simd) x2(unit) x3(core) x1.5(clock) = 72.0 GFLOPS
Exynos 5 Octa Cortex-A15
Cortex-A7
71.2 GFLOPS 1.9GHz
1.3GHz
4+4 32+8 NEON: 2(mad) x2(simd) x2(unit) x4(core) x1.9(clock) = 60.8
NEON: 2(mad) x1(simd) x4(core) x1.3(clock) = 10.4 Total 71.2
MSN8974 Krait 400 70.4 GFLOPS 2.2GHz 4 32 NEON: 2(mad) x4(simd) x4(core) x2.2(clock) = 70.4 GFLOPS
Tegra K1 Cortex-A15 70.4 GFLOPS 2.2GHz 4 32 NEON: 2(mad) x2(simd) x2(unit) x4(core) x2.2(clock) = 70.4 GFLOPS
PC Athlon 5350 Kabini Jaguar 64.0 GFLOPS 2.0GHz 4 32 AVX: 4(simd) x2(unit) x4(core) x2.0(clock) = 60.8 GFLOPS
Tegra 4 Cortex-A15 60.8 GFLOPS 1.9GHz 4 32 NEON: 2(mad) x2(simd) x2(unit) x4(core) x1.9(clock) = 60.8 GFLOPS
Tegra 4 Cortex-A15 57.6 GFLOPS 1.8GHz 4 32 NEON: 2(mad) x2(simd) x2(unit) x4(core) x1.8(clock) = 57.6 GFLOPS
Tegra K1 Denver 55.2 GFLOPS 2.3GHz 2 24 NEON: 2(mad) x4(simd) +4(add) x2(core) x2.3(clock) = 55.2 GFLOPS
PC Atom J1900 (Silvermont) 48.0-57.6 GFLOPS 2.0-2.41GHz 4 24 SSE: (4(simd add) + 2(simd mul)) x 4(core) x 2.0(clock) = 48.0GFLOPS (TB 57.6 at 2.41GHz)
APQ8064 Krait 48.0 GFLOPS 1.5GHz 4 32 NEON: 2(mad) x4(simd) x4(core) x1.5(clock) = 48.0 GFLOPS
Apple A7 Cyclone 44.8 GFLOPS 1.4GHz 2 32 NEON: 2(mad) x4(simd) x2(unit) x2(core) x1.3(clock) = 44.8 GFLOPS
Apple A7 Cyclone 41.6 GFLOPS 1.3GHz 2 32 NEON: 2(mad) x4(simd) x2(unit) x2(core) x1.3(clock) = 41.6 GFLOPS
PC Core2 Duo P8600 38.4 GFLOPS 2.4GHz 2 16 SSE: 4(simd) x2(unit) x2(core) x2(clock) = 32 GFLOPS
PC Core2 Duo P7350 32.0 GFLOPS 2.0GHz 2 16 SSE: 4(simd) x2(unit) x2(core) x2(clock) = 32 GFLOPS
MT8135 Cortex-A15
Cortex-A7
28.8 GFLOPS 1.5GHz
1.2GHz
2+2 16+4 NEON: 2(mad) x2(simd) x2(unit) x2(core) x1.5(clock) = 24
NEON: 2(mad) x1(simd) x2(core) x1.2(clock) = 4.8 Total 28.8
Tegra 3 Cortex-A9 27.2 GFLOPS 1.7GHz 4 16 NEON: 2(mad) x2(simd) x4(core) x1.7(clock) = 27.2 GFLOPS
Exynos 5D Cortex-A15 27.2 GFLOPS 1.7GHz 2 16 NEON: 2(mad) x2(simd) x2(unit) x2(core) x1.7(clock) = 27.2 GFLOPS
Apple A6X Swift 22.4 GFLOPS 1.4GHz 2 16 NEON: 2(mad) x4(simd) x2(core) x1.4(clock) = 22.4 GFLOPS
Tegra 3 Cortex-A9 19.2 GFLOPS 1.2GHz 4 16 NEON: 2(mad) x2(simd) x4(core) x1.2(clock) = 19.2 GFLOPS
MSN8260 Scorpion 19.2 GFLOPS 1.2GHz 2 16 NEON: 2(mad) x4(simd) x2(core) x1.2(clock) = 19.2 GFLOPS
Atom Z2560 Saltwell 19.2 GFLOPS 1.6GHz 2 12 SSE: (4(simd add) + 2(simd mul)) x2(core) x1.6(clock) = 19.2 GFLOPS
PC Atom Z540 Bonnell 11.2 GFLOPS 1.86GHz 1 6 SSE: (4(simd add) + 2(simd mul)) x1.86(clock) = 11.16 GFLOPS
OMAP 4430 Cortex-A9 9.6 GFLOPS 1.2GHz 2 8 NEON: 2(mad) x2(simd) x2(core) x1.2(clock) = 9.6 GFLOPS
PC Atom N270 9.6 GFLOPS 1.6GHz 1 6 SSE: (4(simd add) + 2(simd mul)) x1.6(clock) = 9.6 GFLOPS
MT8125 Cortex-A7 9.6 GFLOPS 1.2GHz 4 8 NEON: 2(mad) x1(simd) x4(core) x1.2(clock) = 9.6 GFLOPS
Apple A5 Cortex-A9 8.0 GFLOPS 1.0GHz 2 8 NEON: 2(mad) x2(simd) x2(core) x1.0(clock) = 8.0 GFLOPS
QSD8250 Scorpion 8.0 GFLOPS 1.0GHz 1 8 NEON: 2(mad) x4(simd) x1.0(clock) = 8.0 GFLOPS
Raspberry Pi 2 Cortex-A7 7.2 GFLOPS 0.9GHz 4 8 NEON: 2(mad) x1(simd) x4(core) x0.9(clock) = 7.2 GFLOPS
PS2 Emotion Engine 6.2 GFLOPS 0.299GHz 1 20?
Apple A4 Cortex-A8 4.0 GFLOPS 1.0GHz 1 4 NEON: 2(mad) x2(simd) x1.0(clock) = 4.0 GFLOPS
Tegra 2 Cortex-A9 4.0 GFLOPS 1.0GHz 2 4 VFP: 2(mad) x 2(core) x 1(clock) = 4.0 GFLOPS
PSP MIPS 4000 2.6 GFLOPS 0.333GHz 1 8 SIMD: 2(mad) x4(simd) x0.333(clock) = 2.664 GFLOPS
iPhone 3GS Cortex-A8 2.4 GFLOPS 0.6GHz 1 4 NEON: 2(mad) x2(simd) x0.6(clock) = 2.4 GFLOPS
Xbox Pentium 3 1.5 GFLOPS 0.733GHz 1 2 SSE: 2(simd) x0.733(clock) = 1.466 GFLOPS
Dreamcast SH4 1.4 GFLOPS 0.2GHz 1 7 SIMD: 7(dotproduct 4mul + 3add) x 0.2(clock) = 1.4 GFLOPS
Raspberry Pi ARM1176JZF-S 0.7 GFLOPS 0.7GHz 1 1 VFP: 2(mad) x 0.5(cycle) x 0.7(clock) = 0.7 GFLOPS
iPhone 3G ARM1176JZF-S 0.4 GFLOPS 0.412GHz 1 1 VFP: 2(mad) x 0.5(cycle) x 0.412(clock) = 0.412 GFLOPS

理論上のピーク値です。実際にこの数値が出るわけではありません。表には間違いが含まれている可能性があります。

測定できるものは下記のように実測にて mad(fma) の bit 幅と Unit 数を推測しています。

計測アプリを公開しました。

参考

FPU

opengl/cpuflops.txt · 最終更新: 2016/04/08 21:25 by oga