Hardware Museum

Over 20 years of PC history

Logo

The Ultimate GPU Benchmark (2009 - 2013)

Published: (last update )


The third part is ready. With new test platform and many interesting games. The range of video cards is quite wide this time, starting with Radeon HD 5000 / Geforce GTX 400 series and up to Radeon R9 290X and GeForce GTX 780 Ti.

Introduction

Here it is - the third part of the videocards benchmark project. The goal is to explore performance of the first and second generation of the DX11 GPUs. The multi-GPU technology in this era was greatly improved with frame packing - so of course there are several Crossfire and SLI setups.

The test system was upgraded to the X99 platform. Xeon E5 1650 v3 is equivalent of Core i7-5930k, including the unlocked multiplier. Performance should be slightly better than Sandy Bridge-E, despite the lower clock. Also Haswell-E doesn't suffer from any PCI-E 3.0 compatibility issues.





Test System

Test System - Hardware

Test System - OS and Drivers

Test System - Games




Radeon R9 290X


Tested Video Cards

Radeon HD 5770Radeon HD 5850Radeon HD 5850 OCRadeon HD 5870Radeon HD 5970 3 × Radeon HD 5870Radeon HD 6870Radeon HD 6970 1 GB OC
GPUJuniperCypressCypressCypress2 × Cypress3 × CypressBartsCayman
ArchitectureTerascale 2Terascale 2Terascale 2Terascale 2Terascale 2Terascale 2Terascale 2Terascale 3
Technology40 nm40 nm40 nm40 nm40 nm40 nm40 nm40 nm
Die Size170 mm2334 mm2334 mm2334 mm22 × 334 mm23 × 334 mm2255 mm2389 mm2
Transistor Count1040 mil.2154 mil.2154 mil.2154 mil.2 × 2154 mil.3 × 2154 mil.1700 mil.2640 mil.
Transistor Density6.12 mil. / mm26.45 mil. / mm26.45 mil. / mm26.45 mil. / mm26.45 mil. / mm26.45 mil. / mm26.66 mil. / mm26.79 mil. / mm2
GPU Clock860 MHz725 MHz980 MHz850 MHz725 MHz850 MHz920 MHz925 MHz
Shader Clock860 MHz725 MHz980 MHz850 MHz725 MHz850 MHz920 MHz925 MHz
ROPs163232322 × 323 × 323232
TMUs407272802 × 803 × 805696
Compute Units101818202 × 203 × 201424
Shaders800 Unified1440 Unified1440 Unified1600 Unified2 × 1600 Unified3 × 1600 Unified1120 Unified1536 Unified
L1 Cache10 × 8 kB18 × 8 kB18 × 8 kB20 × 8 kB2 × 20 × 8 kB3 × 20 × 8 kB14 × 8 kB24 × 8 kB
L2 Cache256 kB512 kB512 kB512 kB2 × 512 kB3 × 512 kB512 kB512 kB
Memory1024 MB GDDR51024 MB GDDR51024 MB GDDR51024 MB GDDR51024 MB GDDR51024 MB GDDR51024 MB GDDR51024 MB GDDR5
Memory Clock4800 MHz4000 MHz5000 MHz4800 MHz4000 MHz4800 MHz4200 MHz6000 MHz
Bus Width128 bit256 bit256 bit256 bit2 × 256 bit3 × 256 bit256 bit256 bit
Memory Bandwidth76.8 GB/s128 GB/s160 GB/s154 GB/s256 GB/s3 × 154 GB/s134 GB/s192 GB/s
Fillrate (Pixel)13.8 GP/s23.2 GP/s31.4 GP/s27.2 GP/s2 × 23.2 GP/s3 × 27.2 GP/s29.4 GP/s29.6 GP/s
Fillrate (Texel)34.4 GT/s52.2 GT/s70.6 GT/s68 GT/s2 × 58 GT/s3 × 68 GT/s51.5 GT/s88.8 GT/s
Compute Power (FP32)1376 GFLOPS2088 GFLOPS2822 GFLOPS2720 GFLOPS2 × 2320 GFLOPS3 × 2720 GFLOPS2061 GFLOPS2842 GFLOPS
Compute Power (FP64)-418 GFLOPS564 GFLOPS544 GFLOPS2 × 464 GFLOPS3 × 544 GFLOPS-711 GFLOPS
Bus TypePCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0
TDP108 W151 W~250 W188 W294 W ~500 W151 W250 W
DirectX1111111111111111
OpenGL4.54.54.54.54.54.54.54.5
Launch Year20092009200920092009200920102010


Radeon HD 6970 2 GB OCRadeon HD 6990Radeon HD 7850 1 GBRadeon R9 270X OCRadeon HD 7950 BoostRadeon HD 7970Radeon R9 280X2 × Radeon R9 280X
GPUCaymanAntilesPitcairnPitcairnTahitiTahitiTahiti2 × Tahiti
ArchitectureTerascale 3Terascale 3CGN 1CGN 1CGN 1CGN 1CGN 1CGN 1
Technology40 nm40 nm28 nm28 nm28 nm28 nm28 nm28 nm
Die Size389 mm22 × 389 mm2212 mm2212 mm2352 mm2352 mm2352 mm22 × 352 mm2
Transistor Count2640 mil.2 × 2640 mil.2800 mil.2800 mil.4313 mil.4313 mil.4313 mil.2 × 4313 mil.
Transistor Density6.79 mil. / mm26.79 mil. / mm213.2 mil. / mm213.2 mil. / mm212.3 mil. / mm212.3 mil. / mm212.3 mil. / mm212.3 mil. / mm2
GPU Clock900 MHz880 MHz900 MHz1120 MHz860 MHz925 MHz1100 MHz1100 MHz
Shader Clock900 MHz880 MHz900 MHz1120 MHz860 MHz925 MHz1100 MHz1100 MHz
ROPs322 × 3232323232322 × 32
TMUs962 × 9664801121281282 × 128
Compute Units242 × 2416202832322 × 32
Shaders1536 Unified2 × 1536 Unified1024 Unified1280 Unified1792 Unified2048 Unified2048 Unified2 × 2048 Unified
L1 Cache24 × 8 kB2 × 24 × 8 kB16 × 8 kB20 × 8 kB28 × 16 kB32 × 16 kB32 × 16 kB2 × 32 × 16 kB
L2 Cache512 kB2 × 512 kB512 kB512 kB768 kB768 kB768 kB2 ×768 kB
Memory2048 MB GDDR52048 MB GDDR51024 MB GDDR52048 MB GDDR53072 MB GDDR53072 MB GDDR53072 MB GDDR53072 MB GDDR5
Memory Clock5800 MHz5000 MHz4800 MHz5600 MHz5000 MHz5500 MHz6000 MHz6000 MHz
Bus Width256 bit2 × 256 bit256 bit256 bit384 bit384 bit384 bit2 × 384 bit
Memory Bandwidth186 GB/s2 × 160 GB/s154 GB/s179 GB/s240 GB/s264 GB/s288 GB/s2 × 288 GB/s
Fillrate (Pixel)28.8 GP/s2 × 28.2 GP/s28.8 GP/s35.8 GP/s27.5 GP/s29.6 GP/s35.2 GP/s2 × 35.2 GP/s
Fillrate (Texel)86.4 GT/s2 × 84.5 GT/s57.6 GT/s89.6 GT/s96.3 GT/s118.4 GT/s140.8 GT/s2 × 140.8 GT/s
Compute Power (FP32)2765 GFLOPS2 × 2703 GFLOPS1843 GFLOPS2867 GFLOPS3082 GFLOPS3789 GFLOPS4506 GFLOPS2 × 4506 GFLOPS
Compute Power (FP64)691 GFLOPS2 × 676 GFLOPS115 GFLOPS179 GFLOPS771 GFLOPS947 GFLOPS1127 GFLOPS2 × 1127 GFLOPS
Bus TypePCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 3.0
TDP250 W375 W126 W180 W200 W260 W295 W2 × 295 W
DirectX1111121212121212
OpenGL4.54.54.64.64.64.64.64.6
Launch Year20102011201220132012201220132013


Radeon R9 290Radeon R9 290X OCRadeon R9 380XGeForce GTS 450GeForce GTX 460 OEMGeForce GTX 460 OCGeForce GTX 470GeForce GTX 480
GPUHawaiiHawaiiTongaGF106GF104GF104GF100GF100
ArchitectureCGN 2CGN 2CGN 3FermiFermiFermiFermiFermi
Technology28 nm28 nm28 nm40 nm40 nm40 nm40 nm40 nm
Die Size438 mm2438 mm2366 mm2238 mm2332 mm2332 mm2526 mm2526 mm2
Transistor Count6200 mil.6200 mil.5000 mil.1170 mil.1950 mil.1950 mil.3200 mil.3200 mil.
Transistor Density14.2 mil. / mm214.2 mil. / mm213.7 mil. / mm24.92 mil. / mm25.87 mil. / mm25.87 mil. / mm26.08 mil. / mm26.08 mil. / mm2
GPU Clock950 MHz1100 MHz980 MHz810 MHz650 MHz925 MHz608 MHz700 MHz
Shader Clock950 MHz1100 MHz980 MHz1620 MHz1300 MHz1850 MHz1216 MHz1400 MHz
ROPs6464321632324048
TMUs1601761283256565660
Compute Units4044324771415
Shaders2560 Unified2816 Unified2048 Unified192 Unified336 Unified336 Unified448 Unified480 Unified
L1 Cache40 × 16 kB44 × 16 kB32 × 16 kB4 × 64 kB7 × 64 kB7 × 64 kB14 × 64 kB15 × 64 kB
L2 Cache1024 kB1024 kB512 kB256 kB512 kB512 kB768 kB768 kB
Memory4096 MB GDDR54096 MB GDDR54096 MB GDDR51024 MB GDDR52048 MB GDDR51024 MB GDDR51280 MB GDDR51536 MB GDDR5
Memory Clock5000 MHz6200 MHz5700 MHz3600 MHz3400 MHz4200 MHz3350 MHz3700 MHz
Bus Width512 bit512 bit256 bit128 bit256 bit256 bit320 bit384 bit
Memory Bandwidth320 GB/s397 GB/s182 GB/s57.6 GB/s109 GB/s134 GB/s134 GB/s178 GB/s
Fillrate (Pixel)60.8 GP/s70.4 GP/s31.4 GP/s13 GP/s20.8 GP/s29.6 GP/s24.3 GP/s33.6 GP/s
Fillrate (Texel)152 GT/s193.6 GT/s125.4 GT/s25.9 GT/s36.4 GT/s51.8 GT/s34 GT/s42 GT/s
Compute Power (FP32)4864 GFLOPS6195 GFLOPS4014 GFLOPS622 GFLOPS874 GFLOPS1243 GFLOPS1089 GFLOPS1344 GFLOPS
Compute Power (FP64)608 GFLOPS774 GFLOPS251 GFLOPS52 GFLOPS73 GFLOPS104 GFLOPS136 GFLOPS168 GFLOPS
Bus TypePCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0
TDP275 W~350 W190 W106 W150 W~225 W215 W250 W
DirectX1212121111111111
OpenGL4.64.64.64.64.64.64.64.6
Launch Year20132013201520102010201020102010


2 × GeForce GTX 480GeForce GTX 560 Ti OC2 × GeForce GTX 560 Ti OCGeForce GTX 570 OCGeForce GTX 580 OC2 × GeForce GTX 580 OCGeForce GTX 660 OEMGeForce GTX 670
GPU2 × GF100GF1142 × GF114GF110GF1102 × GF110GK104GK104
ArchitectureFermiFermiFermiFermiFermiFermiKeplerKepler
Technology40 nm40 nm40 nm40 nm40 nm40 nm28 nm28 nm
Die Size2 × 526 mm2332 mm22 × 332 mm2520 mm2520 mm2520 mm2294 mm2294 mm2
Transistor Count2 × 3200 mil.1950 mil.2 × 1950 mil.3000 mil.3000 mil.3000 mil.3540 mil.3540 mil.
Transistor Density6.08 mil. / mm25.87 mil. / mm25.87 mil. / mm25.77 mil. / mm25.77 mil. / mm25.77 mil. / mm212 mil. / mm212 mil. / mm2
GPU Clock700 MHz1000 MHz950 MHz845 MHz900 MHz855 MHz950 MHz1070 MHz
Shader Clock1400 MHz2000 MHz1900 MHz1690 MHz1800 MHz1710 MHz950 MHz1070 MHz
ROPs2 × 48322 × 3240482 × 482432
TMUs2 × 60642 × 6460642 × 6496112
Compute Units2 × 1582 × 815162 × 1667
Shaders2 × 480 Unified384 Unified2 × 384 Unified480 Unified512 Unified2 × 512 Unified1152 Unified1344 Unified
L1 Cache2 × 15 × 64 kB8 × 64 kB2 × 8 × 64 kB15 × 64 kB16 × 64 kB2 × 16 × 64 kB6 × 16 kB + 48 kB Tex7 × 16 kB + 48 kB Tex
L2 Cache2 × 768 kB512 kB2 × 512 kB768 kB768 kB2 × 768 kB384 kB512 kB
Memory1536 MB GDDR51024 MB GDDR51024 MB GDDR51280 MB GDDR51536 MB GDDR51536 MB GDDR53072 MB GDDR52048 MB GDDR5
Memory Clock3700 MHz4800 MHz4580 MHz3800 MHz4800 MHz4600 MHz5600 MHz6000 MHz
Bus Width2 × 384 bit256 bit2 × 256 bit320 bit384 bit2 × 384 bit192 bit256 bit
Memory Bandwidth2 × 178 GB/s154 GB/s2 × 147 GB/s152 GB/s230 GB/s2 × 221 GB/s134 GB/s192 GB/s
Fillrate (Pixel)2 × 33.6 GP/s32 GP/s2 × 30.4 GP/s33.8 GP/s43.2 GP/s2 × 41 GP/s22.8 GP/s34.2 GP/s
Fillrate (Texel)2 × 42 GT/s64 GT/s2 × 60.8 GT/s50.7 GT/s57.6 GT/s2 × 54.7 GT/s91.2 GT/s119.8 GT/s
Compute Power (FP32)2 × 1344 GFLOPS1536 GFLOPS2 × 1459 GFLOPS1622 GFLOPS1843 GFLOPS2 × 1751 GFLOPS2189 GFLOPS2876 GFLOPS
Compute Power (FP64)2 × 168 GFLOPS128 GFLOPS2 × 122 GFLOPS203 GFLOPS154 GFLOPS2 × 146 GFLOPS91 GFLOPS120 GFLOPS
Bus TypePCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 2.0PCI-E 3.0PCI-E 3.0
TDP2 × 250 W~250 W~450 W220 W~320 W~600 W105 W200 W
DirectX1111111111111212
OpenGL4.64.64.64.64.64.64.64.6
Launch Year20102011201120102010201020122012


GeForce GTX 690GeForce GTX 760 OCGeForce GTX 770 OCGeForce GTX 780 @ 155 WGeForce GTX 780 OCGeForce GTX 780 Ti OCQuadro 4000
GPU2 × GK104GK104GK104GK110GK110GK110GF100
ArchitectureKeplerKeplerKeplerKeplerKeplerKeplerFermi
Technology28 nm28 nm28 nm28 nm28 nm28 nm40 nm
Die Size2 × 294 mm2294 mm2294 mm2561 mm2561 mm2561 mm2526 mm2
Transistor Count2 × 3540 mil.3540 mil.3540 mil.7100 mil.7100 mil.7100 mil.3200 mil.
Transistor Density12 mil. / mm212 mil. / mm212 mil. / mm212.7 mil. / mm212.7 mil. / mm212.7 mil. / mm26.08 mil. / mm2
GPU Clock1050 MHz1175 MHz1290 MHz900 MHz1100 MHz1200 MHz475 MHz
Shader Clock1050 MHz1175 MHz1290 MHz900 MHz1100 MHz1200 MHz950 MHz
ROPs2 × 32323248484832
TMUs2 × 1289612819219224032
Compute Units2 × 8681212158
Shaders2 × 1536 Unified1152 Unified1536 Unified2304 Unified2304 Unified2880 Unified256 Unified
L1 Cache2 × 8 × 16 kB + 48 kB Tex6 × 16 kB + 48 kB Tex8 × 16 kB + 48 kB Tex12 × 16 kB + 48 kB Tex12 × 16 kB + 48 kB Tex15 × 16 kB + 48 kB Tex8 × 64 kB
L2 Cache2 × 512 kB512 kB512 kB1536 kB1536 kB1536 kB512 kB
Memory2048 MB GDDR52048 MB GDDR54096 MB GDDR53072 MB GDDR53072 MB GDDR53072 MB GDDR52048 MB GDDR5
Memory Clock6000 MHz6000 MHz8000 MHz6000 MHz6000 MHz7000 MHz2800 MHz
Bus Width2 × 256 bit256 bit256 bit384 bit384 bit384 bit256 bit
Memory Bandwidth2 × 192 GB/s192 GB/s256 GB/s288 GB/s288 GB/s336 GB/s89.6 GB/s
Fillrate (Pixel)2 × 33.6 GP/s37.6 GP/s41.3 GP/s43.2 GP/s52.8 GP/s57.6 GP/s15200 MP/s
Fillrate (Texel)2 × 134.4 GT/s112.8 GT/s165.1 GT/s172.2 GT/s211.2 GT/s288 GT/s15200 MT/s
Compute Power (FP32)2 × 3076 GFLOPS2707 GFLOPS3963 GFLOPS4147 GFLOPS5069 GFLOPS6912 GFLOPS486 GFLOPS
Compute Power (FP64)2 × 128 GFLOPS113 GFLOPS165 GFLOPS173 GFLOPS211.8 GFLOPS288 GFLOPS243 GFLOPS
Bus TypePCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 3.0PCI-E 2.0
TDP256 W250 W~ 320 W~ 155 W275 W300 W142 W
DirectX12121212121211
OpenGL4.64.64.64.64.64.64.6
Launch Year2012201320132013201320132010

Next page