Looking Towards the Professionals
We now dive into the compute performance side of the TITAN V, where it real shines.
This is a multi-part story for the NVIDIA Titan V:
Earlier this week we dove into the new NVIDIA Titan V graphics card and looked at its performacne from a gaming perspective. Our conclusions were more or less what we expected – the card was on average ~20% faster than the Titan Xp and about ~80% faster than the GeForce GTX 1080. But with that $3000 price tag, the Titan V isn't going to win any enthusiasts over.
What the Titan V is meant for in reality is the compute space. Developers, coders, engineers, and professionals that use GPU hardware for research, for profit, or for both. In that case, $2999 for the Titan V is simply an investment that needs to show value in select workloads. And though $3000 is still a lot of money, keep in mind that the NVIDIA Quadro GP100, the most recent part with full-performance double precision compute from the Pascal chip, is still selling for well over $6000 today.
The Volta GV100 GPU offers 1:2 double precision performance, equating to 2560 FP64 cores. That is a HUGE leap over the GP102 GPU used on the Titan Xp that uses a 1:32 ratio, giving us just 120 FP64 cores equivalent.
|Titan V||Titan Xp||GTX 1080 Ti||GTX 1080||GTX 1070 Ti||GTX 1070||RX Vega 64 Liquid||Vega Frontier Edition|
|Base Clock||1200 MHz||1480 MHz||1480 MHz||1607 MHz||1607 MHz||1506 MHz||1406 MHz||1382 MHz|
|Boost Clock||1455 MHz||1582 MHz||1582 MHz||1733 MHz||1683 MHz||1683 MHz||1677 MHz||1600 MHz|
|Memory Clock||1700 MHz MHz||11400 MHz||11000 MHz||10000 MHz||8000 MHz||8000 MHz||1890 MHz||1890 MHz|
|384-bit G5X||352-bit G5X||256-bit G5X||256-bit||256-bit||2048-bit HBM2||2048-bit HBM2|
|Memory Bandwidth||653 GB/s||547 GB/s||484 GB/s||320 GB/s||256 GB/s||256 GB/s||484 GB/s||484 GB/s|
|TDP||250 watts||250 watts||250 watts||180 watts||180 watts||150 watts||345 watts||300 watts|
|Peak Compute||12.2 (base) TFLOPS
14.9 (boost) TFLOPS
|12.1 TFLOPS||11.3 TFLOPS||8.2 TFLOPS||7.8 TFLOPS||5.7 TFLOPS||13.7 TFLOPS||13.1 TFLOPS|
|Peak DP Compute||6.1 (base) TFLOPS
7.45 (boost) TFLOPS
|0.37 TFLOPS||0.35 TFLOPS||0.25 TFLOPS||0.24 TFLOPS||0.17 TFLOPS||0.85 TFLOPS||0.81 TFLOPS|
The current AMD Radeon RX Vega 64, and the Vega Frontier Edition, all ship with a 1:16 FP64 ratio, giving us the equivalent of 256 DP cores per card.
Test Setup and Benchmarks
Our testing setup remains the same from our gaming tests, but obviously the software stack is quite different.
|PC Perspective GPU Testbed|
|Processor||Intel Core i7-5960X Haswell-E|
|Motherboard||ASUS Rampage V Extreme X99|
|Memory||G.Skill Ripjaws 16GB DDR4-3200|
|Storage||OCZ Agility 4 256GB (OS)
Adata SP610 500GB (games)
|Power Supply||Corsair AX1500i 1500 watt|
|OS||Windows 10 x64|
Applications in use include:
- Cinebench R15
- Sisoft Sandra GPU Compute
- SPECviewperf 12.1
Let's not drag this along – I know you are hungry for results! (Thanks to Ken for running most of these tests for us!!)