A new GPU, a familiar problem
NVIDIA’s GM206 based GTX 960 card is finally launching, after weeks of rumors and leaks. Can it topple AMD’s lead at $199?
Editor's Note: Don't forget to join us today for a live streaming event featuring Ryan Shrout and NVIDIA's Tom Petersen to discuss the new GeForce GTX 960. It will be live at 1pm ET / 10am PT and will include ten (10!) GTX 960 prizes for participants! You can find it all at https://www.pcper.com/live
There are no secrets anymore. Calling today's release of the NVIDIA GeForce GTX 960 a surprise would be like calling another Avenger's movie unexpected. If you didn't just assume it was coming chances are the dozens of leaks of slides and performance would get your attention. So here it is, today's the day, NVIDIA finally upgrades the mainstream segment that was being fed by the GTX 760 for more than a year and half. But does the brand new GTX 960 based on Maxwell move the needle?
But as you'll soon see, the GeForce GTX 960 is a bit of an odd duck in terms of new GPU releases. As we have seen several times in the last year or two with a stagnant process technology landscape, the new cards aren't going be wildly better performing than the current cards from either NVIDIA for AMD. In fact, there are some interesting comparisons to make that may surprise fans of both parties.
The good news is that Maxwell and the GM206 GPU will price out starting at $199 including overclocked models at that level. But to understand what makes it different than the GM204 part we first need to dive a bit into the GM206 GPU and how it matches up with NVIDIA's "small" GPU strategy of the past few years.
The GM206 GPU – Generational Complexity
First and foremost, the GTX 960 is based on the exact same Maxwell architecture as the GTX 970 and GTX 980. The power efficiency, the improved memory bus compression and new features all make their way into the smaller version of Maxwell selling for $199 as of today. If you missed the discussion on those new features including MFAA, Dynamic Super Resolution, VXGI you should read that page of our original GTX 980 and GTX 970 story from last September for a bit of context; these are important aspects of Maxwell and the new GM206.
NVIDIA's GM206 is essentially half of the full GM204 GPU that you find on the GTX 980. That includes 1024 CUDA cores, 64 texture units and 32 ROPs for processing, a 128-bit memory bus and 2GB of graphics memory. This results in half of the memory bandwidth at 112 GB/s and half of the peak compute capability at 2.30 TFLOPS.
Those are significant specification hits and will result in a drop of essentially half the gaming performance for the GTX 960 compared to the GTX 980. Some readers and PC enthusiasts will immediately recognize the GTX 960 as a bigger drop from the flagship part than recent generations of graphics cards from NVIDIA. You're not wrong.
GTX 960 | GTX 970 | GTX 980 | GTX 760 | GTX 770 | GTX 780 | GTX 660 | GTX 670 | GTX 680 | |
---|---|---|---|---|---|---|---|---|---|
GPU | GM206 | GM204 | GM204 | GK104 | GK104 | GK110 | GK106 | GK104 | GK104 |
GPU Cores | 1024 | 1664 | 2048 | 1152 | 1536 | 2304 | 960 | 1344 | 1536 |
Rated Clock | 1126 MHz | 1050 MHz | 1126 MHz | 980 MHz | 1046 MHz | 863 MHz | 980 MHz | 915 MHz | 1006 MHz |
Texture Units | 64 | 104 | 128 | 96 | 128 | 192 | 80 | 112 | 128 |
ROP Units | 32 | 64 | 64 | 32 | 32 | 48 | 24 | 32 | 32 |
Memory | 2GB | 4GB | 4GB | 2GB | 2GB | 3GB | 2GB | 2GB | 2GB |
Memory Clock | 7000 MHz | 7000 MHz | 7000 MHz | 6000 MHz | 7000 MHz | 6000 MHz | 6000 MHz | 6000 MHz | 6000 MHz |
Memory Interface | 128-bit | 256-bit | 256-bit | 256-bit | 256-bit | 384-bit | 192-bit | 256-bit | 256-bit |
Memory Bandwidth | 112 GB/s | 224 GB/s | 224 GB/s | 192 GB/s | 224 GB/s | 288 GB/s | 144 GB/s | 192 GB/s | 192 GB/s |
TDP | 120 watts | 145 watts | 165 watts | 170 watts | 230 watts | 250 watts | 140 watts | 170 watts | 195 watts |
Peak Compute | 2.30 TFLOPS | 3.49 TFLOPS | 4.61 TFLOPS | 2.25 TFLOPS | 3.21 TFLOPS | 3.97 TFLOPS | 1.81 TFLOPS | 2.46 TFLOPS | 3.09 TFLOPS |
Transistor Count | 2.94B | 5.2B | 5.2B | 3.54B | 3.54B | 7.08B | 2.54B | 3.54B | 3.54B |
Process Tech | 28nm | 28nm | 28nm | 28nm | 28nm | 28nm | 28nm | 28nm | 28nm |
MSRP | $199 | $329 | $549 | $249 | $399 | $649 | $230 | $399 | $499 |
This table compares the last three brand generations of NVIDIA's GeForce cards from x80, x70 and x60 products. Take a look at the GTX 680, a card based on the GK104 GPU and the GTX 660 based on GK106; the mainstream card has 62.5% of the CUDA cores and 75% of the memory bus width. The GTX 760 is actually based on the same GK104 GPU as the GTX 680 and GTX 770 and includes a wider 256-bit memory bus though dropped to half of the CUDA cores of the GK110-based GTX 780.
It's complicated (trust me, I know), but NVIDIA definitely wants to get to smaller GPU dies again on the lower-priced parts. Way back in 2012 NVIDIA released the GTX 660 with a 2.54 billion transistor die on the 28nm process, but stayed performance competitive with the 700-series. The GTX 760 jumped up a lot to a 3.54 billion transistor die and increased the price up to $250 at launch. Today's release of the GTX 960 is down to 2.94 billion transistors, near that of the GTX 660, but with a lower starting price point of $199.
Power use on the GTX 960 is amazingly low with a rated TDP of 120 watts and in our testing the GPU almost never even approaches that level. In fact, when playing a game like DOTA 2 with V-Sync off (60 FPS cap) the card barely draws more than 35 watts! (More details on that on the power page.)
In the press documentation from NVIDIA, the company makes several attempts to put a better spin on the specifications surround the GeForce GTX 960. For the first time, NVIDIA mentions an "effective memory clock" rate that is justified by the efficiency improvement in memory compression of Maxwell over Kepler. While this is definitely true, it's been true between generations for years and is part of the reason analysis of GPUs lie ours continue to exist. Creating metrics to selective improve line items is a bad move, and I expressed as much during our early meetings.
Separately, NVIDIA is moving forward with the continued emphasis on MFAA performance numbers. Remember that multi-frame sampled anti-aliasing (MFAA) was launched with the GTX 980 and GTX 970, and uses a post-processing filter to combine multiple frames temporally at 2xMSAA quality with shifted sample points. The result is a 4xMSAA look at 2xMSAA performance, at least in theory. When the GTX 980 and GTX 970 launched game support was incredibly limited, making the feature less than exciting. With this new driver, Maxwell GPUs will be able to support MFAA on all DirectX 11 and 10 games that support MSAA excluding only Dead Rising 3, Dragon Age 2 and Max Payne 3. That applies to most games we test in our suite including Crysis 3, Battlefield 4 and Skyrim; other games like Metro: Last Light or Bioshock Infinite use internal AA methods, not driver-based MSAA, and thus are unable to utilize MFAA.
When NVIDIA defaults to using MSAA, they are comparing 2xMSAA with MFAA (4xAA quality essentially) to 4xMSAA on other cards. To its credit, NVIDIA says they are only comparing this way to previous NVIDIA hardware, not to AMD's competing hardware. My thoughts on this are mixed at this point as it will no doubt start a race from both parties to fully integrate and showcase custom, proprietary AA methods exclusively going forward. See my page on MFAA performance later in the review for more details.
There are interesting comparisons to be made between the new GTX 960 and the currently shipping competing parts from AMD. Some of the specification differences will be claimed as important advantages for the Radeon line up. Obviously our performance evaluation will be the final deciding factor, but is there anything to these claims?
GTX 960 | GTX 760 | R9 285 | R9 280 | |
---|---|---|---|---|
GPU | GM206 | GK104 | Tonga | Tahiti |
GPU Cores | 1024 | 1152 | 1792 | 1792 |
Rated Clock | 1126 MHz | 980 MHz | 918 MHz | 827 MHz |
Texture Units | 64 | 96 | 112 | 112 |
ROP Units | 32 | 32 | 32 | 32 |
Memory | 2GB | 2GB | 2GB | 3GB |
Memory Clock | 7000 MHz | 6000 MHz | 5500 MHz | 5000 MHz |
Memory Interface | 128-bit | 256-bit | 256-bit | 384-bit |
Memory Bandwidth | 112 GB/s | 192 GB/s | 176 GB/s | 240 GB/s |
TDP | 120 watts | 170 watts | 190 watts | 250 watts |
Peak Compute | 2.30 TFLOPS | 2.25 TFLOPS | 3.29 TFLOPS | 3.34 TFLOPS |
Transistor Count | 2.94B | 3.54B | 5.0B | 4.3B |
Process Tech | 28nm | 28nm | 28nm | 28nm |
MSRP | $199 | $249 | $249 | $249 |
While comparing GPU core counts is useless between architectures, many of the other data points can be debated. The most prominent difference is the 128-bit memory bus that GM206 employs when compared to the R9 285 with a 256-bit memory bus or even the R9 280 with its massive 384-bit memory bus. Raw memory bandwidth is the net result of this - the GTX 960 only sports 112 GB/s while the R9 280 tosses out 240 GB/s, more than twice the value. This allows the R9 280 to have a 3GB frame buffer but also means it has disadvantage in TDP and transistor count / die size. An additional 1.4 billion transistors and 130 watts of thermal headroom are substantial. The Tonga GPU in the R9 285 has more than 2.0 billion additional transistors when compared to the GTX 760 - what they all do though is still up for debate.
There is no denying that from a technological point of view, having a wider memory bus and higher memory bandwidth is a good thing for performance. But it comes at cost - both in terms of design and in terms of AMD's wallet. Can NVIDIA really build a GPU that is both substantially smaller but equally as powerful?
i r disappoint
i r disappoint
edit
edit
Why is the texture fill rate
Why is the texture fill rate for the 970 lower than the 980, aren’t they using the same memory architecture?
What is the minimum power
What is the minimum power supply requirements for 950 SLI
Live Q: can any other card be
Live Q: can any other card be used in SLI with an NVIDIA GeForce GTX 960 ?
no, you can only sli with the
no, you can only sli with the same card family. i’ve read that some games won’t even work with different brand cards. I find that one hard to believe but worth double checking. 2 cards doesn’t give you 4gb of videoram. You will still only have 2 gb for gaming.
This is more of a GTX 950 Ti
This is more of a GTX 950 Ti to be honest. While the power consumption is excellent, the performance is nowhere near the x60 class. The gap between this card and the 970 is simply too big. A cut-down GM204 with 1280:80:48 3GB 192bit will fit the 960 monika better.
So when is this power
So when is this power efficiency going to turned into affordable performance? Nothing to see here.
This card is just is about as
This card is just is about as close as you can get to exactly as powerful as a gtx 760 with the same amount of vram and and a 128 bit bus. The only place you see true improvements are in the titles and benchmarks that Nvidia spent time making driver/firmware improvements. Look at older benchmarks like Heaven or 3dmark vantage and the gtx 970 and 760 are neck and neck. We got a much more power efficient card but at the same time a crippled memory bus to save money. Except for the niche htpc market (which I may or may not belong to) this is really isn’t that exciting and pretty much a sidegrade to the existing gtx 760. Even the compute numbers clearly show this.
I think a lot of people here
I think a lot of people here hating on the 960 are missing a key point – SLI. $200×2=$400. That’s $400 for great fps at 2650×1440 with high settings, barely any heat, and barely any noise.
I think it’s a great deal, and I think Nvidia has a TON of room to lower the price if they want/need. Who cares though, its still only $400 for 2x GTX 960 that can basically do everything youll need. Play more, worry less.
What about the people that
What about the people that don’t want to deal with crappy SLI profiles for games that are getting lazier and lazier with optimization. I want to spend money now for something that plays everything fairly decent and in a year or two buy another.
Will the 128-bit memory
Will the 128-bit memory interface cause any performance hit in modern games?
In layman terms, how does
In layman terms, how does Nvidia make their performance so smooth compared to the competition? It’s the one thing in particular I like about their products. 🙂
Do you know why the
Do you know why the performance differential for Crysis 3 is so much smaller than for the other games? I was considering this card for an upgrade, but if it doesn’t play nice with the CryEngine 3 (Star Citizen in particular) then it might not be worth it.
I am on the Sapphire Vapor-X
I am on the Sapphire Vapor-X R9 290 right now but I just don’t see anyone making a really smart choice by going for this slow of a card. I would at least get the 970.
Call optimization for what it is but PC games now are crushing through 2GB of vram all day long. Dying light at 1080p and max settings is hitting 4gb. Farcry 4 hitting 3GB. Shadow of Mordor 3.5GB. All of this at 1080p…
It’s funny to see reviews
It’s funny to see reviews about gaming GPU’s that emphasize power consumption, and decibel levels. My son and I game everyday for hours on a regular basis. We play games for entertainment because we have a little bit of disposable income I can use for just playing… paying a few cents worth of electricity extra is not even a concern, we saved more by buying a GPU with a better price.
Decibles… seriously? We aren’t playing in a library… and most gamers have headphones that make the decibel level of pc’s really un-noticeable. Even if we are recording with sensitive mics for gameplay uploads… our decibel level is not a concern.
Maybe it’s just me… and I just don’t understand the new sophisticated testing methods. But as a father who enjoys gaming with my son, price… and a GPU’s in game playability is my main concern. I refuse to pay the premium price some manufacturers charge just for a few less watts and decibles… we can use that saved money buying new games, and snacks.
Yes yes we all get it…the
Yes yes we all get it…the 960 isnt blowing anyone away with raw performance. At the same time, everyone is giving this card a ton of crap and saying how over priced it is…is this coming from the same crowd that has been paying $250 for a 760 as close as a couple months ago? A TON of people have a 760. The 960 gives you marginally better performance using much less power, giving off much less heat, less noise AND at $200. If power, heat and noise do not concerns you, then think of it as getting an overclocked 760 at a $50 discount.
Finally upgraded my Radeon
Finally upgraded my Radeon 5850 after a long and painful to tolerate card failure. So happy with this card.
NEVER AMD again !
Fry’s price-matched Amazon and same $10 rebate.
$189 after rebate ; )
Is it worth it to upgrade
Is it worth it to upgrade from my gtx 560 to the 960 because from looking at this review it’s not…