Haswell – A New Architecture
It’s finally here, the Intel Haswell processor in the form of the Core i7-4770K for desktop users. Is it time to upgrade?
Thanks for stopping by our coverage of the Intel Haswell, 4th Generation Core processor and Z87 chipset release! We have a lot of different stories for you to check out and I wanted to be sure you knew about them all.
- PCPer Live! ASUS Z87 Motherboard and Intel Haswell Live Event! – Tuesday, June 4th we will be hosting a live streaming event with JJ from ASUS. Stop by to learn about Z87 and overclocking Haswell and to win some motherboards and graphics cards!
- ASUS ROG Maximus VI Extreme Motherboard Review
- MSI Z87-GD65 Gaming Motherboard Review
- ASUS Gryphon Z87 Micro-ATX Motherboard Review
This spring has been unusually busy for us here at PC Perspective – with everything from new APU releases from AMD, new graphics cards from NVIDIA and now new desktop and mobile processors from Intel. There has never been a better time to be a technology enthusiast though some would argue that the days of the enthusiast PC builder are on the decline. Looking at the revived GPU wars and the launch of Intel's Haswell architecture, 4th Generation Core processors we couldn't disagree more.
Built on the same 22nm process technology that Ivy Bridge brought to the world, Haswell is a new architecture from Intel that really changes focus for the company towards a single homogenous design that has the ability to span wide ranging markets. From tablets to performance workstations, Haswell will soon finds its way into just about every crevasse of your technology life.
Today we focus on the desktop though – the release of the new Intel Core i7-4770K, fully unlocked, LGA1150 processor built for the Z87 chipset and DIY builders everywhere. In this review we'll discuss the architectural changes Haswell brings, the overclocking capabilities and limitations of the new design, application performance, graphics performance and quite a bit more.
Haswell remains a quad-core processor built on 1.4 billion transistors in a die measuring 177 mm2 with integrated processor graphics, shared L3 cache, dual channel DDR3 memory controller. But much has changed – let's dive in.
The Haswell Architecture
I have already done quite a bit of writing about the Haswell architecture itself, but much of it is going to be new to our readers or at the very least many will need a refresher. Let's dive into some of the details that were first revealed at the Intel Developer Forum last September.
While Sandy Bridge and Ivy Bridge were really derivatives of prior designs and thought processes, the Haswell design is something completely different for the company. Yes, the microarchitecture of Haswell is still very similar to Sandy Bridge (SNB), but the differences are more philosophical rather than technological.
Intel's target is a converged core: a single design that is flexible enough to be utilized in mobility devices like tablets while also scaling to the performance levels required for workstations and servers. They retain the majority of the architecture design from Sandy Bridge and Ivy Bridge including the core as well as the key features that make Intel's parts unique: HyperThreading, Intel Turbo Boost, and the ring interconnect.
The three pillars that Intel wanted to address with Haswell were performance, modularity, and power innovations. Each of these has its own key goals including improving performance of legacy code (existing), and having the ability to extract greater parallelism with less coding work for developers.
The modularity of Haswell is what gives the processor design its extreme flexibility while providing a consistent optimization path for software developers. The ability for a designer to write an application that can run (though at different feature or performance levels) across the entire array of devices that Haswell will find its way in is powerful.
Haswell (at least in this iteration) will be available in various different configurations including 2-4 processing cores, three different levels of graphics subsystem, differing idle and active power levels, interconnects, and platforms. This will greatly increase the power and performance ranges of Haswell compared to Ivy Bridge (and Sandy Bridge) and is enabled by the system agent that acts as the intermediary between all of the components on the SoC.
Intel also claims that Haswell will permit third-party IP integration, and thus will be capable of adding specific features and technologies as the OEMs demand.
Changes to power management on Haswell address both active (in use) and sleep states in order to see the biggest alterations from previous architectures. The goal is to lower the power consumption required during CPU load while also decreasing the amount of time it takes for the entire system to enter and leave sleep states. Intel introduces a new S0ix status that it is borrowing from the ultra-mobile designs of Atom to get a 20x improvement in low power states, and allows improved realizable battery life.
Just as important as the new states themselves is that Intel claims they are completely transparent to "well written" software.
Other changes in the design address power with Haswell, including changes to Turbo Boost technology and more granular voltage and frequency "islands" for the CPU to enter. Also changed from SNB and IVB is that the frequency of the cores is decoupled from the ring bus allowing voltages to scale more gracefully to where the power is actually needed. For example, Ivy Bridge and Sandy Bridge both required power to increase on the CPU cores when the GPU needed more bandwidth on the ring interconnect for other purposes, which is a waste of valuable power.
While we talked about the idle power changes in the slide above, Intel also pointed out that at this point that is is the only CPU vendor that has complete control over its manufacturing. Intel can utilize that advantage by tweaking the process in very specific ways to meet any goals that the engineers might have.
Because the majority of Haswell designs will be completely Intel-based platforms, it makes sense for Intel to address this as well. You will see new voltage regulators and better power-managed controllers (embedded now) in addition to new IO options like I2C, SDIO and I2s that are traditionally only found in mobile devices. New link power states for traditional IO connections like USB and SATA are being introduced that can nearly drop power draw at idle to zero watts.
Haswell Microarchitecture Changes
While the Haswell design is based mainly on the architecture introduced with Sandy Bridge, there are some changes that Intel made to improve performance in the more typical fashion with an eye towards IPC (instructions per clock).
There were no changes in the key pipelines of Haswell but there were many areas that Intel said are "typical improvement points" for the company. The branch predictor has been improved as this is usually the best return on time investment from a CPU-design stand point; Intel increased the buffers on the OOO (out of order) structures in order to help improve the ability for the processor to find parallelism and take advantage of it.
Throughput also sees a boost, with 8 total ports on the reservation station with another ALU unit, another branching unit, and address store. This gives Haswell some improved metrics like two branches per cycle and two floating point MADDs per cycle – both improvements over what we saw in Sandy Bridge and Ivy Bridge processors.
New compute instructions expand on AVX, doubling both single precision and double precision FLOPs per core per cycle. Other new instructions accelerate very specific algorithms with updates for extract and deposits, bit manipulation, rotates, etc.
The cache implementation also sees interesting changes with Haswell including a doubling of the bandwidth to 32-bits wide and one L2 cache read every cycle. Seeing both L1 and L2 cache bandwidths double in a single generation without changing the organization and size of those structures is impressive, though it needs more explanation as well.
Another big upcoming change is the introduction of transactional synchronization extensions (TSX). TSX is a method to improve concurrency and multi-threadedness with as little work for the programmer as possible. By using these new ISA extensions, a developer can apply simple prefixes and suffixes to code blocks to indicate that they are independent and can be run in parallel. Hardware is then capable of managing transactional updates and restart execution if the required block isn't able to be run.
While this might be pretty specific to discuss with our audience, the implications are impressive. Increasing the parallelization of software is one of the key issues holding back innovation on many levels. We have seen the GPU vendors fight this (think CUDA) for years, and Intel's continued push into the MIC (many integrated core) markets will require it as well. If you are interested in this technology, you should check out David Kanter's detailed analysis of it.
In every graph 3570K is named
In every graph 3570K is named i7 instead of i5. Damn you copy paste 🙂
so as expected its a big MEH
so as expected its a big MEH
Micro Center has the I7-4770K
Micro Center has the I7-4770K for $279, $70 cheaper than Newegg.
Even better. They dropped
Even better. They dropped the price on the 3770K by $130 to $229.
WTF Newegg/resellers. Have you really been gouging people this long?
Its been that way for a long
Its been that way for a long time if you have a local MicroCenter to pick one up at, otherwise its still ~$320 at most etailers
They are not making any money
They are not making any money at that price. DUH
I`m going Haswell when Blue
I`m going Haswell when Blue hits.
Also , I saw in other tech sites that the 4770 is not the top performing Haswell chip that will be released.
There will be others with more GPU horsepower ?
I presently have a C2Q 9550
I presently have a C2Q 9550 12 MB L2 cache…would I get much benefit ?
Stepping up from a C2Q 9550
Stepping up from a C2Q 9550 (same chip I have now) to just an i7 920 would be a huge leap, let alone SB being another sizable jump, with the 5% from both Haswell and IB I think it’s safe to say you will see major performance boost even with a 1Ghz OC on that chip you have now.
I haven’t went out to upgrade myself because I was a believer in the Haswell empty promises that wasted my time, but I work with machines that are SB i5’s and they are smoking smooth, quiet, cool, and fast.
I’ve only heard of a lower
I’ve only heard of a lower TDP 65W model that has the eDRAM onboard (flagship iGPU) that is supposed to be comparable to the i7 4770K, but I really don’t see how that is possible.
Anyways, I wouldn’t call anything with more “GPU” power to be a top performer on the 4770K lineup because to be quite honest, nobody buying those chips is looking for the integrated GPU component. They’d probably sell better if they took that space and replaced it with 2 extra cores. People would have far less to bitch about and you’d see performance gains that would give Intel another 4 years of this 5% performance boost before people start bitching about monopoly.
AMD could name their chip
AMD could name their chip SuperDuperIntelKiller and it still wouldn`t be close.
Cinebench 11.5 .
Cinebench 11.5 . multithread
Haswell -> 7.68
Here is my conclusion:
NOT WORTH IT.
Ivy should be 2x faster than SB.
Haswell should be 4x times faster than SB and 2x Ivy, not a 10% >< Intel thinks we're all idiots or what? It'a all AMD's fault which does not put enough pressure. Period.
LMAO LMAO LMAO, somebody is
LMAO LMAO LMAO, somebody is seriously underestimating, how hard it is to double chip’s performance every year without adding more execution units.
Looks like my 2600k will live
Looks like my 2600k will live in my system for another few years. “YAY! – MY WALLET
Also a follow up to the first
Also a follow up to the first page there is NO Transactional Synchronization Extensions (TSX) in the Core i7 4770K Processor.
Not to sound like a dick, the
Not to sound like a dick, the first page was just a wast of time I’m not a design engineer now if I had access to the equipment I be more then gladly to study the Architecture.
I do know what your talking about though but for the newbie or first timer they wouldn’t have a clue why because your throwing words with no meaning or diagrams to where it’s coming in or going out and what it’s connected to. Long story short I got bored very fast and just wanted to skip the first page all together but didn’t.
In the future don’t throw up shit like this unless you have some sort of diagram to follow, Tom’s Hardware don’t use this and either does HardOCP keep it simple but yet in lighting the read slowly not slide show screen shot’s from IDF.
Second Page well let’s just say I didn’t pay for a $400 Graphics card to be reading about Intel’s GT2 Architecture and Mobile Crap but then again some people are probable interested in this stuff but I doubt anyone that read this website is.
Thanks for the Overkill Review PCPer.
Evelyn. I agree that
Evelyn. I agree that Raymond`s postlng is flabbergasting, yesterday I got a gorgeous Acura after I been earnin $7654 this-last/4 weeks and just over ten k last munth. without a doubt its my favourite-work Ive ever had. I started this six months/ago and practically straight away began to make over $82, per-hr. I use the details here, Bow6.com
Wow. Sort of cool but barely
Wow. Sort of cool but barely evolutionary and nothing crazy new. So glad I bought a beefy 2600k and a sick GB z68. I knew the rumors around haswell were too good to be true. The bottom line of this review should be- “If you are a PC gamer with a fast GPU and an i7. Ignore Haswell altogether.” Honestly did we hit a wall? Is 5GHZ on 8 cores good enuf for anything? I will wait (probably for a long time) for the CPU that starts to crush my 2600k in gaming FPS. Glad to see my investment still giving me returns despite several new CPU releases.
Truth be told: Sandy Bridge was the big leap in gaming CPUs. Everything since then has been extremely underwhelming and incremental. Great review as always guys.
Yup totally agree with you. I
Yup totally agree with you. I have had my 2600k for almost 2 1/2 years and 3 years come march 2014.
It overclocks like a beast and although I moved away from p67 boards to a z77, It is still rocking without worrying.
Not to sound contradiction but I am going to give my 2600k to my son and keep it in the family. I ordered a 4770k and a z87 board and that will be it until there is a huge jump in microprocessors.
Idle power consumption was
Idle power consumption was higher than the 3770 most likely due to the FIVR.
The marketplace and people
The marketplace and people take care of themselves and others.
I have a new rig. Asus
I have a new rig. Asus maximus vi extreme board and ci7 4770k but it wouldnt give any display via hdmi to me. Please help !! The only way i am able to use my desktop new is that i have temporarily installed a hd7770 and using its hdmi output for display.
Thanks in advance. My retailer told me that since its k processor u need a graphics card for display!!?!
@ryan!could you adjust
@ryan!could you adjust message signal interrupt value to one per core per device in the future(if you aren’t already)specificly for CPU with GPU onboard.driver are limited to one interrupt per socket per device.(ya it is limited!but ms suggest one MSI per physical CPU.since now each core are CPU . I feel it isn’t fair for cpu including GPU to ignore this!why I ignore normal GPU?diminishing return.I feel this have a more dramatic impact on Apr like has well or jaguar then on desktop with GPU like a 7970.Ty Ryan
PS:Drivers can register a single InterruptMessageService routine that handles all possible messages or individual InterruptService routines for each message
I7 920 fanatics: the 920 is a
I7 920 fanatics: the 920 is a great cpu of you are gaming, doing Photoshop and other light stuff. If you are doing 3D, video editing and compositing and other heavy stuff, the 4770 will swipe the floor with your 920, in performance and power consumption.
ASUS Maximus Extreme 2.0X
ASUS Maximus Extreme 2.0X Motherboard(X38 Chipset, 2 Gigabit Ethernet NICS Onboard)need 1 each. Send RFQ.