SYSMark 2014 SE
SYSmark® 2014 SE (Second Edition) is an application-based benchmark that reflects usage patterns of business users in the areas of Office Productivity, Data/Financial Analysis, and Media Creation. Joining these in SYSmark 2014 SE is a new Responsiveness scenario which models ‘pain points’ in the user experience when performing common activities. SYSmark 2014 SE features the most popular applications from each of their respective fields.
In SYSMark 2014 SE, the Threadripper 2990WX falls short of the 2950X in all but one scenario, data/financial analysis. Most of these tasks aren't heavily threaded, so what we are seeing is the adverse effects of increased latencies on the system as a whole with the four die architecture of the 2990WX. Namely, the responsiveness score sees a substantial hit comparing the 2950X and 2990WX.
Compared to the Intel Skylake-X CPUs, the 2950X is quite competitive in most tests, excluding Media Creation.








“Due to this, the WX-series
“Due to this, the WX-series Threadripper processors must remain in a NUMA configuration, and present themselves as four individual NUMA nodes to an operating system, akin to a quad-CPU system. Additionally, the Infinity Fabric link between each of these dies is effectively running at half the speed of the 2-die arrangement found with the X-series processors.”
What are yoh refering to here? AFAIK, it is fully connected in the 4 die threadripper, just like it is in Epyc. In the two die variant, you only have one link between the two die and that is it. In the 4 die variant, they have 3 links in use each to connect to the other 3 die with a single hop latency. I don’t think I would refer to anything as half speed other than the memory bandwidth. I suspect that windows does not have the necessary NUMA optimizations to handle such a configuration properly anyway. I would be running linux on such a system. It gets a lot of use in HPC and can handle, in some cases, thousands of processor cores with a wide variety of memory configurations.
The mp3 encode as a benchmark does seem a bit odd. The gamming benchmarks, while not really odd, are of little importance. If you are going to buy a $900 or $1700 dollar processor for gaming at 1080p, unless you are using a software renderer, it would be a compelete waste. For game developers, this might still be a good system, assuming you are a developer capable of making your game perform well with many cores available, or at least not crash on start-up. As noted, windows looks like a problem here. It might have been good to test at 4k, just to see if it is graphics card limited, or whether the cpu is the bottleneck. It could hit windows scaling issues though. Also, nvidia’s driver is probably a near worst case scenario on any system that doesn’t have a single, last level cache. It seems to have a lot of fine grained, thread to thread communication. Maintaining a single last level cache with good latency is a major bottleneck to scaling to more cores, so it would be better in most cases if it would just go away, and developers would optimize their software for multiple core clusters They have to do that anyway for the consoles with similar 4 core cluster architectures. I wouldn’t be surprised to see cell phones go with core clusters also, due to better power consumption.
Well, off to look for linux compile benchmarks on Threadripper.
I stumbled on this customer
I stumbled on this customer review of the TR 1900X at Newegg:
“- Large 20% memory performance difference between NUMA and non-NUMA settings.
“On my system, the NUMA setting (memory interleave on) for some reason reduces CPU performance by
about 15%, while boosting RAM performance by 20%.”
…
“Wish amd could improve the memory controller and reduce CPU-RAM latency to competitive levels.”
Perhaps this is something to consider, and compare, when benching TR2 CPUs.
p.s. I believe der8auer at
p.s. I believe der8auer at YouTube switched memory interleave ON when running 2 x ASUS Hyper M.2 x16 add-in cards with 8 x Samsung 960 Pro NVMe SSDs:
https://www.youtube.com/watch?v=9CoAyjzJWfw
fast-forward starting around 7:30 on the counter
for the BIOS setup in that video
@ 8:07 on the counter: “Memory Interleaving”
Paul Alcorn’s recent article
Paul Alcorn’s recent article is a fun and easy read:
“AMD Ryzen Threadripper 2 vs. Intel Skylake-X:
Battle of the High-End CPUs” (August 14, 2018)