Testing Methodology and System Setup

Testing Methodology

Graphics card testing has become the most hotly debated issue in the hardware enthusiast community recently.  Because of that, testing graphics cards has become a much more complicated process than it once was.  Where before you might have been able to rely on the output of a few synthetic, automatic benchmarks to make your video card purchase, that is just no longer the case.  Video cards now cost up to $500 and we want to make sure that we are giving the reader as much information as we can to aid you in your purchasing decision.  We know we can’t run every game or find every bug and error, but we try to do what we can to aid you, our reader, and the community as a whole.

With that in mind, all the benchmarks that you will see in this review are from games that we bought off the shelves just like you.  Of these games, there are two different styles of benchmarks that need to be described.

The first is the “timedemo-style” of benchmark.  Many of you may be familiar with this style from games like Quake III; a “demo” is recorded in the game and a set number of frames are saved in a file for playback.  When playing back the demo, the game engine then renders the frames as quickly as possible, which is why you will often see the “timedemo-style” of benchmarks playing back the game much more quickly than you would ever play the game.  In our benchmarks, the FarCry tests were done in this matter: we recorded four custom demos and then played them back on each card at each different resolution and quality setting.  Why does this matter?  Because in these tests where timedemos are used, the line graphs that show the frame rate at each second, each card may not end at the same time precisly because one card is able to play it back faster than the other — less time passes and thus the FRAPs application gets slightly fewer frame rates to plot.  However, the peaks and valleys and overall performance of each card is still maintained and we can make a judged comparison of the frame rates and performance.

The second type of benchmark you’ll see in this article are manual run throughs of a portion of a game.  This is where we sit at the game with a mouse in one hand, a keyboard under the other, and play the game to get a benchmark score.  This benchmark method makes the graphs and data easy to read, but adds another level of difficulty to the reviewer — making the manual run throughs repeatable and accurate.  I think we’ve accomplished this by choosing a section of each game that provides us with a clear cut path. We take three readings of each card and setting, average the scores, and present those to you.  While this means the benchmarks are not exact to the most minute detail, they are damn close and practicing with this method for many days has made it clear to me that while this method is time consuming, it is definitely a viable option for games without timedemo support.

The second graph is a bar graph that tells you the average framerate, the maximum framerate, and the minimum framerate.  The minimum and average are important numbers here as we want the minimum to be high enough to not affect our gaming experience.  While it will be the decision of each individual gamer what is the lowest they will allow, comparing the Min FPS to the line graph and seeing how often this minimum occurs, should give you a good idea of what your gaming experience will be like with this game, and that video card on that resolution.

Our tests are completely based around the second type of benchmark method mentioned above — the manual run through.

Test System Setup

Because of the way we benchmark our graphics cards and how we display that information, getting more than three GPUs on a single graph is nearly impossible and degrades the quality of the graphs enough to make them almost useless.  Because of this, I decided to break up our data into two sets of graphs: one single GPU and one dual GPU.  The first graphs will compare the new GeForce 7800 GTX to the GeForce 6800 Ultra and the ATI X850 XT Platinum Edition.  The second set will look at a single 7800 GTX, two 7800 GTXs in SLI mode and the 6800 Ultras in SLI mode as well.

Each set includes a line graph and a bar graph.  The line graph still shows the performance over the span of time of the benchmark and the bar graph shows the data in a min/max/avg format that many readers like to see and are more comfortable with. 

You’ll also notice that I only tested the games we benchmarked at the 1600×1200 resolution.  While some gamers may not have the ability to run at that high of a resolution, it is really a silly notion to buy a $600 graphics card and NOT run at the highest frame rate possible that gives acceptable performance.  With the new 7800 GTX, and even the 6800 Ultra and X850 XT PE in most cases, these cards benchmarked are fast enough that 1600×1200 isn’t in doubt and thus I feel that gamers OUGHT to be playing at this level having spent the money they did on such product.  I am open to comments on this issue as well, so please, let me know how you feel on it.

Test System Setup

GPUs

GeForce 7800 GTX
GeForce 6800 Ultra
ATI X850 XT Platinum Edition

Processor

Athlon 64 FX-55

Motherboard

Gigabyte K8NXP-SLI

Chipset Driver

6.53

Memory

2 x 512 MB Corsair 3200XL

Memory Timings

2.0 3-3-6

Sound Card

Sound Blaster Live! Value

Hard Drive

Maxtor DiamondMax 10 300 GB

Operating System

Windows XP Professional SP1

NVIDIA Driver

77.62

ATI Driver

5.6 CCC

Software tested:

  • Doom 3 v1.3
  • Far Cry v1.3
  • Half-Life 2 Engine 7 (two maps)
  • EverQuest 2
  • 3DMark05

As a side note, I really wanted to get retail Battlefield 2 benchmarks in this article before publication, but that just didn’t happen.  After picking up the game early in the afternoon yesterday, I spent the better part of six hours or so attempting to “get to know” the game well enough to where I felt I could accurately portray it as a GPU benchmark.  There is no doubt the game is very hardware intensive and will make a good benchmark for both CPUs and GPUs, but I did not feel comfortable simply throwing up some benchmark numbers I felt were unreliable or otherwise un-beneficial to the end user.  Much has come up on the web recently about the hardware and software settings that best utilize the BF2 engine and much of it is still in the air at this time.  I could have simply turned the settings to medium and come up with a basic and generic timedemo for you to get something online, but I would rather wait until I have a good grasp of the engine and game as a whole before putting out information that you all can use in your purchasing decisions.  I apologize if this screws up anyone’s reading!

« PreviousNext »