AMD's Power Trip: The Radeon HD 6950 and 6970
AMD tries to play catch-up to Nvidia on performance, but sticks to its efficiency mantra with the new Radeon 6900 series GPUs
When AMD launched the Radeon 6800 series back in October, we suggested that some people might be confused by the naming scheme. The Radeon HD 6870 & 6850, code-named Barts, were actually midrange GPUs, and not replacements for AMD’s high end Radeon HD 5870.
Now AMD is releasing its high end GPU, code-named Cayman, in two versions. The high end, which AMD calling its “enthusiast” GPU is the Radeon HD 6970, while the Radeon HD 6950 slots into a space between the 6870 and the 6970. What’s more, the HD 6870 doesn’t replace the dual-GPU Radeon HD 5970. If all that seems once again confusing, it probably is. It’s probably best just to consider AMD’s current naming scheme as unrelated to its old one, despite similarities.
Architectural Features
Cayman is based on the same general architecture as Barts, but is an entirely different chip, with features and tweaks all its own. Let’s take a look at some of the key features of the new GPU.
- Dual Graphics Engine, including twin tessellation compute paths. Cayman delivers up to 24 of what AMD calls SIMD engines, as opposed to the 12-14 shipping on the HD 6800 GPUs. Also included are up to 96 texture units. AMD estimates tessellation performance is roughly three times faster than the HD 5870.

Twin graphics engines essentially double throughput over the 6870.
- The graphics engine is based around a VLIW4 thread processor architecture. AMD decided to eliminate the old “T-unit”, which were targeted for transcendental operations. This allowed for a 10% improvement in performance per mm2, as well as improving scheduling.

The Cayman core has been streamlined into a VLIW4 engine, which simplifies scheduling and improves space usage on the chip.
- The ROPs have been redesigned and streamlined, and now are 2x faster in 16-bit integer performance and 2-4x faster on 32-bit floats (compared to the 5870).
- In order to improve GPU compute performance, AMD implemented asynchronous dispatch, and the GPU can execute multiple compute kernels at the same time, with each kernel having its own command queue and virtual address space. Flow control has been improved, and double-precision floating point performance has been boosted. DP performance is now about 25% that of single precision.

GPU compute improvements should make Cayman a better general purpose processor.
New Anti-Aliasing Modes
Given the additional rendering horsepower, AMD is implementing new anti-aliasing modes into their Catalyst Control Center. These AA modes work with all 6000 series cards.
Enhanced Quality Anti-Aliasing
The first is something AMD is calling “Enhanced Quality Anti-Aliasing” or EQAA. This is similar in concept to Nvidia’s Coverage Sample Anti-Aliasing (CSAA). The new EQAA modes use up to 16 coverage samples per pixel, with the number of colors and samples independently controllable. The net result is better quality within the same memory envelope. EQAA is compatible with other forms of AA supported by AMD.

AMD’s EQAA supports customizable color and coverage samples within a pixel.
EQAA can be enabled within the Catalyst Control Panel to increase image quality with whatever existing AA mode that’s set.

Enabling EQAA within Catalyst Control Panel.
Using EQAA results in a relatively minimal performance hit; our experience in quick testing resulted in less than a 5% difference with 4xAA enabled, and 4xAA plus EQAA.
Morpohological Anti-Aliasing
This is a different beast altogether. MAA is a postprocessing technique which DirectCompute to deliver full-scene anti-aliasing. It’s similar to edge AA in concept, but can detect all edges and works with DX9 through DX11. It even works with games that normally don’t support AA.

The cool thing about MAA is that it works even with transparency.
Currently, morphological AA is enabled in the Catalyst Control Center. Future games may support it. If you enable MAA, you’ll see a performance hit of about 10-20% if you’re using it simultaneously with 4x anti-aliasing. But you’ll see image quality on par with supersampling AA, with better performance than supersampling.

Morphological AA delivers image quality similar to supersampling, but without the performance hit of supersampling.
Controlling Power Usage
One of the issues that GPU companies wrestle with is power usage at full throttle. The problem is that some GPU loads, like Furmark, can hammer a graphics card, and the card needs to be designed to deliver the power and cooling levels needed for such loads. But Furmark, Perlin Noise and other artificial benchmarks can impose a greater load on a GPU than most games.
Even within a game, there are only brief periods when the GPU throttles up to maximum performance; most of the time, it’s running at less than full throttle. What that means is that cards are generally overengineered to manage thermal and power loads that really occur for brief intervals.
To manage those loads, AMD is building in a feature called PowerTune – something that AMD’s David Baumann calls “inverse Turbo.” The GPU constrains power usage to a set maximum level, not allowing the chip to run at higher power levels. That means that momentary peak performance levels may be constrained slightly. However, that also means the graphics card can be shipped supporting higher normal clock speeds.
AMD will also allow users to control the amount of power draw (up to plus or minus 20%) in order to either minimize power and noise or maximize performance. These controls will be built into the AMD Overdrive section of the Catalyst Control Center. Users can then set the balance between maximum thermal power and performance.
The Cards
AMD will be shipping in two versions, the AMD Radeon HD 6970 and the HD 6950. We’re including the feature set for the Radeon HD 6870 as a base reference point.
| Features |
Radeon HD 6870 |
Radeon HD 6950 |
Radeon HD 6970 |
| Core Clock |
900MHz |
800MHz |
880MHz |
| GDDR5 Memory Clock |
1050MHz |
1250MHz |
1375MHz |
| Stream Processors |
1120 |
1408 |
1536 |
| Texture Units |
56 |
88 |
96 |
| GDDR5 Frame Buffer |
1GB |
2GB |
2GB |
| ROPs |
32 |
32 |
32 |
| Maximum Power |
151W |
200W |
250W |
| Idle Power |
19W |
20W |
20W |
| Approx. Price |
$250 |
$319 |
$369 |
The Radeon HD 6970 is aimed at the same market segment as the recently released Nvidia GTX 570, while the Radeon HD 6950 occupies the price point currently held by the GTX 470, but the GTX 470 is being phased out.
Keep reading for the good stuff: All the benchmarks you can take.