Quantcast

Don't have an account? Register Now! Forgot password?

Maximum IT
Features

ATI to Nvidia: You're a Dinosaur

comment Commentsprint Printemail EmailDeliciousDiggStumbleUponRedditFacebookSlashdot

GPU Core Competencies

With this generation of GPU, ATI’s beginning to see the payoff from the premature move to a 55nm die size last generation. While Nvidia languishes at 65nm, ATI is packing more silicon into a smaller space, and increasing efficiency at the same time. But, that’s not all ATI’s done. The new RV770 series GPUs feature a redesigned GPU core with an astounding 800 stream processors—the little silicon dynamos that handle everything from rendering soft shadows and bump maps to decoding H.264 video from Blu-ray movies.

By integrating 16KB of cache, bundles of 10 stream processors, and four dedicated texture units in so-called SIMD units, ATI has juiced much better shader performance out of the overall package. The 10 stream processors share can share information with other processors in the same bundle using their shared memory, which makes the new shader core much more efficient than previous designs. And, because the shader cores pump their output directly into dedicated texture units, there’s very little time lost between writing the output to texture memory.

By integrating the stream processors in modular units around the texture processing cores, the RV770 design minimizes latency and improves performance. Each SIMD unit is connected to four dedicated texture units with 480GB/sec of bandwidth between them. This was absolutely crucial to maintain performance, or the texture units, which render the actual pixels that are displayed, would become the bottleneck again.

Under the Hood: RV770 Unveiled

 

AMD RV770 Die Shot

ATI packed the latency-sensitive silicon, like the stream processors and the basic texturing units in the center of the RV770s GPU. Surrounding that is are the memory controllers and L2 cache, and on the periphery of the chip rests the memory interface (GDDR5 on the 4870 and GDDR3 on the 4850), the PCI Express connection, the Crossfire controller, and the various display controllers for DVI, HDMI, VGA, and DisplayPort. And they packed all that on a 260mm^2 55nm die.

 

Sometimes Narrower is Better

There are two basic ways to increase memory bandwidth. You can increase the clock speed of the memory or you can transfer more data with every clock cycle by increasing the width of the memory bus. Like ATI’s previous-generation GPUs, Nvidia’s GTX 280 uses a 512-bit wide memory bus. The new GPU utilizes a narrower 256-bit bus, but it’s using new GDDR5 memory, which allows twice as many transfers per clock cycle as GDDR3. This gives ATI roughly the same memory bandwidth as the GTX 280 on a board with a cheap 256-bit bus and which transfers more data at lower clock speeds.

GDDR5 also uses fewer pins to connect the memory to the board. This reduces board complexity compared to DDR3, which is especially important when you consider the greatly reduced space available for connector pins on GPUs that use smaller process technology. By using a less complex 256-bit bus and cranking the clocks up on the GDDR5 memory, ATI should be able to able to bring decent memory performance in without harming yields for the GPU.

The high-end Nvidia GeForce GTX 280 runs its GDDR3 memory at a punishing 1100MHz and pushing an impressive 115GB/sec of bandwidth. Meanwhile ATI’s 4870 just ticks along at 900MHz, but runs at the same 115GB/sec. The net result is that the ATI card’s memory draws less power and generates less heat, while delivering the same level of performance as the more expensive card.

Running GDDR5 memory at speeds lower than GDDR3 memory with the same bandwidth is great, but the current low-end and mid-range ATI boards only support 512MB of total memory (the GeForce GTX 260 ships with 896MB of memory on a 448-bit interface and the GTX 280 ships with a full gigabyte). For the most part, performance doesn’t seem to suffer due to lack of memory, but that could change as graphically intensive games like Far Cry 2 and Fallout 3 are released this year.

COMMENTS:11
COMMENTS
avatarpleo

The Pleo Robotic Dinosaur from Ugobe Life Forms looks set to one of the biggest selling Christmas presents.The Pleo Robotic Dinosaur features a range of emotions that will keep you children fascinated.

----------------------------------------------

pleo toy

Login or register to post comments
avatarWhat about Linux drivers? Is

What about Linux drivers? Is ATI going to play nice with us?

Login or register to post comments
avatarFor those that don't know,

For those that don't know, OpenCL is C-code compiled into OpenGL code to make graphics processing/debugging quicker and easier. The C-code is first written, debugged, compiled with llvm, debugged some more (or sent back to C for more debugging), etc., etc.

 

If you hack compilers, you'll love llvm. 

Login or register to post comments
avataroh?

could ATI/AMD be finally coming out of the disaster that was this past year?

Login or register to post comments
avatar  I just picked up a couple

  I just picked up a couple 4870's to replace my 8800gtx.

Login or register to post comments
avatarTrading up HD3870? This looks sweeter!

I paid like $240.00 for my HD3870, which I hooked up to a 47" LG 1080p HDTV. Sweet through HDMI. Quality is amazing! I'm thinking of reselling my 3870 for like $150.00 on eBay or something and paying the extra $50 for a jump  Of course, my clock is about 775MHz+ with an OC tool, so that is a trade off for the cheaper card. The way I see it, pros are as follows:

  • 2.5x the procs (800 vs. 320)
  • Single slot card over my mammoth of a 3870 Asus cooler
  • Less power hungry
  • Dirt cheap for the performance increase.
  • Crysis at higher resolutions? (only played the demo, but DX10 is so sweet looking). I'd by it then for sure.

 

Login or register to post comments
avatarGrats to A(MD+TI)

I am glad that they will be able to compete again.  I hope that they are secretly building a megafast top end card, just waiting for Nvidia to roll out one more geforce XXX so they can drop the bomb.

Price /performance is what matters to all but the affluent.  Why sport extra heat in your case when most of the time good enough will do?  I personally don't want to apply Degree to my case to keep it from perspiring, especially with warmer weather coming up (here in some parts, but not where I am at).

For those of us who do the best we can with what we got, this sounds like the perfect upgrade and right price to anyone who has a 1 1/2 year old & older videocard, yet wants to play modern games.  

THERE ARE ONLY 11 TYPES OF PEOPLE IN THIS WORLD. Those that think binary jokes are funny, those that don't, and those that don't know binary

Login or register to post comments
avatarWell, I guess you could

Well, I guess you could consider the 4870 X2 the bombshell, but I agree something even better will come out soon.

I found the bombshell! Visiontek has a new watercooled beyond superclocked 4870 Black Edition.  Sounds sick, eh!

N0t a n00b

Login or register to post comments
avatarN0t a n00b

N0t a n00b

Login or register to post comments
avatarI'm inclined to agree with

I'm inclined to agree with you there. I've been a bit of an ATI fan over the years as their cards performed pretty well at a decent price and they were Canadian owned prior to the AMD buyout. I'd recommend nVidia more for the performance freaks. However, I find that ATI boards generally performed well for most people as well as budget shoppers, including myself.

- mike_art03a
IT Technician Gov't of Canada

Login or register to post comments
avatarditto

Ditto here.  In my next rig I plan to build, I'm putting in an  8800GTS 512 MB by EVGA, with an EVGA mobo that supports SLI.  However, if you feel that the 4870's are better for a budget box, please let me know.

Sincerely yours, from FOB Striker, Iraq,

SGT Samuel E. McClard II

Life's a journey, enjoy the ride!!

Login or register to post comments

This Month's Issue
FEATURE How to Get FREE Programs, Services, Software & MoreFEATURE Digital Photo Printer RoundupHOW TOBuild a 3D CameraFEATUREDIY Arcade PCWHITE PAPERHow TRIM Works