After decades of fitful progress, parallel processing is suddenly hot and will soon be commonplace on ordinary PCs. For applications rich in data-level parallelism, performance is soaring by leaps and bounds.
Multicore CPUs from Intel and AMD are all good, but the game-changers are the next-gen GPUs from Nvidia and AMD/ATI. These chips are evolving from highly specialized 3D-graphics processors for games into broader computing engines for nongame software. Nvidia is leading the charge with a new GPU architecture that, for the first time, supports general-purpose computing as strongly as it supports graphics.
Nvidia’s new Fermi GPUs will support error-correction codes (ECC), one terabyte of memory, concurrent kernels, and faster double-precision floating-point math. These features are largely unnecessary for 3D graphics but vital for high-performance general-purpose computing. (In fact, ECC slows down graphics processing, which is why it can be disabled in Fermi chips sold for the consumer market.)
Continue reading after the jump.