AMD FireStream 9250 breaks 1 teraflop barrier

AMD can hardly contain their excitement with the news that their FireStream 9250 processor has broken the 1 teraflop barrier. Fitted to the FireStream 9250 graphics card, it handles not only video output but can work in tandem with the main CPU to accelerate complex calculations. And unlike its 9170 predecessor, it only occupies 1 PCI slot and demands less than 150 watts.

In fact, the AMD FireStream 9250 delivers up to eight gigaflops per watt, and developers have found up to a 55x performance increase on financial analysis codes (when compared to using the CPU alone). It has 1GB of GDDR3 memory and is, according to AMD, ideally sized and suited to 1U servers as well as most desktop systems and workstations.

It, and the supporting AMD Stream SDK, will be available in Q3 2008. The MSRP is $999, which sounds a lot until you consider that the 9170 – which is currently available – costs a whopping $1,999.

Press Release:

AMD Stream Processor First to Break 1 Teraflop Barrier

—Next-generation AMD FireStream™ 9250 processor accelerates scientific and engineering calculations, efficiently delivering supercomputer performance at up to eight gigaflops-per-watt —

DRESDEN, Germany — June 16, 2008 –At the International Supercomputing Conference, AMD (NYSE:AMD) today introduced its next-generation stream processor, the AMD FireStream™ 9250, specifically designed to accelerate critical algorithms in high-performance computing (HPC), mainstream and consumer applications. Leveraging the GPU design expertise of AMD's Graphics Product Group, AMD FireStream 9250 breaks the one teraflop barrier for single precision performance. It occupies a single PCI slot, for unmatched density and with power consumption of less than 150 watts, the AMD FireStream 9250 delivers an unprecedented rate of performance per watt efficiency with up to eight gigaflops per watt.

Customers can leverage AMD's latest FireStream offering to run critical workloads such as financial analysis or seismic processing dramatically faster than with CPU alone, helping them to address more complex problems and achieve faster results. For example, developers are reporting up to a 55x performance increase on financial analysis codes as compared to processing on the CPU alone, which supports their efforts to make better and faster decisions.1 Additionally, the use of flexible GPU technology rather than custom accelerators assists those creating application-specific systems to enhance and maintain their solutions easily.

The AMD FireStream 9250 stream processor includes a second-generation double-precision floating point hardware implementation delivering more than 200 gigaflops, building on the capabilities of the earlier AMD FireStream™ 9170, the industry's first GP-GPU with double-precision floating point support. The AMD FireStream 9250's compact size makes it ideal for small 1U servers as well as most desktop systems, workstations, and larger servers and it features 1GB of GDDR3 memory, enabling developers to handle large, complex problems.

Driving broad consumer adoption with open systems

AMD enables development of the FireStream family of processors with its AMD Stream SDK, designed to help developers create accelerated applications for AMD FireStream, ATI FireGL™ and ATI Radeon™ GPUs. AMD takes an open-systems approach to its stream computing development environment to ensure that developers can access and build on the tools at any level. AMD offers published interfaces for its high-level language API, intermediate language, and instruction set architecture; and the AMD Stream SDK's Brook+ front-end is available as open source code.

In keeping with its open systems philosophy, AMD has also joined the Khronos Compute Working Group. This working group's goals include developing industry standards for data parallel programming and working with proposed specifications like OpenCL. The OpenCL specification can help provide developers with an easy path to development across multiple platforms.

"An open industry standard programming specification will help drive broad-based support for stream computing technology in mainstream applications," said Rick Bergman, senior vice president and general manager, Graphics Product Group, AMD. "We believe that OpenCL is a step in the right direction and we fully support this effort. AMD intends to ensure that the AMD Stream SDK rapidly evolves to comply with open industry standards as they emerge."

Accelerating industry adoption

The growth of the stream computing market has accelerated over the past few years with Fortune 1000 companies, leading software developers and academic institutions utilizing stream technology to achieve tremendous performance gains across a variety of applications.

"Stream computing is increasingly important for mainstream and consumer applications and is no longer limited to just the academic or engineering industries. Today we are truly seeing a fundamental shift in emerging system architectures," said Jon Peddie, president, Jon Peddie Research. "As the industry's only provider of both high-performance discrete GPUs and x86-compatible CPUs, AMD is uniquely well-suited to developing these architectures."

AMD customers, including ACCIT, Centre de Physique de Particules de Marseille, Neurala and Telanetix are using the AMD Stream SDK and current AMD FireStream, ATI FireGL or ATI Radeon boards to achieve dramatic performance gains on critical algorithms in HPC, workstation and consumer applications. Currently, Neurala reports that it is achieving 10-200x speedups over the CPU alone on biologically inspired neural models, applicable to finance, image processing and other applications.2

AMD is also working closely with world class application and solution providers to ensure customers can achieve optimum performance results. Stream computing application and solution providers include CAPS entreprise, Mercury Computer Systems, RapidMind, RogueWave and VizExperts. Mercury Computer Systems provides high-performance computing systems and software designed for complex image, sensor, and signal processing applications. Its algorithm team reports that it has achieved 174 GFLOPS performance for large 1D complex single-precision floating point FFTs on the AMD FireStream 9250.3

Pricing and availability

AMD plans to deliver the FireStream 9250 and the supporting SDK in Q3 2008 at an MSRP of $999 USD. AMD FireStream 9170, the industry's first double-precision floating point stream processor, is currently available for purchase and is competitively priced at $1,999 USD. For more information about AMD FireStream 9250 or AMD FireStream 9170 or AMD's complete line of stream computing solutions, please visit http://www.amd.com/stream.

About AMD

Advanced Micro Devices (NYSE: AMD) is a leading global provider of innovative processing solutions in the computing, graphics and consumer electronics markets. AMD is dedicated to driving open innovation, choice and industry growth by delivering superior customer-centric solutions that empower consumers and businesses worldwide. For more information, visit http://www.amd.com.

RapidMind has reported a 55x speedup over CPU alone on binomial options pricing calculators. The comparison is versus Quantlib running on a single core of a Dual-Core AMD Opteron™ 2352 processor on Tyan S2915 w/ Win XP 32 (Palomar Workstation from Colfax)

Neurala comparison is against dual AMD Opteron 248 processor (using only a single processor for comparison) w/ 2GB SDRAM DDR 400 ECC dual channel and SUSE Linux 10 (custom kernel)

Mercury benchmark system details: Intel Core2 6820 @ 2.13 GHz w/ 3GB of RAM, FireStream 9250 stream processor