Loading...
Please wait, while we are loading the content...
Similar Documents
Entering the petaflop era: the architecture and performance of roadrunner.
Content Provider | CiteSeerX |
---|---|
Author | Lang, Mike Hoisie, Adolfy Pakin, Scott Davis, Kei Kerson, Darren J. Barker, Kevin J. Sancho, Jose C. |
Abstract | precision) hybrid-architecture supercomputer developed by LANL and IBM. It contains 12,240 IBM PowerXCell 8i processors and 12,240 AMD Opteron cores in 3,060 compute nodes. Roadrunner is the first supercomputer to run Linpack at a sustained speed in excess of 1 Pflop/s. In this paper we present a detailed architectural description of Roadrunner and a detailed performance analysis of the system. A case study of optimizing the MPI-based application Sweep3D to exploit Roadrunner’s hybrid architecture is also included. The performance of Sweep3D is compared to that of the code on a previous implementation of the Cell Broadband Engine architecture—the Cell BE—and on multicore processors. Using validated performance models combined with Roadrunner-specific microbenchmarks we identify performance issues in the early pre-delivery system and infer how well the final Roadrunner configuration will perform once the system software stack has matured. Keywords—Petascale computing, heterogeneous, accelerators, performance analysis, Roadrunner. I. |
File Format | |
Access Restriction | Open |
Subject Keyword | Roadrunner-specific Microbenchmarks Petaflop Era Validated Performance Model Compute Node Detailed Architectural Description Performance Issue First Supercomputer Sustained Speed Performance Analysis Case Study Hybrid-architecture Supercomputer Amd Opteron Final Roadrunner Configuration Detailed Performance Analysis Keywords Petascale Computing Previous Implementation Mpi-based Application Sweep3d Roadrunner Hybrid Architecture Ibm Powerxcell Cell Broadband Engine System Software Stack Early Pre-delivery System Multicore Processor |
Content Type | Text |
Resource Type | Article |