Loading...
Please wait, while we are loading the content...
Similar Documents
Regression Testing on Shaheen Cray XC 40 : Implementation and Lessons Learned
| Content Provider | Semantic Scholar |
|---|---|
| Author | Hadri, Bilel Kortas, Samuel Fiedler, Robert Markomanolis, George S. |
| Copyright Year | 2017 |
| Abstract | Leadership-class supercomputers are becoming larger and more complex tightly integrated systems consisting of many different hardware components, tens of thousands of processors and memory chips, kilometers of networking cables, large numbers of disks, and hundreds of applications and libraries. To increase scientific productivity and ensure that applications efficiently and effectively exploit a system’s full potential, all the components must deliver reliable, stable, and performant service. Therefore, to deliver the best computing environment to our users, system performance assessments are critical, especially after an unplanned downtime or any scheduled maintenance session. This paper describes the design and implementation of the regression testing methodology used on the Shaheen2 XC40 to detect and track issues related to the performance and functionality of compute nodes, storage, network, and programming environment. We also present an analysis of the results over 24 months, along with the lessons learned. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | https://cug.org/proceedings/cug2017_proceedings/includes/files/pap119s2-file1.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |