Loading...
Please wait, while we are loading the content...
Similar Documents
Development of Naturally Fault Tolerant Algorithms for Computing on 100,000 Processors
| Content Provider | Semantic Scholar |
|---|---|
| Author | Geist, Al Engelmann, Christian |
| Copyright Year | 2002 |
| Abstract | This paper describes ongoing research at Oak Ridge National Laboratory into the issues and potential problems of algorithm scalability to 100,000 processor systems. Such massively parallel computers are projected to be needed to reach a petaflops computational speed before 2010. And to make such hypothetical machines a reality, IBM Research has begun developing a computer named “BlueGene” that could have up to 65,536 processor chips in the 2005 time frame. A key issue is how to effectively utilize a machine with 100,000 processors. Scientific algorithms have shown poor scalability on 10,000 processor systems that exist today. In this paper we define a new term called super-scalable algorithms, which have the property of natural fault tolerance, then go on to show that such algorithms do exist for scientific applications. Finally, we describe a 100,000 processor simulator we have developed to test the new algorithms. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www.csm.ornl.gov/~geist/Lyon2002-geist.pdf |
| Alternate Webpage(s) | http://www.epm.ornl.gov/~geist/Lyon2002-geist.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |