Loading...
Please wait, while we are loading the content...
Similar Documents
Detecting and Correcting Errors in Functional Units Performing Composable Operations
| Content Provider | Semantic Scholar |
|---|---|
| Author | Scheffer, Lou |
| Copyright Year | 2003 |
| Abstract | In the operation of a DSM chip, there is the possibility of transient errors. This paper proposes a new way to detect and/or correct such errors. If we must do N identical composable operations, we can detect errors by doing 1 additional similar operation, and both detect and correct errors by performing about log2(N) additional operations. For example, suppose an algorithm requires performing 1000 FFTs. With one additional FFT, we can verify that all FFTs were performed correctly. With 10 additional FFTs (performed on various linear combinations of the input data) we can detect which, if any, FFT was wrong, and compute the correct answer without re-doing the incorrect computation. This result holds whether the results are computed in one cycle or many, sequentially or in parallel, or in hardware and software. |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www.lscheffer.com/ABFT.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |