A Fault Model for Upgrades in Distributed Systems (CMU-PDL-08-115)
| Content Provider | Semantic Scholar |
|---|---|
| Author | Dumitras, Tudor; Kavulya, Soila; Narasimhan, Priya |
| Copyright Year | 2008 |
| Abstract | Recent studies, and a large body of anecdotal evidence, suggest that upgrades are unreliable and often end in failure, causing downtime and data loss. While this is sometimes due to software defects in the new version, most upgrade failures are the result of faults in the upgrade procedure, such as broken dependencies. In this paper, we present data on upgrade failures from three independent sources (a user study, a survey, and a field study) and, through statistical cluster analysis, we construct a novel fault model for upgrades in distributed systems. We identify four distinct types of faults: (1) simple configuration errors (e.g., typos); (2) semantic configuration errors (e.g., misunderstood effects of parameters); (3) broken environmental dependencies (e.g., incorrect libraries, port conflicts); and (4) complex procedural errors. We estimate that, on average, Type 1 faults occur in 15.2% of upgrades, and Type 4 faults occur in 16.8% of upgrades. |
| Affiliation | Carnegie Mellon University, Pittsburgh, PA 15217 |
| Contact | tudor@cmu.edu, spertet@ece.cmu.edu, priya@cs.cmu.edu |
| File Format | PDF HTM / HTML |
| Alternate Webpage(s) | http://www.pdl.cmu.edu/PDL-FTP/stray/CMU-PDL-08-115.pdf |
| Alternate Webpage(s) | http://repository.cmu.edu/cgi/viewcontent.cgi?article=1011&context=pdl |
| Alternate Webpage(s) | http://www.pdl.cs.cmu.edu/PDL-FTP/stray/CMU-PDL-08-115.pdf |
| Alternate Webpage(s) | http://www.pdl.cmu.edu/PDL-FTP/associated/CMU-PDL-08-115.pdf |
| Alternate Webpage(s) | http://users.ece.cmu.edu/~tdumitra/upgrade_faults/dumitras_cmu_pdl_08_115.pdf |
| Alternate Webpage(s) | http://www.pdl.cs.cmu.edu/PDL-FTP/associated/CMU-PDL-08-115.pdf |
| Language | English |
| Access Restriction | Open |
| Content Type | Text |
| Resource Type | Article |