Loading...
Please wait, while we are loading the content...
Similar Documents
Cloud resource usage: extreme distributions invalidating traditional capacity planning models
| Content Provider | ACM Digital Library |
|---|---|
| Author | Loboz, Charles Z. |
| Abstract | For years Capacity Planning professionals knew or suspected that various characteristics of computer usage have non-normal distribution. At the same time much of the traditional workload modeling and forecasting is based on mathematical techniques assuming some sort of normality of underlying distributions. If the dissonance between the existing and assumed distribution exists, then resulting capacity models are of lower quality, with possibly erroneous forecasts - and confidence intervals much wider than expected. This paper analyzes distribution of daily resource usage on two storage clusters for 259 days. For each day we consider the distribution of resource usage by customer accounts for five different resources: storage used, storage transactions executed, internal network transfer, egress transfer and inter-data-center transfer - 2590 sample distributions in total. All distributions were highly imbalanced and far from normal and 94% of distribution samples have tails heavier than log-normal, exponential, or normal distributions. These findings spell significant problems for most models assuming normality. Mathematically: Central Limit Theorem does not apply to power-law distributions - so the 'averaging' effect cannot be counted on to help with modeling using traditional approach. Operationally: very high volatility found means that the 'capacity buffers' need to be large, leading to wasted capacity. Other, administrative, means need to be applied to reduce that. Overall the distributions of resource usage in cloud storage are so far from normal, even after usual transformations, that traditional approach to forecasting and capacity planning needs to be reconsidered. |
| Starting Page | 7 |
| Ending Page | 14 |
| Page Count | 8 |
| File Format | |
| ISBN | 9781450306997 |
| DOI | 10.1145/1996109.1996112 |
| Language | English |
| Publisher | Association for Computing Machinery (ACM) |
| Publisher Date | 2011-06-08 |
| Publisher Place | New York |
| Access Restriction | Subscribed |
| Subject Keyword | Probability distributions Volatility Resource usage Power law Capacity planning |
| Content Type | Text |
| Resource Type | Article |