9 CONCLUSIONS AND FUTURE
WORK
Through modelling and supporting simulation, we
combine the reactive behaviour of two well known
switching policies – the Proportional Switching Pol-
icy (PSP) and the Bottleneck Aware Switching policy
(BSP) – with the proactive properties of several work-
load forecasting models. Seven forecasting models
are used, including Last Observation, Simple Algo-
rithm, Sample Moving Average, Low Pass Filter and
Autoregressive Moving Average. As each of the fore-
casting schemes has its own bias, we also develop
three meta-forecasting models (the Active Window
Model, the Voting Model and the Selective Model)
to ensure consistent and improved results.
We base our results on real-world workload traces
from several sources, including from the San Diego
Supercomputer Centre, from the ClarkNet Internet
access provider for the Metro Baltimore-Washington
DC area and, from the NASA Kennedy Space Cen-
ter web-server in Florida. For each of the three real-
world workloads, we contrast an enterprise system
with fixed resources (no switching policy) with sev-
eral alternatives: a system that employes a dynamic
server switching policy (PSP or BSP); a system that
uses PSP or BSP, and a single forecasting scheme;
and finally a system that employes PSP or BSP, and
a meta-forecasting scheme.
The results are significant in a number of respects:
(i) Dynamic server switching (using PSP or BSP) im-
proves revenue in all cases, the Bottleneck Aware
Switching policy is particularly effective; (ii) Using a
single forecasting scheme in tandem with PSP or BSP
is difficult as no one scheme wins out across all work-
loads and, if the wrong scheme is chosen, this may
lead to a reduction in the overall revenue generated
by the system; (iii) The meta-forecasting schemes al-
ways improve revenue when used in tandem with PSP
or BSP, in the worst case the improved revenue will
be negligible, in the best case the revenue may be
increased by around 40%; (iv) The Active Window
Model (AWM) proves to be the best scheme in all
cases, on average this scheme gives an improvement
in revenue of 15.1% over all three real-world work-
loads, the size of the active window is important and
must therefore be subject to some pre-calculatation
based on sample traces.
We are currently investigating the effectiveness
of these schemes in extreme (highly bursty) environ-
ments.
REFERENCES
Al-Ghamdi, M., Chester, A. P., and Jarvis, S. A. (2010).
Predictive and dynamic resource allocation for enter-
prise applications. In Proceedings of the 2010 10th
IEEE International Conference on Scalable Comput-
ing and Communications (ScalCom), pages 2776–
2783, Washington, DC, USA. IEEE Computer Soci-
ety.
Arlitt, M. and Williamson, C. (1996). Web server workload
characterization: the search for invariants. SIGMET-
RICS Perform. Eval. Rev., 24(1):126–137.
Casale, G. and Serazzi, G. (2004). Bottlenecks identifi-
cation in multiclass queueing networks using convex
polytopes. In 12th Annual Meeting of the IEEE Int’l
Symposium on Modelling, Analysis, and Simulation of
Comp. and Telecommunication Systems (MASCOTS).
Cavendish, D., Koide, H., Oie, Y., and Gerla, M. (2010).
A mean value analysis approach to transaction perfor-
mance evaluation of multi-server systems. Concurr.
Comput. : Pract. Exper., 22(10):1267–1285.
Cherkasova, L. and Phaal, P. (2002). Session-based ad-
mission control: A mechanism for peak load manage-
ment of commercial web sites. IEEE Trans. Comput.,
51(6):669–685.
Chester, A. P., Xue, J. W. J., He, L., and Jarvis, S. A. (2008).
A system for dynamic server allocation in applica-
tion server clusters. In ISPA ’08: Proceedings of the
2008 IEEE International Symposium on Parallel and
Distributed Processing with Applications, pages 130–
139, Washington, DC, USA. IEEE Computer Society.
Cuomo, G. (2000). IBM WebSphere Application Server
Standard and Advanced Editions; A methodology for
performance tuning. IBM.
Dushay, N., French, J. C., and Lagoze, C. (September
1999). Predicting indexer performance in a distributed
digital library. In Third European Conference on Re-
search and Advanced Technology for Digital Libraries
(ECDL99), Paris, France.
Faraz, A. and Vijaykumar, T. (2010). Joint optimization of
idle and cooling power in data centers while maintain-
ing response time. SIGPLAN Not., 45(3):243–256.
Federgruen, A. and Groenevelt, H. (1986). The greedy pro-
cedure for resource allocation problems: Necessary
and sufficient conditions for optimality. Oper. Res.,
34(6):909–918.
Gilly, K., Alcaraz, S., Juiz, C., and Puigjaner, R. (2004).
Comparison of predictive techniques in cluster-based
network servers with resource allocation. Modeling,
Analysis, and Simulation of Computer Systems, Inter-
national Symposium on, pages 545–552.
Hsieh, C. and Lam, S. (1988). Pam-a noniterative approx-
imate solution method for closed multichain queue-
ing networks. SIGMETRICS Perform. Eval. Rev.,
16(1):261–269.
Keung, H. N. L. C., Dyson, J. R. D., Jarvis, S. A., and Nudd,
G. R. (2003). Predicting the performance of globus
monitoring and discovery service (mds-2) queries. In
Proceedings of the 4th International Workshop on
DYNAMIC RESOURCE ALLOCATION AND ACTIVE PREDICTIVE MODELS FOR ENTERPRISE APPLICATIONS
561