prediction performance. Therefore, the simulation
runtime prediction approach proposed in this paper is
superior to the existing single machine learning
regression model.
5 CONCLUSIONS
This paper discusses the runtime prediction programs
for CSM in the cloud environment, we propose a
simulation runtime prediction method based on
ensemble learning to support the efficient deployment
for CSM in the cloud. Firstly, the simulation
application is deployed in the cloud environment to
generate the data set, and the feature selection
technology is utilized to obtain the relevant feature
set. Secondly, a prediction algorithm based on
stacking ensemble learning is proposed, which
improves the prediction accuracy of ensemble model
by selecting the optimal model subset. The algorithm
can also automatically predict the runtime of CSM
application and select the optimal computing
resources. To prove the advantages of the proposed
approach, we evaluate different machine learning
methods, such as linear regression, multilayer
perceptron, regression tree and random forest.
Experiments show that ours approach could
effectively predict the runtime of CSM applications.
The proposed approach could be enhanced by the
following future work:
(1) The generality of the proposed method can be
considered to predict the runtime of different
types of simulation applications.
(2) Explore more partition algorithms, expand the
optional partition algorithm library and reduce
the deployment cost of simulation
applications.
ACKNOWLEDGEMENTS
This research was funded by the National Natural
Science Foundation of China (no. 61903368 and
no.61906207).
REFERENCES
Bocciarelli, P., D'Ambrogio, A., Mastromattei, A., Paglia,
E., & Giglio, A. (2017). Business process modeling and
simulation: state of the art and MSaaS opportunities.
Paper presented at the Proceedings of the Summer
Simulation Multi-Conference.
Cetinski, K., & Juric, M. B. (2015). AME-WPC: Advanced
model for efficient workload prediction in the cloud.
Journal of Network & Computer Applications, 55(sep.),
191-201.
Chen, T., He, T., & Benesty, M. (2016). xgboost: Extreme
Gradient Boosting.
Cruz, R. M., Sabourin, R., Cavalcanti, G. D., & Ren, T. I.
(2015). META-DES: A dynamic ensemble selection
framework using meta-learning. Pattern recognition,
48(5), 1925-1935.
D’Angelo, G., & Marzolla, M. (2014). New trends in
parallel and distributed simulation: From many-cores to
cloud computing. Simulation Modelling Practice and
Theory, 49, 320-335.
Dietterich, T. G. (2000). Ensemble Methods in Machine
Learning. Paper presented at the International
Workshop on Multiple Classifier Systems.
Fujimoto, R., Bock, C., Chen, W., Page, E., & Panchal, J.
H. (2017). Research Challenges in Modeling and
Simulation for Engineering Complex Systems.
Fujimoto, R. M. (2016). Research Challenges in Parallel
and Distributed Simulation.
Fujimoto, R. M., Malik, A. W., & Park, A. (2010). Parallel
and distributed simulation in the cloud. SCS M&S
Magazine, 3, 1-10.
Haken, H. (2006). Information and self-organization: A
macroscopic approach to complex systems: Springer
Science & Business Media.
Hui, L., Groep, D., & Wolters, L. (2005). An evaluation of
learning and heuristic techniques for application run
time predictions.
Kaur, G., Bala, A., & Chana, I. (2019). An intelligent
regressive ensemble approach for predicting resource
usage in cloud computing. Journal of Parallel and
Distributed Computing, 123, 1-12.
Kim, I. K., Wang, W., Qi, Y., & Humphrey, M. (2020).
Forecasting Cloud Application Workloads with
CloudInsight for Predictive Resource Management.
IEEE Transactions on Cloud Computing, PP(99), 1-1.
Kurt, V., Silas, D. M., & Jan, B. (2012). Conservative
Distributed Discrete Event Simulation on Amazon EC2.
Paper presented at the 2012 12th IEEE/ACM
International Symposium on Cluster, Cloud and Grid
Computing (ccgrid 2012).
Lee, B. D., & Schopf, J. M. (2004). Run-time prediction of
parallel applications on shared environments. Paper
presented at the IEEE International Conference on
Cluster Computing.
Li, J., Cheng, K., Wang, S., Morstatter, F., Trevino, R. P.,
Tang, J., & Liu, H. (2017). Feature selection: A data
perspective. ACM Computing Surveys (CSUR), 50(6),
1-45.
Liu, X., Qiang, H., Qiu, X., Chen, B., & Huang, K. (2012).
Cloud-based computer simulation: Towards planting
existing simulation software into the cloud. Simulation
Modelling Practice & Theory, 26(none), 135-150.
Matsunaga, A., & Fortes, J. A. (2010). On the use of
machine learning to predict the time and resources
consumed by applications.
Paper presented at the 2010
10th IEEE/ACM International Conference on Cluster,