Synthetic Optimisation Techniques for Epidemic Disease Prediction Modelling

Terence Fusco, Yaxin Bi, Haiying Wang, Fiona Browne

Abstract

In this paper, research is presented for improving optimisation performance using sparse training data for disease vector classification. Optimisation techniques currently available such as Bayesian, Evolutionary and Global optimisation and are capable of providing highly efficient and accurate results however, performance potential can often be restricted when dealing with limited training resources. In this study, a novel approach is proposed to address this issue by introducing Sequential Model-based Algorithm Configuration(SMAC) optimisation in combination with Synthetic Minority Over-sampling Technique(SMOTE) for optimised synthetic prediction modelling. This approach generates additional synthetic instances from a limited training sample while concurrently seeking to improve best algorithm performance. As results show, the proposed Synthetic Instance Model Optimisation (SIMO) technique presents a viable, unified solution for finding optimum classifier performance when faced with sparse training resources. Using the SIMO approach, noticeable performance accuracy and f-measure improvements were achieved over standalone SMAC optimisation. Many results showed significant improvement when comparing collective training data with SIMO instance optimisation including individual performance accuracy increases of up to 46% and a mean overall increase for the entire 240 configurations of 13.96% over standard SMAC optimisation.

Download


Paper Citation


in Harvard Style

Bi Y., Wang H. and Browne F. (2018). Synthetic Optimisation Techniques for Epidemic Disease Prediction Modelling.In Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA, ISBN 978-989-758-318-6, pages 95-106. DOI: 10.5220/0006823800950106


in Bibtex Style

@conference{data18,
author={Yaxin Bi and Haiying Wang and Fiona Browne},
title={Synthetic Optimisation Techniques for Epidemic Disease Prediction Modelling},
booktitle={Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA,},
year={2018},
pages={95-106},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006823800950106},
isbn={978-989-758-318-6},
}


in EndNote Style

TY - CONF

JO - Proceedings of the 7th International Conference on Data Science, Technology and Applications - Volume 1: DATA,
TI - Synthetic Optimisation Techniques for Epidemic Disease Prediction Modelling
SN - 978-989-758-318-6
AU - Bi Y.
AU - Wang H.
AU - Browne F.
PY - 2018
SP - 95
EP - 106
DO - 10.5220/0006823800950106