Prediction of Public Procurement Corruption Indices using Machine Learning Methods
Kornelije Rabuzin, Nikola Modrušan
2019
Abstract
Protecting citizens' public financial resources through advanced corruption detection models in public procurement has become an almost unavoidable topic and the subject of numerous studies. Because such research almost always focuses on predicting corrupt competition and on calculating various indices and indicators of corruption, the data itself is very difficult to come by. The available data sets usually contain very few observations, and even fewer that are accurately labelled. Preventing or detecting compromised public procurement processes is a crucial step, and it relates to the initial phase of public procurement, i.e., the publication of the notice. The aim of this paper is to compare prediction models that use text-mining techniques and machine-learning methods to detect suspicious tenders, and to develop a model for detecting suspicious one-bid tenders. To that end, we analyzed the tender documentation for particular tenders, extracted the content of interest about the levels of all bids, and grouped it by procurement lots using machine-learning methods. A model that includes these components uses the most common text classification algorithms for prediction: naive Bayes, logistic regression and support vector machines. The results of the research showed that the knowledge contained in tender documentation can be used to detect suspicious tenders.
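To make the text-classification step concrete, the following is a minimal sketch of one of the three algorithms named above, naive Bayes, applied to short tender-like texts. It is not the authors' model: the tokenizer, the class `NaiveBayesText`, the labels and the four training sentences are all invented for illustration, and the abstract does not describe the paper's actual preprocessing or features.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and keep runs of letters (a crude tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesText:
    """Multinomial naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.word_counts = defaultdict(Counter)   # word frequencies per class
        for doc, label in zip(docs, labels):
            self.word_counts[label].update(tokenize(doc))
        self.vocab = {w for c in self.word_counts.values() for w in c}
        return self

    def predict(self, doc):
        total_docs = sum(self.class_counts.values())
        scores = {}
        for label, n_docs in self.class_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            for word in tokenize(doc):
                if word in self.vocab:  # skip words never seen in training
                    score += math.log((counts[word] + 1) / denom)
            scores[label] = score
        return max(scores, key=scores.get)


# Hypothetical toy data, made up for this sketch (not from the paper).
train_docs = [
    "bidder must hold certificate issued by one specific manufacturer",
    "delivery within two days of contract signing exclusive brand required",
    "open procedure any equivalent product accepted standard deadline",
    "technical specification allows equivalent solutions from all bidders",
]
train_labels = ["suspicious", "suspicious", "regular", "regular"]

model = NaiveBayesText().fit(train_docs, train_labels)
print(model.predict("equipment of one specific brand required"))
print(model.predict("equivalent product accepted from all bidders"))
```

A comparison along the lines described in the abstract would train logistic regression and a support vector machine on the same documents and compare the results on held-out tenders; with only a handful of labelled examples, as here, any such comparison would be purely illustrative.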
Paper Citation
in Harvard Style
Rabuzin K. and Modrušan N. (2019). Prediction of Public Procurement Corruption Indices using Machine Learning Methods. In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 3: KMIS; ISBN 978-989-758-382-7, SciTePress, pages 333-340. DOI: 10.5220/0008353603330340
in Bibtex Style
@conference{kmis19,
author={Kornelije Rabuzin and Nikola Modrušan},
title={Prediction of Public Procurement Corruption Indices using Machine Learning Methods},
booktitle={Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 3: KMIS},
year={2019},
pages={333-340},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0008353603330340},
isbn={978-989-758-382-7},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K 2019) - Volume 3: KMIS
TI - Prediction of Public Procurement Corruption Indices using Machine Learning Methods
SN - 978-989-758-382-7
AU - Rabuzin K.
AU - Modrušan N.
PY - 2019
SP - 333
EP - 340
DO - 10.5220/0008353603330340
PB - SciTePress