the classifiers improve, except with random forest
that showed a better accuracy and F1 score when it
was used with all the features. Overall, ChiSq and
PCA obtained the highest accuracy, precision, recall
and F1 score. LOG and RF were the classifiers that
computed the best performance.
In general, the greatest problem with the models
was the false negatives, this is important to consider,
it is better to have a good classification of the false
negatives than the false positives. For future
development, some experimental work will attempt
to model the physiological HF problem, which is
difficult to do with few features. In addition, these
models will be replicated in a big data health
environment and test its functioning with massive
databases.
REFERENCES
Asl, B., Setarehdan, S. and Mohebbi, M. 2008. Support
vector machine-based arrhythmia classification using
reduced features of heart rate variability signal.
Artificial Intelligence in Medicine, 44(1), pp.51-64.
Domingos, P. 2012. A few useful things to know about
machine learning. Communications of the ACM,
55(10), p.78.
Dun B., Wang E., Majumder S. 2016. Heart Disease
Diagnosis on Medical Data Using Ensemble Learning.
Friedman, J. H. 1999. Greedy Function Approximation: A
Gradient Boosting Machine. Annals of Statistics. 29.
1189-1232. 10.2307/2699986.
Guidi, G., Pettenati, M., Melillo, P. and Iadanza, E. 2014.
A Machine Learning System to Improve Heart Failure
Patient Assistance. IEEE Journal of Biomedical and
Health Informatics, 18(6), pp.1750-1756.
Hastie, T., Tibshirani, R. and Friedman, J. 2017. The
Elements of Statistical Learning: Data Mining,
Inference, and Prediction. Springer.
HEART. 2018. Heart Failure. [online] Available at:
http://www.heart.org/HEARTORG/Conditions/Heart
Failure/HeartFailure_UCM_002019_SubHomePage.jsp
[Accessed 16 Jun. 2018].
Khan, S.S. 2016. Prediction of Angiographic Disease
Status using Rule Based Data Mining Techniques.
Biological Forum An International Journal, 8(2), pp
103-107.
Hornik, K. 1991. Approximation capabilities of multilayer
feedforward networks. Neural Networks, 4(2):251–
257.
Khanna, D., Sahu, R., Baths, V. and Deshpande, B. 2015.
Comparative Study of Classification Techniques
(SVM, Logistic Regression and Neural Networks) to
Predict the Prevalence of Heart Disease. International
Journal of Machine Learning and Computing, 5(5),
pp.414-419.
Khalaf, A., Owis, M. and Yassine, I. 2015. A novel
technique for cardiac arrhythmia classification using
spectral correlation and support vector machines.
Expert Systems with Applications, 42(21), pp.8361-
8368.
Kodati, S. 2018. Analysis of Heart Disease using in Data
Mining Tools Orange and Weka. Global Journal of
Computer Science and Technology, 18(1).
Mutyala, N. 2018. Prediction of Heart Diseases Using
Data Mining and Machine Learning Algorithms and
Tools. International Journal of Scientific Research in
Computer Science, Engineering and Information
Technology, 3(3).
Marsland, S. 2015. Machine Learning: an Algorithmic
Perspective. Chapman and Hall/CRC.
Parthiban, G. and K. Srivatsa, S. 2012. Applying Machine
Learning Methods in Diagnosing Heart Disease for
Diabetic Patients. International Journal of Applied
Information Systems, 3(7), pp.25-30.
Rahhal, M., Bazi, Y., AlHichri, H., Alajlan, N., Melgani,
F. and Yager, R. 2016. Deep learning approach for
active classification of electrocardiogram signals.
Information Sciences, 345, pp.340-354.
Rajagopal, R. and Ranganathan, V. 2017. Evaluation of
effect of unsupervised dimensionality reduction
techniques on automated arrhythmia classification.
Biomedical Signal Processing and Control, 34, pp.1-8.
Breiman, L. (2001) Random forests. Machine Learning,
45, 5-32.
Shalev-Shwartz, S., Ben-David, S. 2016. Understanding
machine learning: From theory to algorithms. New
York: Cambridge University Press.
Singh, R., Saini, B. and Sunkaria, R. 2018. Detection of
coronary artery disease by reduced features and
extreme learning machine. Clujul Medical, 91(2),
p.166.
Spark.apache.org. 2018. Linear Methods - RDD-based
API - Spark 2.0.0 Documentation. [online] Available
at: https://spark.apache.org/docs/2.0.0/mllib-linear-
methods.html#logistic-regression
[Accessed 20 Jun. 2018].
UCI. 2018. UCI Machine Learning Repository: Heart
Disease Data Set. [online] Available at:
http://archive.ics.uci.edu/ml/datasets/heart+disease
[Accessed 20 Jun. 2018].
World Health Organization. 2018. Cardiovascular diseases
(CVDs). [online] Available at:
http://www.who.int/cardiovascular_diseases/en/
[Accessed 19 Jun. 2018].
Yaghouby, F., Ayatollah, A., Soleimani, R. 2009.
Classification of Cardiac Abnormalities Using
Reduced Features of Heart Rate Variability Signal.
World Applied Sciences Journal, 6 (11), pp.1547-
1554.
Ziasabounch, N. 2014. ANFIS Based Classification Model
for Heart Disease Prediction. International Journal of
Electrical & Computer Sciences, 14 (2), pp.7-12.