2006), recall, and area under the curve (AUC) (Davis
and Goadrich 2006; Powers 2011). LR produced the
highest accuracy of about 98.3%. Followed to LR,
KNN and SVM produced accuracy about 97.6%, and
96.7%, respectively. Three models were generating
similar accuracy rates. Sometimes, accuracy is not
only enough to judge the model performance.
Therefore, analysis of other parameters such as
precision, recall, and AUC is mandatory to define
model validation.
Precision can define positive outcomes from total
predicted positive instances. In this study, we found
similar accuracy for two models (LR and KNN) about
98± 0.04%. When compared with the other two
models, SVM was producing a low positive
prediction rate of 97.1%. On the other hand, recall
(sensitivity) can define true positives from total actual
positives. Both precision and recall are based on the
understanding of the relevance of positive outcomes.
From Table2, the sensitivity for LR predictive model
found at about 97.4%. Alternatively, KNN was with
the highest sensitivity rate of 98.3%, and SVM with
the lowest sensitivity rate of 96.6% can found.
Despite this, in machine learning, AUC can help to
overcome classification problems. It is one of the key
performance tools for model performance checks.
Generally, the AUC was ranging in between [0, 1].
By definition, if AUC ≈ 1, then the model was
correctly distinguishing the target class. The AUC
values of LR, KNN, and SVM were 99.7%, 99.6%,
and 98.3%, respectively.
Table 2: Performance metrics of different predictive
models.
Model Accuracy Precision Recall AUC
SVM 0.967 0.971 0.966 0.983
LR 0.983 0.986 0.974 0.997
KNN 0.976 0.982 0.983 0.996
Figure 3: Graphical representation of AUC values.
4 CONCLUSIONS
In this study, three supervised ML algorithms (SVM,
LR, and KNN) were defined to classify dementia
patients. Feature extraction performed using the
principal component analysis method using the R
platform. Different performance parameters set was
defined the model validation. Results validated that
the three models are accurately classifying dementia
patients with better rates from 96.7-98.3%. In
unbalanced datasets, accuracy is not only the
parameter to validate the model. Therefore, other
metrics, such as precision, recall, and AUC, were also
considered. The AUC of LR and KNN reached the
highest value of one, such that these two predictive
models were well classified the dementia patients.
This work is concluding that employment PCA
techniques were much better than the manual
selection of attributes with minimum medical
knowledge. Therefore, with limited features and
integration of the PCA method, we were achieved
better accuracy rates when compared with previous
studies in dementia classifications.
CONFLICTS OF INTEREST
The authors do not possess any conflicts during the
publication.
ACKNOWLEDGMENTS
We are thankful to the Principal Investigators: D.
Marcus, R, Buckner, J. Csernansky, and J. Morris, to
provide access to OASIS longitudinal studies.
REFERENCES
Aditya, C. R., and M. B.Sanjay Pande. 2017. “Devising an
Interpretable Calibrated Scale to Quantitatively Assess
the Dementia Stage of Subjects with Alzheimer’s
Disease: A Machine Learning Approach.” Informatics
in Medicine Unlocked.
Barragán Martínez, D., M. A. García Soldevilla, A. Parra
Santiago, and J. Tejeiro Martínez. 2019. “Alzheimer’s
Disease.” Medicine (Spain).
Baştanlar, Yalin, and Mustafa Özuysal. 2014. “Introduction
to Machine Learning.” Methods in Molecular Biology.
Battineni, Gopi, Nalini Chintalapudi, and Francesco
Amenta. 2019. “Machine Learning in Medicine:
Performance Calculation of Dementia Prediction by