
4 DISCUSSION
The results indicate that both overall feature selection
and group-based feature selection improved the per-
formance of the decision tree, Naive Bayes, and SVM
models. Also, group-based feature selection showed
higher performance than overall feature selection for
all models. In this study, the number of features was
reduced from 73 to 25 by feature selection, resulting
in a 65% reduction rate; at the same time, an increase
in accuracy and F1 score was observed for all classi-
fiers. It indicates strong overall performance.
The study using the same dataset (Appakaya
et al., 2020) obtained 88.5% accuracy. However,
this performance was obtained by applying leave-
one-subject-out (LOSO). LOSO is a cross-validation
method that indicates the performance reliability for
entirely new data. With this advantage, it can be
preferred in diagnostic decision support systems. In
this study, 10-fold cross-validation was applied dur-
ing the experiments, and 96.36% accuracy was ob-
tained. Feature selection not only increased accuracy
across all classifier types, but also improved results by
enhancing other metrics closer to a balanced ratio.
The results underscore the importance of feature
engineering and model selection in achieving the best
possible classification performance. Feature selection
excludes the features with lower importance for the
training process and increases the classification per-
formance. By choosing the proper subset of the fea-
tures, classification performance could be improved.
5 CONCLUSION
Parkinson’s disease (PD) is a neurodegenerative con-
dition characterized by a decrease in dopamine levels
in the brain. Currently, there is no known cure for PD,
but early diagnosis plays a crucial role in managing
the progression of the disease. In this study, an ap-
proach for early detection of PD using three feature
subsets obtained by speech analysis was proposed,
and the effect of feature selection on classification
performance was observed. MRMR based feature se-
lection was applied to define the most discriminative
features. The study revealed a potential improvement
in the classification performance by selecting impor-
tant features in speech-based PD diagnosis.
ACKNOWLEDGEMENTS
This work was supported by the Scientific and Tech-
nological Research Council of Turkey (T
¨
UB
˙
ITAK) as
2224-A project. The authors heartedly thank and ap-
preciate T
¨
UB
˙
ITAK for their support.
REFERENCES
Appakaya, S. B. and Sankar, R. (2018). Classification of
parkinson’s disease using pitch synchronous speech
analysis. In 2018 40th Annual International Confer-
ence of the IEEE Engineering in Medicine and Biol-
ogy Society (EMBC).
Appakaya, S. B., Sankar, R., and Ra, I.-H. (2020). Classifier
comparison for two distinct applications using same
data. In 9th International Conference on Smart Media
and Applications. SMA 2020: The 9th International
Conference on Smart Media and Applications.
Appakaya, S. B., Sankar, R., and Sheybani, E. (2021).
Novel unsupervised feature extraction protocol us-
ing autoencoders for connected speech: Application
in parkinson’s disease classification. In Wireless
Telecommunications Symposium (WTS), page 1–5.
IEEE.
Aydin, E. A. (2020). Subject-specific feature selection
for near infrared spectroscopy based brain-computer
interfaces. Computer Methods and Programs in
Biomedicine, 195:105535.
Boualoulou, N., Drissi, T. B., and Nsiri, B. (2023). Cnn
and lstm for the classification of parkinson’s disease
based on the gtcc and mfcc. Applied Computer Sci-
ence, 19(2):1–24.
Chu, S., Narayanan, S., and Kuo, C. C. J. (2009). Environ-
mental sound recognition with time–frequency audio
features. IEEE Transactions on Audio, Speech, and
Language Processing, 17(6):1142–1158.
Connolly, B. S. and Lang, A. E. (2014). Pharmacological
treatment of parkinson disease. JAMA, 311:1670.
Dimauro, G. and Girardi, F. (2019). Italian parkinson’s
voice and speech.
Dimauro, G., Nicola, V. D., Bevilacqua, V., Caivano, D.,
and Girardi, F. (2017). Assessment of speech intelli-
gibility in parkinson’s disease using a speech-to-text
system. IEEE Access, 5:22199–22208.
Hassan, F., Mehmood, M. H., Younis, B., Mehmood, N.,
Imran, T., and Zafar, U. (2022). Comparative analysis
of machine learning algorithms for classification of
environmental sounds and fall detection. International
Journal of Innovations in Science & Technology.
https://journal.50sea.com/index.php/IJIST/article/vie
w/188.
Hawi, S., Alhozami, J., AlQahtani, R., AlSafran, D.,
Alqarni, M., and Sahmarany, L. E. (2022). Automatic
parkinson’s disease detection based on the combina-
tion of long-term acoustic features and mel frequency
cepstral coefficients (mfcc). Biomedical Signal Pro-
cessing and Control, 78:104013.
Kumar, D., Satija, U., and Kumar, P. (2022). Automated
classification of pathological speech signals. In IEEE
19th India Council International Conference (INDI-
CON), pages 1–5.
Feature Selection Improves Speech Based Parkinson’s Disease Detection Performance
731