
sion trees to effectively capture intricate SCD data
relationships while minimizing over-fitting. It stands
out in handling noisy data and outliers, making it ro-
bust in real-world situations where data quality is a
concern. RF also provides valuable insights into fea-
ture importance, aiding in the identification of pivotal
variables. Additionally, its capacity for parallelization
ensures the efficient processing of sizable datasets. In
contrast, although R-SVM can handle non-linear rela-
tionships using the radial basis function, it may neces-
sitate meticulous parameter tuning and feature scal-
ing, rendering it somewhat more intricate in specific
contexts.
Also RSVM tends to outperform LSVM, PSVM,
and SSVM models due to its adeptness at manag-
ing intricate, non-linear data relationships. While
LSVM is confined to straight lines or hyper planes
for class separation, RSVM employs the radial ba-
sis function kernel, enabling it to transform data into
a higher-dimensional space. In this space, complex
non-linear relationships are more accurately captured.
Although PSVM and SSVM also employ non-linear
kernels, they often struggle with intricate data pat-
terns. RSVM, with its radial basis function kernel,
excels in scenarios where class boundaries are intri-
cate and not easily defined geometrically. Its ability
to adapt to data intricacies results in a more flexible
and accurate decision boundary.
While previous studies have made significant
strides in SCD and ML, our research introduces nov-
elty by utilizing real-time blood pathology SCD data
and diverse ML techniques. This approach is crucial
for disease severity/co-morbidities prediction, aid-
ing in diagnosis, disease monitoring, drug develop-
ment, regenerative medicine, and fundamental re-
search. The findings presented in the study also open
avenues for future research in the field of inherited
blood disorders. One potential direction involves the
exploration of advanced spectroscopic methods, with
a focus on refining techniques for real-time monitor-
ing and diagnosis which might also consider the inte-
gration of multi-omics approaches, combining spec-
troscopic insights with genomics, transcriptomics,
and metabolomics data to provide a more comprehen-
sive understanding of the molecular intricacies under-
lying these disorders.
4 CONCLUSION
This study conducted a comparative analysis of five
distinct machine learning techniques: Random Forest,
Linear Support Vector Machine, Radial Support Vec-
tor Machine, Polynomial Support Vector Machine,
and Sigmoid Support Vector Machine for classifying
disease severity in sickle cell patients. The system
predicts disease severity, guiding treatment and med-
ication dosage. Performance metrics were assessed
across all classifiers, revealing Random Forest as the
most accurate method with 88% accuracy, 82% re-
call, and 92% specificity. The study’s stability and
reliability were affirmed through performance evalua-
tion. Future work may explore more features from ad-
vanced spectroscopic methods and also deep learning
techniques for classification, contingent on obtaining
sufficient training data to harness deep learning’s full
potential.
ACKNOWLEDGEMENTS
We wish to thank the Sickle cell Institute (SCIC)
Raipur, Chhattisgarh, for providing pathology hema-
tological data to conduct this research work.
REFERENCES
da Silva, W. R., Silveira Jr, L., and Fernandes, A. B. (2020).
Diagnosing sickle cell disease and iron deficiency ane-
mia in human blood by raman spectroscopy. Lasers in
Medical Science, 35(5):1065–1074.
Das, P. K., Meher, S., Panda, R., and Abraham, A. (2019).
A review of automated methods for the detection of
sickle cell disease. IEEE reviews in biomedical engi-
neering, 13:309–324.
Dheyab, H. F., Ucan, O. N., Khalaf, M., and Mohammed,
A. H. (2020). Implementation a various types of
machine learning approaches for biomedical datasets
based on sickle cell disorder. In 2020 4th International
Symposium on Multidisciplinary Studies and Innova-
tive Technologies (ISMSIT), pages 1–6. IEEE.
Elsabagh, A., Elhadary, M., Elsayed, B., Elshoeibi, A. M.,
Ferih, K., Kaddoura, R., Alkindi, S., Alshurafa, A.,
Alrasheed, M., Alzayed, A., et al. (2023). Artificial
intelligence in sickle disease. Blood Reviews, page
101102.
Farota, S. B., Diallo, A. H., Ba, M. L., Camara, G., and
Diagne, I. (2022). An ai-based model for the predic-
tion of a newborn’s sickle cell disease status. In In-
ternational Conference on Innovations and Interdisci-
plinary Solutions for Underserved Areas, pages 96–
104. Springer.
Gollapalli, M., Alabdullatif, L., Alsuwayeh, F., Aljouali,
M., Alhunief, A., and Batook, Z. (2022). Text mining
on hospital stay durations and management of sickle
cell disease patients. In 2022 14th International Con-
ference on Computational Intelligence and Communi-
cation Networks (CICN), pages 1–6. IEEE.
Jain, P. and Gupta, S. (2023). Multi-exposure laser speckle
contrast imaging (meci)-based prediction of blood
Machine Learning-Based Disease Severity Prediction in Sickle Cell Patients: Spectroscopic Insights
55