overcoming the challenges associated with
conventional manual microscopic identification
techniques, we developed an AI-based diagnostic
system for ALL detection using Random Forest (RF),
XGBoost (XGb) and Decision Tree (DT) algorithms
(El Alaoui et al., 2023). The three models were
trained using 86 ALL and 86 control patients.
Moreover, a Grid Search hyperparameter tuning
technique was applied on each of the three
classification models, while a Forward Feature
Selection approach was performed to select the 10
most ALL-discriminatory complete blood count
(CBC) features out of the initial 20, which included:
Absolute Neutrophil Count (ANC), Hematocrit
(HCT), Red Blood Cells (RBCs), MCV, MCH,
Neutrophils %, Basophils %, Lymphocyte count,
MPV and Platelets. Training the models using a 5-
fold cross-validation technique resulted in high
accuracies corresponding to 91.4% for DT and an
identical 88.6% for both RF and XGb, respectively
(El Alaoui et al., 2023).
We also developed a ML-based diagnosis and
screening model for Chronic Lymphoblastic
Leukemia (CLL) using 3 ML techniques including
Linear Regression (LR), Linear Discriminant
Analysis (LDA) and XGb, selected out of 8 candidate
models following 5-fold cross-validation. The three
models were trained using a total of 682 CBC records,
where 88 were confirmed CLL patients and 594 were
control. Dataset imbalance and disproportionality
were overcome by means of (SMOTE-Synthetic
Minority Over-Sampling Technique), (Padmanabhan
et al., 2023). Moreover, the common high-ranked
CBC parameters were extracted using chi-square,
mutual information, extra tree and XGboost
classifiers. The final set of features included WBC,
Lymphocyte count, Neutrophil %, Lymphocyte %,
Monocyte %, ANC, Platelets, Basophil %, Monocyte
count, Basophil count and Eosinophil count, by order.
The final performances of the selected models using
the 11 chosen CBC parameters resulted in a 97.05%,
97.63% and 98.62% accuracy corresponding to QDA,
LR and XGb, respectively (Padmanabhan et al.,
2023).
Based on these novel results from the two
aforementioned studies, it is fair to mention that pre-
diagnosis delays can be effectively tackled through
the integration of AI models in diagnostic systems
and devices to foster early hematological disease
detection.
3.2 Case Study (b): Prediction of
Clinical Complications in Cancer
Patients
With the aim of expediating the medical diagnosis
process and enhancing its reliability for better health
outcomes and increased chances of patient survival,
we developed a novel AI-based approach to better
manage patients with hematological cancer, namely
the ones affected by therapy-induced
myelosuppression, multiple infections, and febrile
neutropenia (FN) (Padmanabhan et al., 2022).
In addition to the increasing sepsis risk and
mortality in hematologic cancer patients with FN
associated with treatment-induced myelosuppression,
a high prevalence of multidrug-resistant organisms is
also captured in such patients, which limits the
number of treatment options that the patient is
allowed amidst a similar set of health complications.
Therefore, the early identification of such organisms
within the cancer patient body can help prepare the
latter to receive a better treatment, enable good
hospital management and prevent the spread of such
organisms to the weaker category of patients
(Padmanabhan et al., 2022).
The current application serves as a predictive
model for multidrug-resistant organisms, sepsis, and
mortality risk in hematological cancer patients with
FN. The application consists of medical data
extraction using 1166 febrile neutropenia episodes
reported in 513 patients, trained using the XGboost
algorithm, a model selected out of 6 candidate models
using a 10-fold cross-validation. Furthermore, to
address the data disproportionality problem, data
augmentation techniques and model-scoring-based
hyperparameter tuning were used, and a set of
features were added to the model to enhance the
predictability of the previously mentioned clinical
complications. The performance for sepsis, multi-
drug resistant organism and mortality predictions
resulted in a 0.85, 0.91 and 0.88 AUC (area under the
curve), respectively, highlighting AI’s potential in
treatment clinical decision-making. Figure 2
represents the adopted methodology followed in
building the aforementioned predicted models.
4 CONCLUSIONS
The computational power of AI constitutes a very
strong leap in the field of hematology management,
such that it enables a deep level of understanding of
the unseen relations between clinical patient