• Feature selection and replacing;
• Cross-validation;
• Hyper-parameters optimization per ML model;
• Collecting the best results considering the stratifi-
cation criteria.
Incorporation of the classification models into the
HSP allows predicting the type of sepsis according to
the incoming parameters from the various devices and
in an incremental way, as new data comes in.
Besides the models already integrated in the HSP,
a specially developed component allows users to pro-
vide their own datasets to train and evaluate new ML
models. The following features are provided:
• Upload files in text format with input data, allow-
ing the specification of a field separator to support
CSV or TSV file formats; From the input data, the
target class field must be selected;
• Specify the training procedure, using cross vali-
dation or simple dataset split;
• Select the target ML model and its hyper-
parameters;
• Evaluate results;
• Store or download the model for further use.
3.4 User Interface
The HSP was designed to be simple and easy to use,
while keeping the information organized in such a
way that the user has direct access to the most impor-
tant data. For this purpose, the interface was divided
in three main areas: Patients, Devices and Manage-
ment. Management contains all administrative tools
that are used to configure the platform and organiza-
tions. The other two sections contain specific data
about patients and devices available in each organiza-
tion. Furthermore, if a user is associated with more
than one organization it is possible to switch between
workspaces, gaining access to all data. In the follow-
ing paragraphs, a more detailed description of each
section of the interface is presented and discussed.
Once a user successfully logs into the system, the
patient’s overview page is displayed (Figure 3).
This view provides a list of the institution’s pa-
tients with the ability to sort and search according to
different criteria, for example search by study date,
study status or patient details. On the left side of
this view, the available search filters are organized in
groups to make the search task simple. In the cen-
tral section, a table containing a list of patients that
meet the search criteria is displayed, containing rele-
vant information that helps to identify the patient as
well as the number of studies and the status of the lat-
est study requested for each patient. If no search cri-
teria are specified, the list of all patients is displayed.
A summary of important information helps to iden-
tify patients and studies that require a technicians at-
tention in cases where the study was requested or
a physicians attention when the studies have already
retrieved data for the physician to analyse. In this ex-
ample (Figure 3), each patient listed contains one as-
sociated study. While the study of the first listed pa-
tient is in a status where the technician must analyse
and validate the device results, the second requires the
physician to decide about the patient diagnosis.
Each personal page contains clinical information
about the patient, the operations available to the user,
and a list of the studies performed. The most recent
study is highlighted to enable faster access. Here,
the physician can request a new study for the patient.
When taking this action, the physician is prompted
to introduce some optional information about the pa-
tient, such as temperature or blood pressure. After
submitting these data, a study pipeline is initialized
and the study can be managed by the technician. Dur-
ing the study, a workflow page is available display-
ing the five steps of the pipeline (Start, Status, Valida-
tion, Classification, Decision), which allow tracking
the study progress. In the first stage, the view gives
the indication that the technician must undertake the
required procedures to prepare the device to process a
blood sample. If any error is detected by the platform,
a notification is shown and the Start study button is
disabled. Otherwise, as soon the device is ready, the
technician may proceed and start the study.
In the next step (Figure 4), it is possible to keep
track of the device processing, visualizing at any time
the operation status of each module. As each device
finishes processing and has results available, the HSP
downloads the data from the devices into its database
and makes it available through the web interface, so
that the technician can begin validating these values,
thus speeding up the reporting process. When every
module finishes processing, the study advances to the
next step where the technician must submit a report.
In this stage, the technician is responsible for
analysing the results from the device and verifying
that no errors have occurred during the blood sample
processing. When the process finishes without errors
the technician must then validate the data and submit
the report, which will be then accessible by the physi-
cian. Otherwise, if it lacks quality, a new study may
be requested cancelling the current study and starting
a new one.
At this point, the platform combines the data pro-
vided by the physician with the one returned by the
A Computational Pipeline for Sepsis Patients’ Stratification and Diagnosis
411