its purpose is to provide support in the detection of
the focus of infection through the area of influence
of the outbreak. We also compare the results with
those of a previous work (Pujante et al., 2020) using
the OrientDB database, showing differences between
languages and modeling.
2 BACKGROUND
Non-relational databases, among which the graph-
oriented databases, GDB, arise in response to the lim-
itations that relational databases present to face the
new challenges and trends that are emerging in the
current world of computing and data management.
The GBDs specify their scope of action in scenarios
in which it is necessary to work with highly connected
data sets, being designed for optimal storage and nav-
igation through the relationships established between
them. Thus, in this type of databases, the data are
stored as the nodes of a graph, and their relationships
as the edges, allowing to apply the theory of graphs
to traverse the data stored in the database (Angles and
Gutierrez, 2008).
Neo4j is a native graph-oriented database that uses
a model of labeled and property graphs. Labeled
graphs consist of graphs in which labels can be as-
signed (generally an integer, although they can also
be of another type of data, such as text strings) both to
their nodes and to their edges to distinguish them from
the rest. As for property graphs, they are weighted
graphs (their edges have some kind of numerical eval-
uation) in which properties can be defined in key-
value format on both nodes and edges. Neo4j uses
its own descriptive language, Cypher, with which
the queries are made by describing the nodes, edges
and properties to be obtained, having a very different
structure from that of SQL.
Regarding the study of infection diseases, re-
searchers of (Lose et al., 2019) focus on Tuberculosis
disease research from the biology perspective, repre-
senting biological entities in a labelled property graph
using Neo4J. From the theoretical point of view, sev-
eral models recently described the spread of disease
(Huang et al., 2016). However, little attention is paid
on the use of GDBs for infectious diseases from an
epidemiological perspective. There are several stud-
ies that explore the advantages that graph modeling
can offer for tracing the spread of epidemics. Exam-
ples of them are (Grande et al., 2015), where graphs
have been used to represent the spread of the hep-
atitis C virus, distinguishing different types of per-
son nodes according to certain characteristics, such
as sex and age; or (Chen et al., 2011), where the au-
thors used social network analysis with a contact net-
work. In (Y. Kai and Jun, 2018) weighted graphs has
been theoretically studied to simulate the spread of an
epidemic. In (M. Boman and Stenhem, 2006), three-
dimensional graphs are used to represent the analysis
of contacts between individuals in specific geographic
areas in which an outbreak has been detected. The
work of (Chen et al., 2007) used geographical infor-
mation to analyse outbreaks for SARS diseases. As
far as we know, there is not research on the litera-
ture of the use of GDBs for analysing infection spread
in hospitals considering spatial and temporal informa-
tion.
3 MODELING CLINICAL
EVENTS AND
SPATIO-TEMPORAL
RELATIONS
With this modeling we can represent a common case
like the following:
A patient arrives at the emergency room with se-
vere stomach pain and is treated by the digestive
medicine service, specifically by its emergency unit,
and remains in a box for 4 hours. She then goes to
the plant where another unit of the same service is re-
sponsible for her care for a day. The next day she un-
derwent an operation, which entailed a change of bed
for the operating room, although she was still treated
by the same unit in the same floor. At the end of the
surgery, she returns to the same room and bed where
she remains until he is discharged after three days.
It is necessary to model the clinical events, the
spatial configuration of the hospital, and the temporal
relations between the events and the locations of the
patient during the care trajectory. This case is mod-
eled as a graph in Figure 1.
Regarding the locations, it is necessary to estab-
lish a physical organization of the hospital that is spe-
cific enough to track the care trajectory, but also gen-
eral enough to adapt to the different architectures that
hospital may present. It is also required to have a flex-
ible structure that allows the grouping of Areas and
Zones of the same or different floor. This concept is
LogicalZone, which can also host other logical zones
under the sole condition that no cycles occur. It would
also be necessary to create another organization of
the space related to the functional distribution of the
hospital, with which the units and services that work
in it are represented. Thus, spatial modeling can be
subdivided into three distributions: physical, logical
and functional, which can be seen in a UML domain
HEALTHINF 2021 - 14th International Conference on Health Informatics
742