line (CIG). A well-known formal language used for
encoding CIGs, also adopted by the HL7 consortium,
one of the major standard in the world of Medical
Informatics, is the Arden Syntax (Hripcsak, 1994):
it is based on the ideas of composing CG as a clin-
ical work-flow of several Medical Logic Modules
(MLMs). Arden Syntax, like other existing languages
designed for addressing similar issues (e.g., (Ohno-
Machado et al., 1998; Quaglini et al., 2001; Shahar
et al., 1998; Poikonen, 1997)), focuses on tasks that
need to be done. This in order to follow a pathway of
clinical procedure, and the control structures needed
to do it. A good comparison between existing lan-
guages is presented in (Peleg et al., 2003; Mulyar
et al., 2007).
Remarkably, in the context of the comparison it
has also been highlighted the problem of retrieving
existing patients from the different data sources. Due
to this reason, a complete system able to process CIGs
could need a large number of other technologies, in
order to get the required data stored in the different
data sources. Beside all the related issues, such as
differences in the granularity of stored information or
different encoding for the same sort of data, the adop-
tion of an intermediate software layer able to stan-
dardise the way for representing a specific clinical
meaning is required.
To standardize the knowledge representation in
the clinical domain, a major contribution (in partic-
ular for anatomical issues and their relationships, dis-
eases and relations between them or the anatomical
regions involved, etc..) comes from languages like
Resource Description Framework Schema (RDFS) or
Web Ontology Language (OWL). These approaches,
mainly derived from the work done on the Semantic
Web topic, have spawned the growth of many other
formalisms like query languages (e.g. SPARQL)
or languages for rule engines (e.g. OWL-S) (Ye
et al., 2009; Rold
´
an-Garc
´
ıa et al., 2009). However,
the aforementioned technologies are more oriented to
propose a way for representing the generic knowledge
of the medical domain rather than describe work-
flows in an efficient and compact manner.
At the state of the art, even though a number of
approaches have been proposed (Sordo et al., 2003;
Sujansky and Altman, 1996), retrieving data from ex-
isting EHRs in an automatic and affordable manner is
still considered a very complex task, due to the hetero-
geneity of EHR encodings. Moreover, existing EHRs
rely on different architectures in terms of Data Bases
(DBs), operating systems and servers. This represents
a critical barrier to the further diffusion of DSSs in
hospitals, because it forces the user to a double data
entry, which inevitably leads to mistakes and updating
issues, or to employ very expensive ad-hoc solutions.
One of the most interesting proposal is i2b2
1
but it
still has few applications and is more tailored to solve
problems of data acquisition and presentation for data
statistical analysis, rather than in the medical domain.
Nevertheless, i2b2 represents a promising first pro-
posal in an emerging scenario.
In this paper we focus on CSide Language (CSL).
The project CSide, now closed, aimed at building a
monolitic, advanced, multi-centric EHR. Specifically,
CSL has been thought as a language for representing
CIGs. Differently from existing similar languages,
CSL uses a different perspective. It sacrifices the ex-
pressivity of the language –from the point of view of
the clinical work-flows– in order to simplify the link
to available EHR and the real format of data. The
general idea of CSL relies on the fact that the most
interesting CGs are not those written between spe-
cialists of the same area, but guidelines written for
specialists of different areas (e.g., guidelines written
by cardiologists in order to be used by General Prac-
titioners). Evidently, due to the significant gap of
knowledge, the suggested work-flows are usually sim-
pler than those designed to be exploited by specialists
within the same topic, and the expressivity of the lan-
guage can therefore be reduced in order to make eas-
ier the access to the data.
In this paper we shortly introduce the CSL lan-
guage, and we propose a first experimental investi-
gation of such technology. Specifically, we tested
CSL on three different CGs, with a medium level of
complexity and very different structure (workflows vs
truth tables vs time role in the guidelines) in the real
clinical context of the EHR in a medium-size center
for studying Thyroid diseases. Moreover, we tested
CSL on a different EHR, in use in the Cardiovascu-
lar Diagnostic Center (CDC) to build, from available
data, more abstract concepts, i.e. the Circumferential
and Systolic Stress (cESS).
The remainder of this paper is organised as fol-
lows. Firstly, we describe the CSL architecture. We
then provide the results of the performed analysis. Fi-
nally, discussions and conclusions are given.
2 CSL
The architecture of CSL includes two layers, which
exploit different languages. The two level structure
has been designed for de-coupling EHR and DSS in-
terfaces. In CSL it is possible to modify the EHR
1
i2b2, “Informatics for Integrating Biology and the Bed-
side,” [Online]. Available: https://www.i2b2.org/. [Ac-
cessed 07 02 2014]
HEALTHINF 2016 - 9th International Conference on Health Informatics
160