result of the process and (c) being a regulator of the
process two types (activator or inhibitor),
represented in the CM by the “Input”, “Output” and
"Regulator" classes respectively. Through our
CMHG we incorporate genomic data currently used
(e.g., dbSNP, Ensembl, etc.), achieving a conceptual
representation that meets the needs of the
bioinformatics domain. As we said earlier, this
evolution aims to improve the conceptual definition
of the human genome, and thus leave a conceptual
framework for further improvements.
4 CONCLUSIONS AND FUTURE
WORK
PM is going to change the way we have historically
understood medicine. The new practical context
associated with it requires a sound working
environment, and the correct application of the
adequate SE practices. We face this problem in this
work focusing on the need to design a holistic CM
intended to capture structurally all the relevant
domain information, together with the conceptual
complexity associated with the continuously
changing context of Precision Medicine.
We assume that conceptual modeling techniques
are the basic strategy to design and develop the
required sound and efficient Genomic Information
Systems (GeIS). Most of this work is devoted to
reporting how complicated keeping “alive” such a
CM is, especially due to the rapidly evolving
knowledge. The conceptual representation of basic
notions has been discussed, emphasizing that the
CM applied to this type of environment facilitates
the generation of systems that support decision-
making processes in the Bioinformatics domain. The
domain knowledge must always be prepared to
incorporate any required extension in order to meet
new needs. This is why the CM is not only useful
but also necessary. The initial version (v1) focused
on modeling "Genotyping" then sought to create a
semantic and content description. However, we had
to discuss multiple decisions before moving on to
our next CMHG v2. Version 2 is characterized by
the change in its central axis based on "genes" and
takes as its axis the concept of "Chromosome (and
chromosome elements)". This change was made to
simplify the schema and provide a more flexible
approach to extend it according to the domain
evolution. This new version gives us greater
precision, and allows us to manipulate data in a
more direct way. All these decisions have a direct
implication on how data are managed and
consequently on how data quality is to be assessed.
Future research work will focus to three main goals:
(1) the evolution of the CM by adding new genomic
concepts into the CM (i.e., haplotypes). (2) The
implementation of a complete ETL process, using
our CM. The ETL should be able to identify relevant
data for a particular phenotype, and to load them
conveniently in the DB that represents the
conceptual model. (3) develop a proper, unified
framework specifically for GeIS. The aim of this
framework is to complement the conceptual model
with a DQA procedure in order to ensure the quality
of the information represented and loaded by the
ETL. The achievement of these three goals will
provide the required support to the knowledge on
which PM is based.
ACKNOWLEDGEMENTS
This work has been supported by the MESCyT of the
Dominican Rep. and also has the support of
Generalitat Valenciana through project IDEO
(PROMETEOII/2014/039), the MICINN through
project DataME (ref: TIN2016-80811-P) and the
Research and Development Aid Program (PAID-01-
16) of the UPV under the FPI grant 2137.
REFERENCES
Aguilar, A. (2015). Medicina personalizada, medicina de
precisión, ¿cuán lejos estamos de la perfección?.
Carcinos 5 (2): 32-33.
Aronson, S., and Rehm, H. (2015). Building the
foundation for genomics in precision medicine. Nature
526 (7573): 336–342.
Jiménez, N. (2014). Una medicina nueva, más inteligente
y menos invasiva. FARMAESPAÑA INDUSTRIAL
pages 72-73.
León, A., Reyes R., J. F., Burriel, V., and Valverde, F.
(2016). Data Quality problems when integrating
genomic information. 3rd. QMMQ Workshop 2016
(Gifu, Japan), pages 173-182.
Instituto Nacional del Cáncer (2015). Medicina de
precisión en el tratamiento del cáncer. from
https://www.cancer.gov/espanol/cancer/tratamiento/tip
os/medicina-de-precision.
Olivé, A. (2007). Conceptual modeling of information
systems, Springer-Verlag. Berlin Heidelberg.
Paez, J.G., Jänne, P.A., Lee, J.C., Tracy, S., Greulich, H.,
Gabriel, S., Herman, P., Kaye, F.J., Lindeman, N.,
Boggon, T.J., and Naoki, K. (2004). EGFR mutations
in lung cancer: correlation with clinical response to
gefitinib therapy. Science 304(5676): 1497–1500
Pastor, O., Reyes J. F., and Valverde, F. (2016).
Conceptual Schema of the Human Genome (CSHG).
Tech. Rep. from http://hdl.handle.net/10251/67297.