techniques. For this reason, our first contribution
was focused on the development of a complete text-
mining solution to extract meaningful information
from clinical narrative reports. This solution was
implement “as-a-service” using the cTAKES
framework. The main goal was to provide an easy
and functional service that can detect relevant
clinical concepts and their respective interactions.
As such, our developed tool can easily detect
entities, concepts and relations contained in clinical
text-reports. In addition, this work provides a
semantic knowledge base resulting from the
application of our method. This was built from the
clinical annotations retrieved from our text-mining
system. The constructed knowledge base was built
using Linked Data standards to facilitate the
application of several knowledge discovery
mechanisms, such as reasoning. Our final goal is to
make this radiology knowledge base freely available
through Physionet Web site
(http://www.physionet.org/). This will empower
novel discovery methods due to the existence of a
well-structured clinical report data.
ACKNOWLEDGEMENTS
This work has received support from the EU/EFPIA
Innovative Medicines Initiative Joint Undertaking
(EMIF grant n° 115372). Pedro Sernadela is funded
by Fundação para a Ciência e Tecnologia (FCT)
under the grant agreement SFRH/BD/52484/2014.
Eriksson Monteiro is funded by FCT under the grant
agreement SFRH/BD/102195/2014. Sérgio Matos is
funded under the FCT Investigator programme.
REFERENCES
Baldridge, J., 2005. The opennlp project. URL:
http://opennlp. apache. org/index. html,(accessed 2
February 2012).
Bastiao Silva, L., Costa, C. & Oliveira, J.L., 2014.
Semantic Search over DICOM Repositories. In
Healthcare Informatics (ICHI), 2014 IEEE
International Conference on. IEEE, pp. 238–246.
Belleau, F. et al., 2008. Bio2RDF: towards a mashup to
build bioinformatics knowledge systems. Journal of
biomedical informatics, 41(5), pp.706–16. Available
at: http://www.sciencedirect.com/science/article/pii/
S1532046408000415 [Accessed June 25, 2015].
Berners-Lee, T., Hendler, J. & Lassila, O., 2001. The
semantic web. Scientific American, 284.5, pp.28–37.
Available at: http://isel2918929391.googlecode.com/
svn-history/r347/trunk/RPC/Slides/p01_theSemantic
Web.pdf [Accessed July 8, 2014].
Bird, S., 2006. NLTK. In Proceedings of the
COLING/ACL on Interactive presentation sessions -.
Morristown, NJ, USA: Association for Computational
Linguistics, pp. 69–72. Available at:
http://dl.acm.org/citation.cfm?id=1225403.1225421
[Accessed June 25, 2015].
Campos, D., Matos, S. & Oliveira, J.L., 2013a. Gimli:
open source and high-performance biomedical name
recognition. BMC bioinformatics, 14(1), p.54.
Available at: http://www.biomedcentral.com/1471-
2105/14/54 [Accessed June 25, 2015].
Campos, D., Matos, S. & Oliveira, J.L., 2013b. Neji: a tool
for heterogeneous biomedical concept identification.
Proceedings of BioLINK SIG, 2013, pp.28–31.
Cunningham, H., GATE, a General Architecture for Text
Engineering. Computers and the Humanities, 36(2),
pp.223–254. Available at: http://link.springer.com/
article/10.1023/A%3A1014348124664 [Accessed June
25, 2015].
Ferrucci, D. & Lally, A., 2004. UIMA: an architectural
approach to unstructured information processing in the
corporate research environment. Natural Language
Engineering, 10(3-4), pp.327–348. Available at:
http://journals.cambridge.org/abstract_S13513249040
03523 [Accessed June 25, 2015].
Gaillard, F. & Jones, J., 2009. Collaborative Radiology
Resources: Radiopaedia. org as an Example of a Web
2.0 Radiology Resource. In AMERICAN JOURNAL
OF ROENTGENOLOGY. AMER ROENTGEN RAY
SOC 1891 PRESTON WHITE DR, SUBSCRIPTION
FULFILLMENT, RESTON, VA 22091 USA.
Hahn, U. et al., 2008. An overview of JCoRe, the JULIE
lab UIMA component repository. In Proceedings of
the LREC. pp. 1–7.
Howe, D. et al., 2008. Big data: The future of biocuration.
Nature, 455(7209), pp.47–50. Available at:
http://dx.doi.org/10.1038/455047a [Accessed January
28, 2015].
Jonquet, C. et al., 2009. NCBO annotator: semantic
annotation of biomedical data. In International
Semantic Web Conference.
Kahn, C.E. & Thao, C., 2007. GoldMiner: a radiology
image search engine. AJR. American journal of
roentgenology, 188(6), pp.1475–8. Available at:
http://www.ajronline.org/doi/full/10.2214/AJR.06.174
0 [Accessed June 25, 2015].
Laleci, G.B., Yuksel, M. & Dogac, A., 2013. Providing
semantic interoperability between clinical care and
clinical research domains. IEEE journal of biomedical
and health informatics, 17(2), pp.356–69. Available
at: http://www.ncbi.nlm.nih.gov/pubmed/23008263
[Accessed May 22, 2015].
Leech, G., 1993. Corpus Annotation Schemes. Literary
and Linguistic Computing, 8(4), pp.275–281.
Available at: http://llc.oxfordjournals.org/content/
8/4/275.short [Accessed June 25, 2015].
Liu, S. et al., 2005. RxNorm: prescription for electronic
drug information exchange. IT Professional, 7(5),
pp.17–23. Available at: http://ieeexplore.ieee.org/