5 CONCLUSIONS
In this paper, we described a case study in using on-
tologies for information extraction from clinical doc-
uments. We demonstrated that we managed to build
a system with a high sensitivity – a requirement for
the task of patient recruitment. Improving precision,
however, is still an issue. Future work will focus on
the elimination of false positives by allowing to con-
struct logically more complex criteria.
The experimental data suggest that the process
of patient identification benefits from extracting facts
from structured data. We are planning to obtain more
reliable results by considering more patients and tri-
als. Moreover, the software will be tested by other
departments, too.
A lesson learned in the area of ontologies is that
it can be much easier to construct an ontology for a
specific application instead of building or even using
a general-purpose ontology. However, we also feel
that the lack of German language resources hinders
progress in the domain of semantic technologies suit-
able for German text and web resources.
ACKNOWLEDGEMENTS
The BFG project is partially funded by TSB Tech-
nologiestiftung Berlin, Zukunftsfonds Berlin, and co-
financed by the European Union – European Fund for
Regional Development.
We would like to thank all computer scientists, lin-
guists, ontologists, medical doctors, study nurses, and
administrative staff who participated in the develop-
ment of the software.
REFERENCES
Bodenreider, O. (2004). The unified medical language sys-
tem (umls): integrating biomedical terminology. Nu-
cleic Acids Research, 32(Database-Issue):267–270.
Browne, P. (2009). Jboss Drools Business Rules. From
technologies to solutions. Packt Publishing, Limited.
Cowie, J. and Wilks, Y. (2000). Information extraction.
Handbook of Natural Lang. Proc., pages 241–260.
Dugas, M., Lange, M., Berdel, W., and M
¨
uller-Tidow, C.
(2008). Workflow to improve patient recruitment for
clinical trials within hospital information systems - a
case-study. Trials, 9(1):2.
Gallaire, H., Minker, J., and Nicolas, J.-M. (1984). Logic
and databases: A deductive approach. ACM Comput.
Surv., 16(2):153–185.
Jurafsky, D. and Martin, J. H. (2008). Speech and Language
Processing (2nd Edition) (Prentice Hall Series in Ar-
tificial Intelligence). Prentice Hall, 2 edition.
Kifer, M. (2008). Rule interchange format: The framework.
In Web Reasoning and Rule Systems, volume 5341 of
LLNCS, pages 1 – 11.
Kifer, M., Lausen, G., and Wu, J. (1995). Logical foun-
dations of object-oriented and frame-based languages.
Journal of the ACM, 42(4):741–843.
Lloyd, J. W. (1987). Foundations of Logic Programming,
2nd Edition. Springer.
M
¨
uller, F. (2005). A finite-state approach to shallow pars-
ing and grammatical functions annotation of German.
PhD thesis, University of T
¨
ubingen.
Murphy, S. N., Mendis, M. E., Berkowitz, D. A., and
Chueh, I. K. H. (2006). Integration of clinical and
genetic data in the i2b2 architecture. In AMIA Annu
Symp Proc, page 2009.
Polleres, A. (2007). From SPARQL to rules (and back).
In Williamson, C. L., Zurko, M. E., Patel-Schneider,
P. F., and Shenoy, P. J., editors, WWW, pages 787–796.
ACM.
Reeve, L. (2005). Survey of semantic annotation platforms.
In Proceedings of the 2005 ACM Symposium on Ap-
plied Computing, pages 1634–1638. ACM Press.
Rogers, F. B. (1963). Medical subject headings. Bull Med
Libr Assoc, 51:114 – 116.
Rosse, C. and Mejino, J. (2003). A reference ontology
for biomedical informatics: the foundational model of
anatomy. J Biomed Inform, 36:478–500.
Ruch, P., Gobeill, J., Lovis, C., and Geissb
¨
uhler, A. (2008).
Automatic medical encoding with SNOMED cate-
gories. BMC Medical Inf. and Dec. Making, 8:6.
Scheitz, J. F., Mochmann, H. C., Fiebach, B. W. B., Aude-
bert, H. J., and Nolte, C. H. (2012). J Neurol, 25.
Scheitz, J. F., Mochmann, H. C., Nolte, C. H., Haeusler,
K. G., Audebert, H. J., Heuschmann, P. U., Laufs, U.,
Witzenbichler, B., Schultheiss, H. P., and Endres, M.
(2011). Troponin elevation in acute ischemic stroke
(TRELAS) – protocol of a prospective observational
trial. M. BMC Neurol, 11(98).
Schwaber, K. and Beedle, M. (2001). Agile Software Devel-
opment with Scrum. Prentice Hall PTR, Upper Saddle
River, NJ, USA, 1st edition.
Shvaiko, P. and Euzenat, J. (2011). Ontology matching:
State of the art and future chall. IEEE TKDE, 99.
Staab, S. and Studer, R. (2009). Handbook on Ontologies.
Springer, 2nd edition.
Todorov, K., Geibel, P., and K
¨
uhnberger, K.-U. (2010).
Mining concept similarities for heterogeneous ontolo-
gies. In Perner, P., editor, Advances in Data Mining.
Applications and Theoretical Aspects, volume 6171 of
LNCS, pages 86–100. Springer Berlin / Heidelberg.
Yu, L. (2011). A Developers Guide the Semantic Web.
Springer.
KEOD2013-InternationalConferenceonKnowledgeEngineeringandOntologyDevelopment
236