
The Use of Extensible Markup Language (XML) to Analyse Medical 
Full Text Repositories – An Example from Homeopathy 
Thomas Ostermann, Marc Malik and Christa Raak  
Institute of Integrative Medicine, Witten/Herdecke University, Gerhard-Kienle-Weg 4, Herdecke, Germany 
 
Keywords:  Extensible Markup Language, Homeopathy, Repertorisation, Software. 
Abstract:  Extensible Markup Language (XML) is one of the most popular web languages in the life science used for 
for Semantic Data Analysis in various fields of clinical research. One of these fields is the processing of 
medical full texts. To extract meaningful information out of natural texts is one of the challenges when 
dealing with huge text repositories. We present an application of XML together with linguistic algorithms in 
the processing of texts from a homeopathic materia medica. Our approach enables the user not only to 
search within the symptom descriptions but also offers special features like sequential search within the 
results or the comparison of homeopathic remedies. However user demands of day to day practice and terms 
of information technology have both to be taken carefully into account to further develop this prototype.  
1 INTRODUCTION 
Extensible Markup Language (XML) is one of the 
most popular semantic web language in the life 
science with more than 900 publication between 
1999 and 2010 in PubMed (Ostermann et al., 2014). 
Today it is applied in knowledge transfer in the life 
science (Murray-Rust, 2000) In this field XML has 
managed to become an important tool in clinical 
laboratory procedures (Saadawi and Harrison, 2003) 
but its way into patient care still seems to be far 
behind the possibilities XML is offering.  
In particular the capability of Internet Browsers 
to read, edit and analyse XML documents creates a 
variety of opportunities for Semantic Data Analysis 
(SDA) facilities to be incorporated into clinical 
applications (Bompani et al., 2002).  
XML, when combined with web services, 
semantic data analsis and scripting languages such 
as Java script, can be used to offer a huge amount of 
functionality for the user including text retrieval and 
the generation of summary data through a standard 
web-browser.  
With regards to medical full texts, searching for 
a certain information sometimes is crucial. As 
already pointed out by Grivell in 2002 “natural 
language provides a considerable challenge for 
algorithms to extract meaningful information from 
natural text.”  
This even more becomes a complex problem 
when dealing with huge repositories from the field 
of traditional medical systems. In particular, 
machine readable dictionaries with a codification of 
domain knowledge and literature metadata in 
accordance with a generic and extendible XML 
scheme model have been shown to be suitable in this 
context Ostermann et al., 2009). One open problem 
in in this context is the semantic processing of the so 
called Materia Medicae. Such repositories contain 
structured data on medical symptoms and the 
corresponding remedies i.e. from the field of 
phytotherapy or, like in our case, from homeopathy.  
With a tradition of 200 years of patient care, 
homeopathy is one of the oldest integrative medical 
systems in the field of Traditional European 
Medicine. An essential part of homeopathic case 
taking is the conduction of a comprehensive 
anamnesis followed by individualized finding of a 
remedy that fits the conditions the patient describes. 
This is called repertorisation and today is done with 
the help of computer programs using modern 
database technology (Ostermann et al., 2012).  
Accoring to our own review and with respect to 
other personalized approaches in e-health (Lee et al., 
2008), XML-based processing of such vast resources 
might be beneficial in the complex process of 
homeopathic prescribing. 
219
Ostermann T., Malik M. and Raak C..
The Use of Extensible Markup Language (XML) to Analyse Medical Full Text Repositories – An Example from Homeopathy.
DOI: 10.5220/0005484002190224
In Proceedings of 4th International Conference on Data Management Technologies and Applications (DATA-2015), pages 219-224
ISBN: 978-989-758-103-8
Copyright
c
 2015 SCITEPRESS (Science and Technology Publications, Lda.)