based query processing in oder to search over graph-
structured data, e. g., BLINKS (He et al., 2007). Some
approaches apply a predefined set of query templates,
e. g., (Sacaleanu et al., 2008), where the latter real-
izes a multilingual entailment-based question answer-
ing approach. Other methods for query interpretation
are using deep NLP, e. g., Powerset (Converse et al.,
2008) or large background knowledge created with
high effort, e. g., WolframAlpha. Document search
by fact retrieval is supported by DBPedia, Semantic
Wikis and documents enriched with RDFa or micro-
formats since the document search is processed by re-
trieving the ‘included’ facts.
Our approach is essentially different to these
search methods as it provides fact and document re-
trieval, formal, free text and mixed queries, and also
mixed results, i. e., documents with facts, while all
other existing approaches support only a subset of
this.
5 CONCLUSIONS AND FUTURE
WORK
The evaluation shows the power of our hybrid ap-
proach. It performs best if properties are involved
in the query since they alleviate the weakness of SA,
i. e., noise results caused by uncontrolled spreading.
Also in other cases, the combination performs quite
well due to the more precise fact retrieval results. If
no facts are available for a query, the search approach
performs semantic document retrieval using the meta-
data of the documents. Furthermore, our approach
delivers facts if they are available, and so the user
doesn’t need to browse the documents of the result
list. In future versions, to improve the precision if no
facts are matched, we are going to setup our seman-
tic network with edge weights which express the im-
portance of relations and we plan further evaluations
(e. g., with DBPedia). We also foresee to integrate this
approach in the digital library assistant DiLiA (Seifert
and Kruppa, 2010), by extending it to support com-
plex queries for expert users.
ACKNOWLEDGEMENTS
This research has been supported in part by the THE-
SEUS Program CTC, which is funded by the BMWi
(gn 01MQ07016), and the research project DiLiA
which is co-funded by the ERDF (gn 10140159). The
responsibility for this publication lies with the au-
thors.
REFERENCES
Converse, T., Kaplan, R. M., Pell, B., Prevost, S., Thione,
L., and Walters, C. (2008). Powerset’s natural lan-
guage wikipedia search engine. Report, Powerset inc.,
California.
Crestani, F. (1997). Application of spreading activation
techniques in information retrieval. Artificial Intelli-
gence Review, 11(6):453–482.
Goldschmidt, D. E. and Krishnamoorthy, M. (2005). Archi-
tecting a search engine for the semantic web. In AAAI
Workshop on Contexts and Ontologies.
Grothkast, A., Adrian, B., Schumacher, K., and Dengel, A.
(2008). OCAS: Ontology-based corpus and annota-
tion scheme. In Proc. of the High-level Information
Extraction Workshop 2008, pages 25–35.
He, H., Wang, H., Yang, J., and Yu, P. S. (2007). BLINKS:
Ranked keyword searches on graphs. In Proc. of
the International Conference on Management of Data,
pages 305–316.
Ladwig, G. and Tran, D. T. (2010). Combining keyword
translation with structured query answering for effi-
cient keyword search. In Proc. of the 7th Extended
Semantic Web Conference.
Nodine, M., Fowler, J., Ksiezyk, T., Perry, B., Taylor, M.,
and Unruh, A. (2000). Active information gathering
in InfoSleuth. International Journal of Cooperative
Information Systems, 9(1/2):3–28.
Sacaleanu, B., Orasan, C., Spurk, C., Ou, S., Ferrandez, O.,
Kouylekov, M., and Negri, M. (2008). Entailment-
based question answering for structured data. In
Posters and Demonstrations, 22nd Int. Conference on
Computational Linguistics, pages 29–32.
Sauermann, L., van Elst, L., and Dengel, A. (2007). PIMO
– a framework for representing personal information
models. In Proc. of I-SEMANTICS 2007, pages 270–
277.
Schumacher, K., Sintek, M., and Sauermann, L. (2008).
Combining fact and document retrieval with spread-
ing activation for semantic desktop search. In Proc.
of the 5th European Semantic Web Conference, pages
569–583.
Seifert, I. and Kruppa, M. (2010). A pool of topics: In-
teractive relational topic visualization for information
discovery. In Proc. of the Conference on Visual Infor-
mation Communication, pages 195–207.
Stieger, B. and Aleksy, M. (2009). Utilization of knowl-
edge management for service business processes im-
provement. In Proc. of the Int. Multiconference on
Computer Science and Information Technology, pages
171–175.
Tran, T., Cimiano, P., Rudolph, S., and Studer, R. (2007).
Ontolgy-based interpretation of keywords for seman-
tic search. In Proc. of the 6th Int. Semantic Web Con-
ference, pages 523–536.
Tran, T., Wang, H., Rudolph, S., and Cimiano, P. (2009).
Top-k exploration of query candidates for efficient
keyword search on graph-shaped (RDF) data. In
ICDE, pages 405–416.
Wang, H., Liu, Q., Penin, T., Fu, L., Zhang, L., Tran, T.,
Yu, Y., and Pan, Y. (2009). Semplore: A scalable IR
approach to search the web of data. Journal of Web
Semantics, 7(3):177–188.
SEARCHING WEB 3.0 CONTENT - A Semantic Search Approach
313