(which includes the English Wikipedia). Moreover,
the application has been developed as an add-on tool
for Internet Explorer. The tool takes Google's
results as input and returns the answers together
with the original results.
The evaluation consisted of running one hundred
questions on the two systems. The questions were
constructed from the queries of the Web Tracks 2009
and 2010 (TREC Collections). A question is deemed
to have been answered correctly when all the correct
answers, with their respective texts, are returned.
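The correctness criterion above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the representation of an answer as an (answer, source text id) pair, and the choice not to penalize extra returned answers are ours and are not specified by the evaluation described here.

```python
from typing import Dict, Set, Tuple

# Assumption: an answer is modelled as an (answer string, source text id) pair.
Answer = Tuple[str, str]


def is_correct(returned: Set[Answer], gold: Set[Answer]) -> bool:
    """A question counts as correct only if every gold answer was returned.

    Extra returned answers are not penalized here (an assumption).
    """
    return bool(gold) and gold.issubset(returned)


def percent_correct(
    returned_by_question: Dict[str, Set[Answer]],
    gold_by_question: Dict[str, Set[Answer]],
) -> float:
    """Percentage of questions for which all gold answers were returned."""
    if not gold_by_question:
        raise ValueError("no questions to evaluate")
    correct = sum(
        is_correct(returned_by_question.get(qid, set()), gold)
        for qid, gold in gold_by_question.items()
    )
    return 100.0 * correct / len(gold_by_question)
```

Under this strict criterion, a question with two correct answers of which only one is returned counts as a failure, so the reported percentages are per-question rather than per-answer figures.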
Tables 1 and 2 report the percentage of correct
answers in the two systems for our dataset and show
that our application answers between 66% and 94% of
the questions correctly, depending on the system and
the collection.
Table 1: Percentage of correct answers for Indri.
             English Wikipedia   Category B
Percentage   75%                 94%

Table 2: Percentage of correct answers for Google.
             Google
Percentage   66%
6 CONCLUSIONS AND FUTURE WORK
This paper presents a novel idea that allows search
engines to answer natural language queries quickly
and locally. We have also presented a prototype
system that integrates the texts of the search
engine’s results, together with their syntactic
structure, into a unified ontology. Answers are then
extracted by applying reasoning tools to the ontology
and executing specific queries.
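As a rough illustration of the integrate-then-query idea summarized above, the following Python sketch merges facts extracted from several result texts into one store and answers a question by pattern matching. It is a toy stand-in, not the prototype: the triple representation, the function names and the sample facts are hypothetical, and no ontology reasoning is performed.

```python
from typing import List, Optional, Set, Tuple

# Assumption: each result text has already been parsed into
# (subject, predicate, object) triples by some upstream step.
Triple = Tuple[str, str, str]


def build_store(parsed_results: List[List[Triple]]) -> Set[Triple]:
    """Merge the triples from every result text into one unified store."""
    store: Set[Triple] = set()
    for triples in parsed_results:
        store.update(triples)  # duplicates across texts collapse
    return store


def query(
    store: Set[Triple],
    subject: Optional[str] = None,
    predicate: Optional[str] = None,
    obj: Optional[str] = None,
) -> List[Triple]:
    """Return the triples matching the pattern; None acts as a wildcard."""
    return sorted(
        t
        for t in store
        if (subject is None or t[0] == subject)
        and (predicate is None or t[1] == predicate)
        and (obj is None or t[2] == obj)
    )


# Hypothetical facts extracted from two result texts.
results = [
    [("Athens", "capital_of", "Greece")],
    [("Athens", "located_in", "Attica"), ("Athens", "capital_of", "Greece")],
]
store = build_store(results)
# "What is the capital of Greece?" expressed as a triple pattern.
answers = query(store, predicate="capital_of", obj="Greece")
```

A real system would replace the set of triples with an ontology and the pattern match with queries evaluated by a reasoner; the sketch only shows why merging all result texts first lets a single query draw on every source.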
Future improvements to the question answering
system could include a module that automatically
corrects questions the user has phrased incorrectly.
Moreover, we plan to speed up the whole computation
even further by employing various optimization
techniques.
ACKNOWLEDGEMENTS
This research has been co-financed by the European
Union (European Social Fund-ESF) and Greek
national funds through the Operational Program
“Education and Lifelong Learning” of the National
Strategic Reference Framework (NSRF)-Research
Funding Program: Heracleitus II. Investing in
knowledge society through the European Social
Fund.