Table 2: Our TREC experimental results.
              False     Partially True   True
Answers       124       29               254
Percentage    30.46%    7.12%            62.40%
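As a quick check of the arithmetic in Table 2, the percentages follow from the counts over the 407 judged answers (124 + 29 + 254). The sketch below is illustrative only: the function name `truncated_percent` is hypothetical, and truncating to two decimals instead of rounding is an assumption inferred from the reported figures.

```python
# Counts taken from Table 2.
counts = {"False": 124, "Partially True": 29, "True": 254}


def truncated_percent(count: int, total: int) -> float:
    """Share of `count` in `total` as a percentage, truncated to two decimals.

    Truncation (not rounding) is an assumption; it reproduces the table.
    """
    # Integer floor division avoids floating-point surprises near a boundary.
    return (count * 10000 // total) / 100


total = sum(counts.values())
percentages = {label: truncated_percent(c, total) for label, c in counts.items()}

for label, pct in percentages.items():
    print(f"{label}: {pct:.2f}%")
```

Because each value is truncated, the three percentages sum to slightly less than 100%.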
5.2 GeoNet⁵
To obtain a more practical evaluation, we designed a
dedicated test collection for evaluating web-based
question answering systems. We built this collection
from GeoNet, a web site containing millions of
records of locations around the world together with
their geographical characteristics, such as altitude,
latitude, and the respective province and country.
We gathered nearly 5000 records from this web
site using a special-purpose crawler and then
converted them to a ready-to-use XML format for
further applications.
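The conversion step can be pictured as follows. This is a minimal sketch under stated assumptions, not the crawler or schema actually used: the element names (`locations`, `location`), the field names, and the sample record are all hypothetical, chosen only to mirror the characteristics listed above (altitude, latitude, province, country).

```python
import xml.etree.ElementTree as ET


def records_to_xml(records):
    """Serialize a list of location dicts into a single XML string."""
    root = ET.Element("locations")
    for record in records:
        # The place name becomes an attribute; other fields become child elements.
        loc = ET.SubElement(root, "location", name=record["name"])
        for field, value in record.items():
            if field == "name":
                continue
            ET.SubElement(loc, field).text = str(value)
    return ET.tostring(root, encoding="unicode")


# Hypothetical record; the values are illustrative, not taken from GeoNet.
sample = [
    {
        "name": "Exampleville",
        "country": "Exampleland",
        "province": "Example Province",
        "latitude": 12.5,
        "altitude": 340,
    }
]

xml_text = records_to_xml(sample)
print(xml_text)
```

Storing each characteristic as its own element keeps the records easy to query with standard XML tools, which is one plausible reading of "ready-to-use".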
In Table 3, AMD1 denotes capitals of provinces, AMD2
big cities, AMD3 small cities, and AMD4 villages and
other small places.
To make the evaluation fair, before selecting the
queries for this test collection we divided all
countries of the world into three categories according
to their Internet access facilities: 1) developed
countries, such as Canada and China; 2) developing
countries, such as Argentina, Australia, and Belgium;
and 3) undeveloped countries, such as Angola,
Bahrain, Bhutan, Bolivia, Brazil, Burma, Chad, and
Congo.
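One way to use such a categorization when choosing queries is to draw locations from each category separately. The sketch below assumes an equal quota per category and a seeded random draw; neither detail is stated in the text, and the names `CATEGORY_OF`, `select_queries`, and the sample records are hypothetical. Only the countries named above are included in the mapping.

```python
import random

# Category membership as listed in the text (named examples only).
CATEGORY_OF = {
    "Canada": "developed", "China": "developed",
    "Argentina": "developing", "Australia": "developing", "Belgium": "developing",
    "Angola": "undeveloped", "Bahrain": "undeveloped", "Bhutan": "undeveloped",
    "Bolivia": "undeveloped", "Brazil": "undeveloped", "Burma": "undeveloped",
    "Chad": "undeveloped", "Congo": "undeveloped",
}


def select_queries(records, per_category, seed=0):
    """Pick up to `per_category` records from each country category.

    The equal quota is an assumption made for this sketch.
    """
    rng = random.Random(seed)
    by_category = {}
    for record in records:
        category = CATEGORY_OF.get(record["country"])
        if category is not None:  # skip countries with no known category
            by_category.setdefault(category, []).append(record)
    selected = []
    for category in sorted(by_category):
        pool = by_category[category]
        selected.extend(rng.sample(pool, min(per_category, len(pool))))
    return selected


# Hypothetical records; place names are placeholders.
countries = ["Canada", "Canada", "China", "Argentina", "Belgium",
             "Australia", "Chad", "Congo", "Angola", "Atlantis"]
records = [{"name": f"place-{i}", "country": c} for i, c in enumerate(countries)]
chosen = select_queries(records, per_category=2)
```

Sampling per category keeps any single group of countries from dominating the query set.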
6 CONCLUSIONS AND FUTURE WORK
Because NLP-based methods in question answering
systems are highly complex and inefficient, we
proposed a density-based algorithm that uses fuzzy
logic to provide high-quality answers. The algorithm
shows promising results even in a noisy, open-domain
environment such as the web.
Because databases for other question types are
difficult to construct, we have implemented the
algorithm only for spatial queries, but it can be
applied to other question types easily. We are
currently extending our system to handle “when” and
“who” questions.
APPENDIX
1 Natural Language Processing (NLP)
2 Knowledge base (KB)
3 http://www.wikipedia.com
4 http://wordnet.princeton.edu
5 http://earth-info.nga.mil/gns/html/index.html