FINDING DISTINCT ANSWERS IN WEB SNIPPETS

Alejandro Figueroa, Günter Neumann

Abstract

This paper presents ListWebQA , a question answering system aimed specifically at discovering answers to list questions in web snippets. ListWebQA retrieves snippets likely to contain answers by means of a query rewriting strategy, and extracts answers according to their syntactic and semantic similarities afterwards. These similarities are determined by means of a set of surface syntactic patterns and a Latent Semantic Kernel. Results show that our strategy is effective in strengthening current web question answering techniques.

References

  1. Cederberg, S. and Windows, D. (2003). Using lsa and noun coordination information to improve the precision and recall of automatic hyponymy extraction. In Conference on Natural Language Learning (CoNLL-2003), pages 111-118, Edmonton, Canada.
  2. Hearst, M. (1992). Automatic acquisition of hyponomys from large text corpora. In Fourteenth International Conference on computational Linguistics, pages 539- 545, Nantes, France.
  3. Katz, B., Bilotti, M., Felshin, S., Fernandes, A., Hildebrandt, W., Katzir, R., Lin, J., Loreto, D., Marton, G., Mora, F., and Uzuner, O. (2004). Answering multiple questions on a topic from heterogeneous resources. In TREC 2004, Gaithersburg, Maryland.
  4. Katz, B., Lin, J., Loreto, D., Hildebrandt, W., Bilotti, M., Felshin, S., Fernandes, A., Marton, G., and Mora, F. (2003). Integrating web-based and corpus-based techniques for question answering. In TREC 2003, pages 426-435, Gaithersburg, Maryland.
  5. Katz, B., Marton, G., Borchardt, G., Brownell, A., Felshin, S., Loreto, D., Louis-Rosenberg, J., Lu, B., Mora, F., Stiller, S., Uzuner, O., and Wilcox, A. (2005). External knowledge sources for question answering. In TREC 2005, Gaithersburg, Maryland.
  6. Schone, P., Ciany, G., Cutts, R., Mayfield, J., and Smith, T. (2005). Qactis-based question answering at trec 2005. In TREC 2005, Gaithersburg, Maryland.
  7. Shawe-Taylor, J. and Cristianini, N. (2004). Kernel methods for pattern analysis, chapter 10, pages 335-339. Cambridge University Press.
  8. Shinzato, K. and Torisawa, K. (2004a). Acquiring hyponymy relations from web documents. In HLTNAACL 2004, pages 73-80, Boston, MA, USA.
  9. Shinzato, K. and Torisawa, K. (2004b). Extracting hyponyms of prespecified hypernyms from itemizations and headings in web documents. In COLING 7804, pages 938-944, Geneva, Switzerland.
  10. Sombatsrisomboon, R., Matsuo, P., and Ishizuka, M. (2003). Acquisition of hypernyms and hyponyms from the www. In 2nd International Workshop on Active Mining, Maebashi, Japan.
  11. Voorhees, E. M. (2001). Overview of the trec 2001 question answering track. In TREC 2001, pages 42-51, Gaithersburg, Maryland.
  12. Voorhees, E. M. (2003). Overview of the trec 2003 question answering track. In TREC 2003, pages 54-68, Gaithersburg, Maryland.
  13. Wu, L., Huang, X., Zhou, Y., Zhang, Z., and Lin, F. (2005). Fduqa on trec2005 qatrack. In TREC 2005, Gaithersburg, Maryland.
  14. Yang, H. and Chua, T. (2004a). Effectiveness of web page classification on finding list answers. In SIGIR 7804, pages 522-523, Sheffield, United Kingdom.
  15. Yang, H. and Chua, T. (2004b). Web-based list question answering. In Proceedings of COLING 7804, pages 1277- 1283, Geneva, Switzerland.
Download


Paper Citation


in Harvard Style

Figueroa A. and Neumann G. (2008). FINDING DISTINCT ANSWERS IN WEB SNIPPETS . In Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST, ISBN 978-989-8111-27-2, pages 26-33. DOI: 10.5220/0001518900260033


in Bibtex Style

@conference{webist08,
author={Alejandro Figueroa and Günter Neumann},
title={FINDING DISTINCT ANSWERS IN WEB SNIPPETS},
booktitle={Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,},
year={2008},
pages={26-33},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001518900260033},
isbn={978-989-8111-27-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Fourth International Conference on Web Information Systems and Technologies - Volume 2: WEBIST,
TI - FINDING DISTINCT ANSWERS IN WEB SNIPPETS
SN - 978-989-8111-27-2
AU - Figueroa A.
AU - Neumann G.
PY - 2008
SP - 26
EP - 33
DO - 10.5220/0001518900260033