TOWARDS AN APPROACH BASED ON VERIFIABILITY ASPECTS TO HELP IN THE QUALITY EVALUATION OF TEXTUAL WEB PAGES

Daniel Lichtnow, Leandro Krug Wives, José Palazzo Moreira de Oliveira

2012

Abstract

This work presents an approach based on verifiability aspects to evaluate Web pages with textual content. In this work, verifiability refers to the existence of references to information sources. We assume that textual Web pages that reference their information sources tend to be better than pages that do not. We therefore present aspects related to the automatic identification of verifiability indicators in textual Web pages. For the given context, the results of preliminary experiments show that verifiability aspects can be useful for inferring the quality of Web texts addressed to users with little knowledge of a specific subject.
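As a rough illustration of what identifying verifiability indicators could look like, the following minimal Python sketch counts two simple signals in an HTML page: hyperlinks that point to other hosts and author-year style citation mentions in the visible text. The choice of these two signals, the function name `verifiability_indicators`, the `page_host` parameter, and the citation pattern are all assumptions made for this example; the abstract does not specify which indicators the authors use or how they extract them.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urlparse


class _LinkCollector(HTMLParser):
    """Collects href targets of <a> tags and the text content of a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)

    def handle_data(self, data):
        self.text_parts.append(data)


# Hypothetical pattern for author-year mentions such as "(Smith, 2004)"
# or "(Smith et al., 2004)"; an assumption, not taken from the paper.
CITATION_RE = re.compile(r"\([A-Z][A-Za-z\-]+(?: et al\.)?,? \d{4}\)")


def verifiability_indicators(html, page_host):
    """Return simple counts that may hint at references to sources.

    html      -- the page markup as a string
    page_host -- host name of the page itself, used to skip internal links
    """
    parser = _LinkCollector()
    parser.feed(html)

    # Keep only absolute links whose host differs from the page's own host.
    external = []
    for href in parser.hrefs:
        host = urlparse(href).netloc
        if host and host != page_host:
            external.append(href)

    text = " ".join(parser.text_parts)
    return {
        "external_links": len(external),
        "citation_mentions": len(CITATION_RE.findall(text)),
    }
```

A page could then be scored by calling `verifiability_indicators(page_html, "example.org")` and comparing the counts across candidate pages. How such counts should be weighted, or whether they correlate with quality for a given audience, is exactly the kind of question the preliminary experiments address and is not settled by this sketch.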



Paper Citation


in Harvard Style

Lichtnow D., Krug Wives L. and Palazzo Moreira de Oliveira J. (2012). TOWARDS AN APPROACH BASED ON VERIFIABILITY ASPECTS TO HELP IN THE QUALITY EVALUATION OF TEXTUAL WEB PAGES. In Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST, ISBN 978-989-8565-08-2, pages 689-694. DOI: 10.5220/0003935906890694


in Bibtex Style

@conference{webist12,
author={Daniel Lichtnow and Leandro Krug Wives and José Palazzo Moreira de Oliveira},
title={TOWARDS AN APPROACH BASED ON VERIFIABILITY ASPECTS TO HELP IN THE QUALITY EVALUATION OF TEXTUAL WEB PAGES},
booktitle={Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST},
year={2012},
pages={689-694},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0003935906890694},
isbn={978-989-8565-08-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 8th International Conference on Web Information Systems and Technologies - Volume 1: WEBIST
TI - TOWARDS AN APPROACH BASED ON VERIFIABILITY ASPECTS TO HELP IN THE QUALITY EVALUATION OF TEXTUAL WEB PAGES
SN - 978-989-8565-08-2
AU - Lichtnow D.
AU - Krug Wives L.
AU - Palazzo Moreira de Oliveira J.
PY - 2012
SP - 689
EP - 694
DO - 10.5220/0003935906890694