AUTOMATIC COLLECTION OF AUTHORSHIP INFORMATION FOR WEB PUBLICATIONS

Daniel Lichtnow, Ana Marilza Pernas, Edimar Manica, Fahad Kalil, José Palazzo M. de Oliveira, Valderi Reis Quietinho Leithardt

Abstract

Authorship is an important criterion for evaluating content quality. Web users frequently have to spend a lot of time in Web search engines to find information about an author's expertise. This paper presents an approach to help Web users with this task. The approach consists of a set of techniques for extracting information about authors from the Web and an architecture for an extraction tool. An application scenario is presented in which the user can view details about the author of a Web page while reading the document.


Paper Citation


in Harvard Style

Lichtnow D., Marilza Pernas A., Manica E., Kalil F., Palazzo M. de Oliveira J. and Reis Quietinho Leithardt V. (2010). AUTOMATIC COLLECTION OF AUTHORSHIP INFORMATION FOR WEB PUBLICATIONS. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST, ISBN 978-989-674-025-2, pages 339-344. DOI: 10.5220/0002773603390344


in Bibtex Style

@conference{webist10,
author={Daniel Lichtnow and Ana Marilza Pernas and Edimar Manica and Fahad Kalil and José Palazzo M. de Oliveira and Valderi Reis Quietinho Leithardt},
title={AUTOMATIC COLLECTION OF AUTHORSHIP INFORMATION FOR WEB PUBLICATIONS},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST},
year={2010},
pages={339-344},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002773603390344},
isbn={978-989-674-025-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST
TI - AUTOMATIC COLLECTION OF AUTHORSHIP INFORMATION FOR WEB PUBLICATIONS
SN - 978-989-674-025-2
AU - Lichtnow D.
AU - Marilza Pernas A.
AU - Manica E.
AU - Kalil F.
AU - Palazzo M. de Oliveira J.
AU - Reis Quietinho Leithardt V.
PY - 2010
SP - 339
EP - 344
DO - 10.5220/0002773603390344