BRAZILIAN HEALTH-RELATED CONTENT WEB SEARCH PORTAL - Presentation on a Method for its Development and Preliminary Results

Felipe Mancini, Alex Esteves Jaccoud Falcão, Anderson Diniz Hummel, Thiago Martini Costa, Cristina Lucia Feijo Ortolani, Fabio Teixeira, Ivan Torres Pisa

Abstract

The amount of information available on the World Wide Web keeps growing. On one hand, this gives web users access to more information; on the other hand, web searches become increasingly difficult to handle because of the growing number of retrieved documents. This study proposes the development of a Brazilian search portal dedicated to health-related content. The aim is to provide web users, mainly non-specialists, with as many web pages as possible that are relevant to their search terms and inferred search intentions. The proposed search portal integrates web mining-based filters and a decision-making support tool. Preliminary results show that, among the algorithms tested for the health-related content filter module (artificial neural networks, logistic regression, and nearest neighbor clustering (NNC)), NNC yielded the automated health-related web content classifier with the best performance, with a sensitivity of 0.92 and a specificity of 1.00.
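
For readers unfamiliar with the two metrics reported above, the following is a minimal sketch of how sensitivity and specificity are computed for a binary classifier. It is not the authors' implementation: the function name `sensitivity_specificity`, the label convention (1 = health-related page, 0 = other page), and the toy labels are all assumptions made for illustration and are not data from the study.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels.

    Assumed convention (hypothetical, not from the paper):
    1 = health-related page, 0 = non-health page.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    tp = fn = tn = fp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1  # health page correctly accepted
        elif truth == 1 and pred == 0:
            fn += 1  # health page wrongly rejected
        elif truth == 0 and pred == 0:
            tn += 1  # non-health page correctly rejected
        else:
            fp += 1  # non-health page wrongly accepted

    # Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP).
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Toy labels invented for illustration only.
toy_truth = [1, 1, 1, 1, 0, 0, 0, 0]
toy_pred = [1, 1, 1, 0, 0, 0, 0, 0]
toy_sens, toy_spec = sensitivity_specificity(toy_truth, toy_pred)
print(toy_sens, toy_spec)
```

Under this convention, a specificity of 1.00 means that no non-health page in the test set was accepted by the filter, while a sensitivity of 0.92 means that 92% of the health-related pages were accepted.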



Paper Citation


in Harvard Style

Mancini F., Esteves Jaccoud Falcão A., Diniz Hummel A., Martini Costa T., Lucia Feijo Ortolani C., Teixeira F. and Torres Pisa I. (2009). BRAZILIAN HEALTH-RELATED CONTENT WEB SEARCH PORTAL - Presentation on a Method for its Development and Preliminary Results. In Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2009), ISBN 978-989-8111-63-0, pages 306-310. DOI: 10.5220/0001552303060310


in Bibtex Style

@conference{healthinf09,
author={Felipe Mancini and Alex Esteves Jaccoud Falcão and Anderson Diniz Hummel and Thiago Martini Costa and Cristina Lucia Feijo Ortolani and Fabio Teixeira and Ivan Torres Pisa},
title={BRAZILIAN HEALTH-RELATED CONTENT WEB SEARCH PORTAL - Presentation on a Method for its Development and Preliminary Results},
booktitle={Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2009)},
year={2009},
pages={306-310},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001552303060310},
isbn={978-989-8111-63-0},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2009)
TI - BRAZILIAN HEALTH-RELATED CONTENT WEB SEARCH PORTAL - Presentation on a Method for its Development and Preliminary Results
SN - 978-989-8111-63-0
AU - Mancini F.
AU - Esteves Jaccoud Falcão A.
AU - Diniz Hummel A.
AU - Martini Costa T.
AU - Lucia Feijo Ortolani C.
AU - Teixeira F.
AU - Torres Pisa I.
PY - 2009
SP - 306
EP - 310
DO - 10.5220/0001552303060310