A BAYESIAN NETWORK TO STRUCTURE A DATA QUALITY MODEL FOR WEB PORTALS

Angélica Caro, Coral Calero, Houari Sahraoui, Ghazwa Malak, Mario Piattini

Abstract

The technological advances and the use of the internet have favoured the appearance of a great diversity of web applications, among them Web portals. Through them, organizations develop their businesses in a highly competitive environment. One decisive factor for this competitiveness is the assurance of its data quality. In previous works, a data quality model for Web portals has been developed. The model is represented as a matrix that links the user expectations of data web quality to the portal functionalities. Into this matrix a set of 34 attributes where classified. However, the quality attributes on this model have not an operational structure, necessary to be used actual assessment. In this paper we present how we have structured these attributes by means of a probabilistic approach, using Bayesian Networks. The final objective is to use the Bayesian network obtained for evaluating the quality of a data portal (or a subset of its characteristics).

References

  1. Baldi, P. et al. (2003). Modeling the Internet and the Web; Probabilistic Methods and Algorithms. Wiley
  2. Bouzeghoub, M. and V. Peralta (2004). A Framework for Analysis of data Freshness. International Workshop on Information Quality in Information Systems, (IQIS2004), Paris, France, ACM.
  3. Burgess, M., et al. (2004). Quality Measures and The Information Consumer. Proceeding of the Ninth International Conference on Information Quality.
  4. Cappiello, C., et al. (2004). Data quality assessment from the users perspective. International Workshop on Information Quality in Information Systems, (IQIS2004), Paris, Francia, ACM.
  5. Caro , A., et al. (2006). Defining a quality model for portal data. International Conference on Web Engineering, ICWE-2006, Palo Alto, California, USA.
  6. Collins, H. (2001). Corporate Portal Definition and Features, AMACOM.
  7. Eppler, M., et al. (2003). Quality Criteria of ContentDriven Websites and Their Influence on Customer Satisfaction and Loyalty: An Empirical Test of an Information Quality Framework. Proceeding of the Eighth International Conference on Information Quality.
  8. Fugini, M., et al. (2002). Data Quality in Cooperative Web Information Systems.Personal Communication. citeseer.ist.psu.edu/fugini02data.html.
  9. Gertz, M., et al. (2004). "Report on the Dagstuhl Seminar "Data Quality on the Web"." SIGMOD Record vol. 33, Nº 1: 127-132.
  10. Graefe, G. (2003). Incredible Information on the Internet: Biased Information Provision and a Lack of Credibility as a Cause of Insufficient Information Quality. Proceeding of the Eighth International Conference on Information Quality.
  11. ISO-25012 (2006). " ISO/IEC 25012: Software Engineering - Software Quality Requirements and Evaluation (SQuaRE) - Data Quality Model (Draft)."
  12. Katerattanakul, P. and K. Siau (1999). Measuring Information Quality of Web Sites: Development of an Instrument. Proceeding of the 20th International Conference on Information System.
  13. Katerattanakul, P. and K. Siau (2001). Information quality in internet commerce desing. Information and Database Quality. M. Piattini, C. Calero and M. Genero, Kluwer Academic Publishers.
  14. Mahdavi, M., et al. (2004). A Collaborative Approach for Caching Dynamic Data in Portal Applications. Proceedings of the 5th conference on Australian database.
  15. Malak, G, Sahraoui, H, Badri, L, Badri, M. (2006). A Proposal of a Probabilistic Framework for Web-Based Applications Quality, Proceedings of the 10th ECOOP Workshop on Quantitative Approaches in ObjectOriented Software Engineering, (QAOOSE06).
  16. Melkas, H. (2004). Analyzing Information Quality in Virtual service Networks with Qualitative Interview Data. Proceeding of the Ninth International Conference on Information Quality.
  17. Moraga, M. Á., et al. (2004). Comparing different quality models for portals. To appear on Online Information Review, 2006.
  18. Moustakis, V., et al. (2004). Website Quality Assesment Criteria. Proceeding of the Ninth International Conference on Information Quality.
  19. Naumann, F. and C. Rolker (2000). Assesment Methods for Information Quality Criteria. Proceeding of the Fifth International Conference on Information Quality.
  20. Neil, M., Fenton, N.E., Nielsen, L., (2000). Building large-scale Bayesian Networks. The Knowledge Engineering Review, 15(3). 257-284
  21. Pressman, R. (2001). Software Engineering: a Practitioner's Approach. 5/e, McGraw-Hill.
  22. Redman, T. (2000). Data Quality: The field guide. Boston, Digital Press.
  23. Strong, D., et al. (1997). "Data Quality in Context." Communications of the ACM Vol. 40, Nº 5: 103 -110.
  24. Wang, R. and D. Strong (1996). "Beyond accuracy: What data quality means to data consumers." Journal of Management Information Systems; Armonk; Spring 1996 12(4): 5-33.
  25. Yang, Z., et al. (2004). "Development and validation of an instrument to measure user perceived service quality of information presenting Web portals." Information and Management. Elsevier Science 42: 575-589.
Download


Paper Citation


in Harvard Style

Caro A., Calero C., Sahraoui H., Malak G. and Piattini M. (2006). A BAYESIAN NETWORK TO STRUCTURE A DATA QUALITY MODEL FOR WEB PORTALS . In Proceedings of the First International Conference on Software and Data Technologies - Volume 2: ICSOFT, ISBN 978-972-8865-69-6, pages 147-152. DOI: 10.5220/0001312301470152


in Bibtex Style

@conference{icsoft06,
author={Angélica Caro and Coral Calero and Houari Sahraoui and Ghazwa Malak and Mario Piattini},
title={A BAYESIAN NETWORK TO STRUCTURE A DATA QUALITY MODEL FOR WEB PORTALS},
booktitle={Proceedings of the First International Conference on Software and Data Technologies - Volume 2: ICSOFT,},
year={2006},
pages={147-152},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001312301470152},
isbn={978-972-8865-69-6},
}


in EndNote Style

TY - CONF
JO - Proceedings of the First International Conference on Software and Data Technologies - Volume 2: ICSOFT,
TI - A BAYESIAN NETWORK TO STRUCTURE A DATA QUALITY MODEL FOR WEB PORTALS
SN - 978-972-8865-69-6
AU - Caro A.
AU - Calero C.
AU - Sahraoui H.
AU - Malak G.
AU - Piattini M.
PY - 2006
SP - 147
EP - 152
DO - 10.5220/0001312301470152