REPUTATION-BASED SELECTION OF WEB INFORMATION SOURCES

Donato Barbagallo, Cinzia Cappiello, Chiara Francalanci, Maristella Matera

Abstract

The paper compares Google’s ranking with the ranking obtained by means of a multi-dimensional source reputation index. The data quality literature defines reputation as a dimension of information quality that measures the trustworthiness and importance of an information source. Reputation is recognized as a multi-dimensional quality attribute. The variables that affect the overall reputation of an information source are related to the institutional clout of the source, to the relevance of the source in a given context, and to the general quality of the source’s information content. We have defined a set of variables measuring the reputation of Web information sources along these dimensions. These variables have been empirically assessed for the top 20 sources identified by Google as a response to 100 queries in the tourism domain. Then, we have compared Google’s ranking and the ranking obtained along each reputation variable for all queries. Results show that the assessment of reputation represents a tangible aid to the selection of information sources.

References

  1. Artz, D., Gil, Y., 2007. A survey of trust in computer science and the Semantic Web. J. Web Sem. 5(2): 58- 71.
  2. Anholt, S., 2009. Competitive Identity: The New Brand Management for Nations, Cities and Regions. Palgrave Macmillan, Eds.
  3. Atzeni, P., Merialdo, P., Sindoni, G., 2001. Web site evaluation: Methodology and case study. DASWIS2001: International Workshop on data Semantics in Web Information systems, Yokohama, Japan.
  4. Bagozzi, R. P., Yi, Y., 1988. On the evaluation of structural equation models, Journal of the Academy of Marketing Science, vol. 16, no. 1, pp. 74-94.
  5. Barbagallo, D., Cappiello, C., Francalanci, C., Matera, M., 2009. Reputation Based Self-Service Environments, ComposableWeb 2009: International Workshop on lightweight integration on the Web, San Sebastian, Spain, 2009, pp.12-17.
  6. Barbagallo, D., Cappiello, C., Francalanci, C., Matera, M., 2010. A Reputation-based DSS: the INTEREST Approach, ENTER: International Conference On Information Technology and Travel&Tourism, February 2010.
  7. Batini, C., Cappiello, C., Francalanci, C., Maurino, A., 2009. Methodologies for data quality assessment and improvement. ACM Computing Surveys, 41(3).
  8. Balasubramaniam, S., Lewis, G. A., Simanta S., Smith, D. B., 2008. Situated Software: Concepts, Motivation, Technology, and the Future. IEEE Software, Nov-Dec 2008, pp. 50-55.
  9. Brin, S., Page, L., 1998. The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks, 30(1-7): 107-117.
  10. Chen, X., Ding, C., 2008. QoS Based Ranking for Web Search. Proc. of International Conference on Web Intelligence and Intelligent Agent Technology, pp. 747-750.
  11. Chen, K., Zhang, Y., Zheng, Z., Zha, H., Sun, G., 2008. Adapting ranking functions to user preference. Data Engineering Workshop, ICDEW, pp.580-587.
  12. DeStefano, D. LeFevre, J. A., 2007. Cognitive load in hypertext reading: A review. Computers in Human Behavior 23 (3), pp. 1616-1641.
  13. Fornell, C., Larcker, D. F., 1981. Evaluating structural equation models with unobservable variables and measurement errors: Algebra and statistics. Journal of Marketing Research, vol. 18, no. 3, pp. 383-388.
  14. Gackowski, Z., 2006. Redefining information quality: the operations management approach. Eleventh International Conference on Information Quality (ICIQ06). Boston, MA, USA, 399-419.
  15. Gupta, S., Jindal, A., 2008. Contrast of link based web ranking techniques. International Symposium on Biometrics and Security Technologies (ISBAST), pp. 1-6.
  16. Hair, J., Anderson, R., Tatham, R., Black, W., 1998. Multivariate data analysis (5th ed.). Upper Saddle River, New Jersey: Prentice Hall.
  17. Jaccard, J., Choi, K. W., 1996. LISREL approaches to interaction effects in multiple regression. Thousand Oaks, CA: Sage Publications.
  18. Jiang, S., Zilles, S., Holte, R., 2008. Empirical Analysis of the Rank Distribution of Relevant Documents in Web Search. International Conference on Web Intelligence and Intelligent Agent Technology, pp. 208-213.
  19. Kendall, M. G., Babington Smith, B., 1938. Randomness and Random Sampling Numbers. Journal of the Royal Statistical Society, 101(1), pp. 147-166.
  20. Kleinberg, J. M., 1999. Hubs, authorities, and communities. ACM Comput. Surv. 31(4es): 5.
  21. Lamberti, F., Sanna, A., Demartini, C., 2009. A RelationBased Page Rank Algorithm for Semantic Web Search Engines. IEEE Transactions on Knowledge and Data Engineering, 21(1), pp. 123 - 136.
  22. Louta, M., Anagnostopoulos, I., Michalas, A., 2008. Efficient internet search engine service provisioning exploiting a collaborative web result ranking mechanism. IEEE International Conference on Systems, Man and Cybernetics, pp. 1477-1482.
  23. Mare, R. D., 1980. Social background and school continuation decisions. Journal of, the American Statistical Association, 75, pp. 295-305.
  24. Mecella, M., Scannapieco, M., Virgillito, A., Baldoni, R., Catarci, T., Batini, C., 2003. The DaQuinCIS Broker: Querying Data and Their Quality in Cooperative Information Systems. J. Data Semantics, 1, pp. 208- 232.
  25. Skopik, F., Truong, H. L., Dustdar, S., 2009. Trust and Reputation Mining in Professional Virtual Communities. International Conference on Web Engineering, pp. 76-90.
  26. Yu, J., Benatallah, B., Saint-Paul, R., Casati, F., Daniel, F., Matera, M., 2007. A framework for rapid integration of presentation components. International Conference on the World Wide Web, pp. 923-932.
  27. Yu, J., Benatallah, B., Casati, F., Daniel, F., Matera, M. Saint-Paul, R., 2007. Mixup: A Development and Runtime Environment for Integration at the Presentation Layer. International Conference on Web Engineering, pp. 479-484
Download


Paper Citation


in Harvard Style

Barbagallo D., Cappiello C., Francalanci C. and Matera M. (2010). REPUTATION-BASED SELECTION OF WEB INFORMATION SOURCES . In Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS, ISBN 978-989-8425-07-2, pages 30-37. DOI: 10.5220/0002908400300037


in Bibtex Style

@conference{iceis10,
author={Donato Barbagallo and Cinzia Cappiello and Chiara Francalanci and Maristella Matera},
title={REPUTATION-BASED SELECTION OF WEB INFORMATION SOURCES},
booktitle={Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS,},
year={2010},
pages={30-37},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002908400300037},
isbn={978-989-8425-07-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Conference on Enterprise Information Systems - Volume 4: ICEIS,
TI - REPUTATION-BASED SELECTION OF WEB INFORMATION SOURCES
SN - 978-989-8425-07-2
AU - Barbagallo D.
AU - Cappiello C.
AU - Francalanci C.
AU - Matera M.
PY - 2010
SP - 30
EP - 37
DO - 10.5220/0002908400300037