SmartNews: Bringing Order into Comments Chaos

Marina Litvak, Leon Matz

Abstract

Various news sites exist today where internet audience can read the most recent news and see what other people think about. Most sites do not organize comments well and do not filter irrelevant content. Due to this limitation, readers who are interested to know other people’s opinion regarding any specific topic, have to manually follow relevant comments, reading and filtering a lot of irrelevant text. In this work, we introduce a new approach for retrieving and ranking the relevant comments for a given paragraph of news article and vice versa. We use Topic-Sensitive PageRank for ranking comments/paragraphs relevant for a user-specified paragraph/comment. The browser extension implementing our approach (called SmartNews) for Yahoo! News is publicly available.

References

  1. Agarwal, D., Chen, B.-C., and Pang, B. (2011). Personalized recommendation of user comments via factor models. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 7811, pages 571-582.
  2. Agichtein, E., Castillo, C., Donato, D., Gionis, A., and Mishne, G. (2008). Finding high-quality content in social media. In Proceedings of the international conference on Web search and web data mining, WSDM 7808, pages 183-194.
  3. Avrachenkov, K., Litvak, N., and Pham, K. S. (2007). Distribution of pagerank mass among principle components of the web.
  4. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107-117.
  5. Dalal, O., Sengemedu, S. H., and Sanyal, S. (2012). Multiobjective ranking of comments on web. In Proceedings of the 21st international conference on World Wide Web, pages 419-428.
  6. 9Unzip the archive, press ”Load unpacked extension”
  7. in ”Developer mode” of chrome ”Extensions” tool, and choose the unzipped plugin folder. 10Currently, we are performing an experiment aimed at
  8. 11http://news.yahoo.com/
  9. Hsu, C.-F., Khabiri, E., and Caverlee, J. (2009). Ranking comments on the social web. In Proceedings of the 2009 International Conference on Computational Science and Engineering - Volume 04, pages 90-97.
  10. Hu, M., Sun, A., and peng Lim, E. (2008). Commentsoriented document summarization: Understanding documents with readers feedback. In In Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval. SIGIR 08. ACM.
  11. Jindal, N. and Liu, B. (2008). Opinion spam and analysis. In Proceedings of the international conference on Web search and web data mining, WSDM 7808, pages 219- 230.
  12. Mihalcea, R., Tarau, P., and Figa, E. (2004). Pagerank on semantic networks, with application to word sense disambiguation. In In Proceedings of The 20st International Conference on Computational Linguistics (COLING 2004).
  13. Mishne, G. (2005). Blocking blog spam with language model disagreement. In In Proceedings of the First International Workshop on Adversarial Information Retrieval on the Web (AIRWeb).
  14. Mishne, G. (2007). Using blog properties to improve retrieval. In In Proceedings of the International Conference on Weblogs and Social Media (ICWSM 2007).
  15. Otterbacher, J., Erkan, G., and Radev, D. R. (2005). Using random walks for question-focused sentence retrieval. In In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP, pages 915-922.
  16. Otterbacher, J., Erkan, G., and Radev, D. R. (2009). Biased lexrank: Passage retrieval using random walks with question-based priors. Inf. Process. Manage., 45(1):42-54.
  17. Salton, G., Singhal, A., Mitra, M., and Buckley, C. (1997). Automatic text structuring and summarization. Information Processing and Management, 33(2):193-207.
  18. Salton, G., Yang, C., and Wong, A. (1975). A vector-space model for information retrieval. Communications of the ACM, 18.
  19. Sobek, M. (2003). A Survey of Google's PageRank. http://pr.efactory.de/.
  20. Szabo, G. and Huberman, B. A. (2010). Predicting the popularity of online content. Communications of the ACM, 53(8):80-88.
Download


Paper Citation


in Harvard Style

Litvak M. and Matz L. (2013). SmartNews: Bringing Order into Comments Chaos . In Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013) ISBN 978-989-8565-75-4, pages 191-196. DOI: 10.5220/0004618301910196


in Bibtex Style

@conference{kdir13,
author={Marina Litvak and Leon Matz},
title={SmartNews: Bringing Order into Comments Chaos},
booktitle={Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)},
year={2013},
pages={191-196},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004618301910196},
isbn={978-989-8565-75-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the International Conference on Knowledge Discovery and Information Retrieval and the International Conference on Knowledge Management and Information Sharing - Volume 1: KDIR, (IC3K 2013)
TI - SmartNews: Bringing Order into Comments Chaos
SN - 978-989-8565-75-4
AU - Litvak M.
AU - Matz L.
PY - 2013
SP - 191
EP - 196
DO - 10.5220/0004618301910196