Author's Paper Similarity Prediction based on the Similarity of Textual References to Visual Features

Mostafa Alli

Abstract

In this paper we introduce a mechanism to find similar papers of an author, based on the author’s previous publications. In other words, since the author(s) of a paper are more likely to publish similar work(s) to their paper, we use this intuition to seek related papers based on the visual similarity of those papers. The visuality here is the figures and or tables that are commonly used by authors to describe their method structure and/or the result of their experiments. Since similar works of authors are focused on solving similar problems as well as developing and improving similar techniques, we noticed that comparing these visual features among their publications would help to spot most similar papers of those authors. We call our method, Similarity of Textual References to Visual Features which means, we compare parts of content of any two arbitrary papers that have references to any figures and/or tables. In our experiment we show that how we can use this similarity together with other factors of a paper to form a Boolean function which helps to build an indexation for papers based on the number of their authors. In this way, we omit time consuming process of papers’ content determined analysis, such as, textual content analysis, building coauthor network, citation network etc. In addition, our Boolean function has the ability of adjusting level of Sensitivity. If we want to achieve higher accuracy of similar papers, the Boolean function needs to be enabled for more conditions.

References

  1. Alli, M. (2015). Papers similarity based on the summarization merits. In 2nd International Conference on Behavioral, Economic, and Socio-Cultural Computing (BESC2015)(Accepted).
  2. Bogers, T. and van den Bosch, A. (2008). Recommending scientific articles using citeulike. In Proceedings of the 2008 ACM Conference on Recommender Systems, RecSys 7808, pages 287-290, New York, NY, USA. ACM.
  3. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst., 30(1-7):107-117.
  4. Buyukkokten, O., Garcia-Molina, H., and Paepcke, A. (2001a). Accordion summarization for end-game browsing on pdas and cellular phones. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, CHI 7801, pages 213-220, New York, NY, USA. ACM.
  5. 14The denominator in Equation 3 will be equal to zero Buyukkokten, O., Garcia-Molina, H., and Paepcke, A. (2001b). Seeing the whole in parts: Text summarization for web browsing on handheld devices. In Proceedings of the 10th International Conference on World Wide Web, WWW 7801, pages 652-662, New York, NY, USA. ACM.
  6. Chen, H., Gou, L., Zhang, X., and Giles, C. L. (2011). Collabseer: a search engine for collaboration discovery. In Proceedings of the 2011 Joint International Conference on Digital Libraries, JCDL 2011, Ottawa, ON, Canada, June 13-17, 2011, pages 231-240.
  7. Good, N., Schafer, J. B., Konstan, J. A., Borchers, A., Sarwar, B., Herlocker, J., and Riedl, J. (1999). Combining collaborative filtering with personal agents for better recommendations. In Proceedings of the Sixteenth National Conference on Artificial Intelligence and the Eleventh Innovative Applications of Artificial Intelligence Conference Innovative Applications of Artificial Intelligence, AAAI 7899/IAAI 7899, pages 439-446.
  8. Han, S., He, D., Brusilovsky, P., and Yue, Z. (2013). Coauthor prediction for junior researchers. In Greenberg, A., Kennedy, W., and Bos, N., editors, Social Computing, Behavioral-Cultural Modeling and Prediction, volume 7812 of Lecture Notes in Computer Science, pages 274-283. Springer Berlin Heidelberg.
  9. He, Q., Pei, J., Kifer, D., Mitra, P., and Giles, L. (2010). Context-aware citation recommendation. In Proceedings of the 19th International Conference on World Wide Web, WWW 7810, pages 421-430.
  10. Hong, K., Jeon, H., and Jeon, C. (2013a). Advanced personalized research paper recommendation system based on expanded userprofile through semantic analysis. International Journal of Digital Content Technology and its Applications(JDCTA), 7(15):67-76.
  11. Hong, K., Jeon, H., and Jeon, C. (2013b). Personalized research paper recommendation system using keyword extraction based on userprofile. Journal of Convergence Information Technology(JCIT), 8(16):106-116.
  12. Levine, T. R. and Hullett, C. R. (2002). Eta squared, partial eta squared, and misreporting of effect size in communication research. Human Communication Research, 28(4):612-625.
  13. Ma, N., Guan, J., and Zhao, Y. (2008). Bringing pagerank to the citation analysis. Inf. Process. Manage., 44(2):800-810.
  14. Nykl, M., Jezek, K., Fiala, D., and Dostál, M. (2014). Pagerank variants in the evaluation of citation networks. J. Informetrics, 8(3):683-692.
  15. Pohl, S., Radlinski, F., and Joachims, T. (2007). Recommending related papers based on digital library access records. In Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 7807, pages 417-418, New York, NY, USA. ACM.
  16. Sayyadi, H. and Getoor, L. (2009). Futurerank: Ranking scientific articles by predicting their future pagerank. In In Proc. of the 9th SIAM International Conference on Data Mining, pages 533-544.
  17. Stanovich, K. E. (1986). Matthew effects in reading: Some consequences of individual differences in the acquisition of literacy. In Reading Research Quarterly, volume 22.
  18. Sugiyama, K. and Kan, M. (2010). Scholarly paper recommendation via user's recent research interests. In Proceedings of the 2010 Joint International Conference on Digital Libraries, JCDL 2010, Gold Coast, Queensland, Australia, June 21-25, 2010, pages 29-38.
  19. Sun, Y., Barber, R., Gupta, M., Aggarwal, C., and Han, J. (2011). Co-author relationship prediction in heterogeneous bibliographic networks. In Advances in Social Networks Analysis and Mining (ASONAM), 2011 International Conference on, pages 121-128.
  20. Tang, J., Sun, J., Wang, C., and Yang, Z. (2009). Social influence analysis in large-scale networks. In Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 7809, pages 807-816, New York, NY, USA. ACM.
  21. Tang, J., Zhang, D., and Yao, L. (2007). Social network extraction of academic researchers. In Data Mining, 2007. ICDM 2007. Seventh IEEE International Conference on, pages 292-301.
  22. Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., and Su, Z. (2008). Arnetminer: Extraction and mining of academic social networks. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 7808, pages 990-998, New York, NY, USA. ACM.
  23. Vellino, A. (2009). Recommending journal articles with pagerank ratings. In In Recommender Systems 2009.
  24. Vellino, A. (2010). A comparison between usage-based and citation-based methods for recommending scholarly research articles. Proceedings of the American Society for Information Science and Technology, 47(1):1-2.
  25. Wang, C., Han, J., Jia, Y., Tang, J., Zhang, D., Yu, Y., and Guo, J. (2010). Mining advisor-advisee relationships from research publication networks. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 7810, pages 203-212, New York, NY, USA. ACM.
Download


Paper Citation


in Harvard Style

Alli M. (2015). Author's Paper Similarity Prediction based on the Similarity of Textual References to Visual Features . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: DART, (IC3K 2015) ISBN 978-989-758-158-8, pages 637-643. DOI: 10.5220/0005629006370643


in Bibtex Style

@conference{dart15,
author={Mostafa Alli},
title={Author's Paper Similarity Prediction based on the Similarity of Textual References to Visual Features},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: DART, (IC3K 2015)},
year={2015},
pages={637-643},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005629006370643},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 1: DART, (IC3K 2015)
TI - Author's Paper Similarity Prediction based on the Similarity of Textual References to Visual Features
SN - 978-989-758-158-8
AU - Alli M.
PY - 2015
SP - 637
EP - 643
DO - 10.5220/0005629006370643