Web-based Demonstration of Semantic Similarity Detection Using Citation Pattern Visualization for a Cross Language Plagiarism Case

Bela Gipp, Norman Meuschke, Corinna Breitinger, Jim Pitman, Andreas Nürnberger

Abstract

In a previous paper, we showed that analyzing citation patterns in the well-known plagiarized thesis by K. T. zu Guttenberg clearly outperformed current detection methods in identifying cross-language plagiarism. However, that experiment was a proof of concept, and we did not provide a prototype. This paper presents a fully functional, web-based visualization of citation patterns for this verified cross-language plagiarism case, allowing users to interactively experience the benefits of citation pattern analysis for plagiarism detection. Using examples from the Guttenberg plagiarism case, we demonstrate that the citation pattern visualization reduces the effort an examiner needs to verify the extent of plagiarism.



Paper Citation


in Harvard Style

Gipp B., Meuschke N., Breitinger C., Pitman J. and Nürnberger A. (2014). Web-based Demonstration of Semantic Similarity Detection Using Citation Pattern Visualization for a Cross Language Plagiarism Case. In Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 2: ISS, (ICEIS 2014) ISBN 978-989-758-028-4, pages 677-683. DOI: 10.5220/0004985406770683


in Bibtex Style

@conference{iss14,
author={Bela Gipp and Norman Meuschke and Corinna Breitinger and Jim Pitman and Andreas Nürnberger},
title={Web-based Demonstration of Semantic Similarity Detection Using Citation Pattern Visualization for a Cross Language Plagiarism Case},
booktitle={Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 2: ISS, (ICEIS 2014)},
year={2014},
pages={677-683},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004985406770683},
isbn={978-989-758-028-4},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 16th International Conference on Enterprise Information Systems - Volume 2: ISS, (ICEIS 2014)
TI - Web-based Demonstration of Semantic Similarity Detection Using Citation Pattern Visualization for a Cross Language Plagiarism Case
SN - 978-989-758-028-4
AU - Gipp B.
AU - Meuschke N.
AU - Breitinger C.
AU - Pitman J.
AU - Nürnberger A.
PY - 2014
SP - 677
EP - 683
DO - 10.5220/0004985406770683