loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Jan Kasprzak ; Michal Brandejs ; Miroslav Kripač and Pavel Šmerk

Affiliation: Faculty of Informatics, Masaryk University, Czech Republic

Keyword(s): University, Plagiarism, Similar Documents, Cluster, Information System, Theses.

Related Ontology Subjects/Areas/Topics: Databases and Information Systems Integration ; Enterprise Information Systems ; Object-Oriented Database Systems ; Web Databases

Abstract: One of the drawbacks of e-learning methods such as Web-based submission and evaluation of students’ papers and essays is that it has become easier for students to plagiarize the work of other people. In this paper we present a computer-based system for discovering similar documents, which has been in use at Masaryk University in Brno since August 2006, and which will also be used in the forthcoming Czech national archive of graduate theses. We also focus on practical aspects of this system: achieving near real-time response to newly imported documents, and computational feasibility of handling large sets of documents on commodity hardware. We also show the possibilities and problems with parallelization of this system for running on a distributed cluster of computers.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 3.21.46.24

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Kasprzak, J.; Brandejs, M.; Kripač, M. and Šmerk, P. (2008). DISTRIBUTED SYSTEM FOR DISCOVERING SIMILAR DOCUMENTS. In Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS; ISBN 978-989-8111-36-4; ISSN 2184-4992, SciTePress, pages 437-440. DOI: 10.5220/0001687604370440

@conference{iceis08,
author={Jan Kasprzak. and Michal Brandejs. and Miroslav Kripač. and Pavel Šmerk.},
title={DISTRIBUTED SYSTEM FOR DISCOVERING SIMILAR DOCUMENTS},
booktitle={Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS},
year={2008},
pages={437-440},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001687604370440},
isbn={978-989-8111-36-4},
issn={2184-4992},
}

TY - CONF

JO - Proceedings of the Tenth International Conference on Enterprise Information Systems - Volume 4: ICEIS
TI - DISTRIBUTED SYSTEM FOR DISCOVERING SIMILAR DOCUMENTS
SN - 978-989-8111-36-4
IS - 2184-4992
AU - Kasprzak, J.
AU - Brandejs, M.
AU - Kripač, M.
AU - Šmerk, P.
PY - 2008
SP - 437
EP - 440
DO - 10.5220/0001687604370440
PB - SciTePress