WEB AUTHENTIC AND SIMILAR TEXTS DETECTION USING AR DIGITAL SIGNATURE

Marios Poulos, Nikos Skiadopoulos, George Bokos

Abstract

In this paper, we propose a new identification technique based on an autoregressive (AR) model, running in O(n) time and deployed as a web form, that creates a unique serial number for a text and detects authentic or similar texts. The implementation uses a 15th-order AR model, and the identification procedure uses the cross-correlation algorithm. Empirical investigation showed that the proposed method may serve as an accurate way to distinguish identical, similar, and conceptually different texts. Combined with SCI and DOI, this unique text identifier may help address many problems that the information society faces, such as plagiarism and clone detection, copyright-related issues, and tracking, and it may also support many facets of the education process, such as lesson planning and student evaluation. We highlight the advantages of the exported serial number while discussing its combination with DOI. Finally, the method may be used by the information services sector and the publishing industry as a standard serial-number identifier, as a copyright management system, or both.
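The pipeline named in the abstract (a 15th-order AR model followed by cross-correlation) can be illustrated with a minimal sketch. The abstract does not say how characters are turned into numbers, how the AR coefficients are estimated, or how the cross-correlation is normalised, so the choices below are assumptions made only for illustration: characters are mapped to zero-mean Unicode code points, the coefficients come from the Yule-Walker equations solved by Levinson-Durbin recursion, and similarity is the normalised zero-lag cross-correlation of two coefficient vectors. All function names are hypothetical.

```python
import math

ORDER = 15  # model order named in the abstract


def text_to_signal(text):
    """Map characters to zero-mean code points (an assumed encoding)."""
    if not text:
        raise ValueError("text must be non-empty")
    codes = [float(ord(ch)) for ch in text]
    mean = sum(codes) / len(codes)
    return [c - mean for c in codes]


def autocorrelation(signal, max_lag):
    """Biased autocorrelation r[0..max_lag]; O(n) for a fixed max_lag."""
    n = len(signal)
    return [
        sum(signal[i] * signal[i + lag] for i in range(n - lag)) / n
        for lag in range(max_lag + 1)
    ]


def levinson_durbin(r, order):
    """Solve the Yule-Walker equations for x[t] = sum_k a[k] * x[t-k] + e[t]."""
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:  # degenerate signal (e.g. constant text): stop early
            break
        k = (r[m] - sum(a[j] * r[m - j] for j in range(1, m))) / err
        updated = a[:]
        updated[m] = k
        for j in range(1, m):
            updated[j] = a[j] - k * a[m - j]
        a = updated
        err *= 1.0 - k * k
    return a[1:]


def ar_signature(text, order=ORDER):
    """Return the AR coefficient vector used here as the text's signature."""
    signal = text_to_signal(text)
    return levinson_durbin(autocorrelation(signal, order), order)


def similarity(sig_a, sig_b):
    """Normalised zero-lag cross-correlation of two signatures, in [-1, 1]."""
    mean_a = sum(sig_a) / len(sig_a)
    mean_b = sum(sig_b) / len(sig_b)
    dev_a = [x - mean_a for x in sig_a]
    dev_b = [y - mean_b for y in sig_b]
    num = sum(x * y for x, y in zip(dev_a, dev_b))
    den = math.sqrt(sum(x * x for x in dev_a) * sum(y * y for y in dev_b))
    return num / den if den else 0.0


if __name__ == "__main__":
    original = "The quick brown fox jumps over the lazy dog near the river bank."
    edited = "The quick brown fox leaps over the lazy dog near the river bank."
    print(similarity(ar_signature(original), ar_signature(original)))
    print(similarity(ar_signature(original), ar_signature(edited)))
```

Because the model order is fixed at 15, the autocorrelation pass dominates the cost and grows linearly with text length, which matches the O(n) claim. Any threshold for calling two texts "similar" would have to be calibrated empirically; none is given in the abstract.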



Paper Citation


in Harvard Style

Poulos M., Skiadopoulos N. and Bokos G. (2010). WEB AUTHENTIC AND SIMILAR TEXTS DETECTION USING AR DIGITAL SIGNATURE. In Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST, ISBN 978-989-674-025-2, pages 89-94. DOI: 10.5220/0002803600890094


in Bibtex Style

@conference{webist10,
author={Marios Poulos and Nikos Skiadopoulos and George Bokos},
title={WEB AUTHENTIC AND SIMILAR TEXTS DETECTION USING AR DIGITAL SIGNATURE},
booktitle={Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST},
year={2010},
pages={89-94},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002803600890094},
isbn={978-989-674-025-2},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 6th International Conference on Web Information Systems and Technology - Volume 1: WEBIST
TI - WEB AUTHENTIC AND SIMILAR TEXTS DETECTION USING AR DIGITAL SIGNATURE
SN - 978-989-674-025-2
AU - Poulos M.
AU - Skiadopoulos N.
AU - Bokos G.
PY - 2010
SP - 89
EP - 94
DO - 10.5220/0002803600890094