Retrieving Similar Documents from the Web

Authors

  • A.R. Pereira Jr, Federal University of Minas Gerais, Av. Antonio Carlos, 6627, Pampulha, Belo Horizonte, 31270-901, Brazil
  • N. Ziviani, Federal University of Minas Gerais, Av. Antonio Carlos, 6627, Pampulha, Belo Horizonte, 31270-901, Brazil

Keywords:

retrieving similar documents, web, document similarity, fingerprint, plagiarism

Abstract

This paper presents a mechanism for detecting and retrieving documents from the web that are similar to a given suspicious document. The process has three stages: (a) generating a "fingerprint" of the suspicious document, (b) gathering candidate documents from the web, and (c) comparing each candidate document with the suspicious document.
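
The three stages can be illustrated with a minimal Python sketch. The abstract does not say how the fingerprint is built, how candidates are fetched, or which similarity measure is used, so everything here is an assumption made for illustration: the fingerprint is a handful of word shingles chosen by hash value, the web search is a hypothetical callable named `search` (stubbed with an in-memory corpus), and the comparison is Jaccard similarity over shingles. None of the function names come from the paper.

```python
import hashlib
import re


def shingles(text, k=3):
    """Return the set of k-word shingles of a text (lowercased, punctuation dropped)."""
    words = re.findall(r"\w+", text.lower())
    return {" ".join(words[i:i + k]) for i in range(len(words) - k + 1)}


def fingerprint(text, k=3, keep=4):
    """Stage (a), assumed scheme: keep the `keep` shingles with the smallest SHA-1 digests."""
    def digest(s):
        return hashlib.sha1(s.encode("utf-8")).hexdigest()
    return sorted(shingles(text, k), key=digest)[:keep]


def gather_candidates(queries, search):
    """Stage (b): send each fingerprint entry to `search` and merge the results by URL.

    `search` is a hypothetical callable mapping a query string to a list of
    (url, text) pairs; a real system would call a search engine here.
    """
    candidates = {}
    for query in queries:
        for url, text in search(query):
            candidates.setdefault(url, text)
    return candidates


def similarity(a, b, k=3):
    """Stage (c), assumed measure: Jaccard similarity of the two shingle sets."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def rank_similar(suspicious, search, k=3, keep=4):
    """Run the three stages and return (score, url) pairs, most similar first."""
    queries = fingerprint(suspicious, k, keep)
    candidates = gather_candidates(queries, search)
    scored = [(similarity(suspicious, text, k), url) for url, text in candidates.items()]
    return sorted(scored, reverse=True)


# Toy stand-in for the web: two pages held in memory.
CORPUS = {
    "http://example.org/a": "the quick brown fox jumps over the lazy dog",
    "http://example.org/b": "an unrelated page about cooking pasta at home",
}


def toy_search(query):
    """Return every toy page whose text contains the query phrase."""
    return [(url, text) for url, text in CORPUS.items() if query in text.lower()]
```

Calling `rank_similar("the quick brown fox jumps over the lazy dog", toy_search)` retrieves only the first toy page, because no fingerprint phrase occurs in the second. Selecting shingles by smallest hash is just one deterministic way to pick query phrases; other selection strategies would slot into `fingerprint` without changing the other stages.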

 

Published

2004-06-11

How to Cite

Pereira Jr, A., & Ziviani, N. (2004). Retrieving Similar Documents from the Web. Journal of Web Engineering, 2(4), 247–261. Retrieved from https://journals.riverpublishers.com/index.php/JWE/article/view/4357

Section

Articles