Automatic Conceptual Analysis for Plagiarism Detection

Heinz Dreher
InSITE 2007  •  Volume 7  •  2007
To detect plagiarism, a target document (the suspect) must be compared with reference documents. Numerous automated systems perform this check at the text-string level. If the scope is kept constrained, as in within-cohort plagiarism checking, performance is quite reasonable. If, on the other hand, the focus is extended to a very large corpus such as the WWW, performance can degrade to an impracticable level. The three case studies presented in this paper give insight into text-string comparators, whilst the third also considers the new and promising conceptual analysis approach to plagiarism detection, which is now achievable thanks to the computationally efficient Normalised Word Vector algorithm. The paper concludes with a caution on the use of high-tech in the absence of high-touch.
Keywords: academic malpractice, conceptual analysis, conceptual footprint, semantic footprint, Normalised Word Vector, NWV, plagiarism.
