SRL based Plagiarism Detection System for Malayalam Documents

Automatic techniques of measuring plagiarism between documents have gained importance in the recent years because of the availability of enormous volume of information over the internet. . The most general form of detecting plagiarism is by computing similarity between a source document and a possib...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of computer science issues 2015-11, Vol.12 (6), p.91-91
Hauptverfasser: Sindhu, L, Idicula, Sumam Mary
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Automatic techniques of measuring plagiarism between documents have gained importance in the recent years because of the availability of enormous volume of information over the internet. . The most general form of detecting plagiarism is by computing similarity between a source document and a possibly plagiarised document. Existing plagiarism detection systems are mainly designed for detection in English.. Moreover, plagiarism detection systems using natural language processing techniques are still very limited. Automated plagiarism detection systems so far have involved minimal syntactic and semantic linguistic techniques. Even though, in some systems shallow techniques have been included as part of the preprocessing stage, studies involving deep techniques are less. Very negligible research has been done for plagiarism detection in Malayalam text documents. This paper presents a method for plagiarism detection in Malayalam documents based on extracting the semantic roles and computing their similarity to detect plagiarism. The technique can detect documents created by direct copy methods, replacement of words with similar ones , changing the order of words or restructuring the sentences and also converting the sentence from active/ passive to passive/active.
ISSN:1694-0814
1694-0784