| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document Help

Title:TextProc - a natural language processing framework and its use as plagiarism detection system
Authors:ID Brezovnik, Janez (Author)
ID Ojsteršek, Milan (Author)
Files:.pdf RAZ_Brezovnik_Janez_i2011.pdf (438,30 KB)
MD5: 0CD78747728D9731172F2A23192A552F
PID: 20.500.12556/dkum/99b172da-976f-4b90-95b1-4ab69a66f7f1
 
Language:English
Work type:Unknown
Typology:1.01 - Original Scientific Article
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:A natural language processing framework called TextProc is described in this paper. First the frameworks software architecture is described. The architecture is made of several parts and all of them are described in detail. Natural language processing capabilities are implemented as software plug-ins. Plug-ins can be put together into processes that perform a practical natural processing function. Several practical TextProc processes are briefly described, like part-of-speech tagging, named entity tagging and others. One of those is capable to perform plagiarism detection on texts in Slovenian language, which is explained in detail. This process is actually used in digital library of University of Maribor. The integration of digital library with TextProc is also briefly described. At the end of this paper some ideas for future development are given.
Keywords:natural language processing, text processing, text mining, Slovenian language, plagiarism detection
Year of publishing:2011
PID:20.500.12556/DKUM-27306 New window
UDC:004.777
ISSN on article:2074-1316
COBISS.SI-ID:14856982 New window
NUK URN:URN:SI:UM:DK:4G2ITJG2
Publication date in DKUM:01.06.2012
Views:3022
Downloads:88
Metadata:XML DC-XML DC-RDF
Categories:Misc.
:
BREZOVNIK, Janez and OJSTERŠEK, Milan, 2011, TextProc - a natural language processing framework and its use as plagiarism detection system. International journal of education and information technologies [online]. 2011. [Accessed 8 January 2025]. Retrieved from: https://dk.um.si/IzpisGradiva.php?lang=eng&id=27306
Copy citation
  
Average score:
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Searching for similar works...Please wait....
Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:International journal of education and information technologies
Shortened title:Int. j. educ. inf. technol.
Publisher:WSEAS
ISSN:2074-1316
COBISS.SI-ID:23247143 New window

Secondary language

Language:English
Keywords:procesiranje naravnih jezikov, tekstovno procesiranje, detekcija plagiatov, slovenski jezik


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica