| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document

Title:TextProc - a natural language processing framework and its use as plagiarism detection system
Authors:Brezovnik, Janez (Author)
Ojsteršek, Milan (Author)
Files:.pdf RAZ_Brezovnik_Janez_i2011.pdf (438,30 KB)
 
Language:English
Work type:Unknown ()
Typology:1.01 - Original Scientific Article
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:A natural language processing framework called TextProc is described in this paper. First the frameworks software architecture is described. The architecture is made of several parts and all of them are described in detail. Natural language processing capabilities are implemented as software plug-ins. Plug-ins can be put together into processes that perform a practical natural processing function. Several practical TextProc processes are briefly described, like part-of-speech tagging, named entity tagging and others. One of those is capable to perform plagiarism detection on texts in Slovenian language, which is explained in detail. This process is actually used in digital library of University of Maribor. The integration of digital library with TextProc is also briefly described. At the end of this paper some ideas for future development are given.
Keywords:natural language processing, text processing, text mining, Slovenian language, plagiarism detection
Year of publishing:2011
UDC:004.777
ISSN on article:2074-1316
COBISS_ID:14856982 Link is opened in a new window
NUK URN:URN:SI:UM:DK:4G2ITJG2
Views:1480
Downloads:52
Metadata:XML RDF-CHPDL DC-XML DC-RDF
Categories:Misc.
:
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:AddThis
AddThis uses cookies that require your consent. Edit consent...

Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:International journal of education and information technologies
Shortened title:Int. j. educ. inf. technol.
Publisher:WSEAS
ISSN:2074-1316
COBISS.SI-ID:23247143 New window

Secondary language

Language:English
Keywords:procesiranje naravnih jezikov, tekstovno procesiranje, detekcija plagiatov, slovenski jezik


Comments

Leave comment

You have to log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica