| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document Help

Title:SEGMENTACIJA BESEDIL DIPLOMSKIH NALOG IZ DIGITALNE KNJIŽNICE UNIVERZE V MARIBORU
Authors:ID Žerdin, Marcel (Author)
ID Ojsteršek, Milan (Mentor) More about this mentor... New window
ID Kežmah, Boštjan (Comentor)
Files:.pdf VS_Zerdin_Marcel_2011.pdf (2,29 MB)
MD5: 55DED23294258670BE15830F46703F74
PID: 20.500.12556/dkum/db284267-8cf3-4d48-93f5-596c7e0af44a
 
Language:Slovenian
Work type:Bachelor thesis/paper
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:Diplomsko delo zajema predstavitev načrtovanja in implementacije programske rešitve za segmentiranje diplomskih del iz Digitalne knjižnice Univerze v Mariboru (DKUM). V delu smo najprej opisali področje procesiranja naravnega jezika in ujemanja vzorcev. Zatem smo opisali programsko rešitev. Predstavili smo postopek pridobitve čistega teksta iz dokumentov PDF, nato analizo zgradbe diplomskih nalog in njihovo segmentiranje. Podali smo tudi opis razvojnega okolja ter opisali težave in omejitve, na katere smo naleteli med razvojem programske rešitve. V zaključku smo podali nekaj sklepnih misli o rezultatih in možnostih nadaljnjega dela.
Keywords:segmentiranje besedila, procesiranje naravnega jezika, ujemanje vzorcev, regularni izrazi
Place of publishing:Maribor
Publisher:[M. Žerdin]
Year of publishing:2011
PID:20.500.12556/DKUM-20567 New window
UDC:004.45:004.5(043.2)
COBISS.SI-ID:15601430 New window
NUK URN:URN:SI:UM:DK:FAHREQ4Y
Publication date in DKUM:23.09.2011
Views:2506
Downloads:204
Metadata:XML DC-XML DC-RDF
Categories:KTFMB - FERI
:
Copy citation
  
Average score:(1 vote)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Secondary language

Language:English
Title:TEXT SEGMENTATION OF DIPLOMA WORKS FROM DIGITAL LIBRARY OF THE UNIVERSITY OF MARIBOR
Abstract:This work presents the design and implementation details of the software solution for the segmentation of diplomas available from the digital library of the University of Maribor. First, we describe the theoretical background that covers the fields of natural language processing and pattern matching. Afterwards, we move to the practical aspect of our solution where we present the acquisition, structure analysis, and segmentation of diplomas. The description of development environment, problems, and restrictions that we came across during the process of development is also considered part of the practical aspect. In the conclusion we wrote some final thoughts on results and future work.
Keywords:text segmentation, natural language processing, pattern matching, regular expressions


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica