Title:SEGMENTACIJA BESEDIL DIPLOMSKIH NALOG IZ DIGITALNE KNJIŽNICE UNIVERZE V MARIBORU
Authors:Žerdin, Marcel (Author)
Ojsteršek, Milan (Mentor)
Kežmah, Boštjan (Co-mentor)
Files:VS_Zerdin_Marcel_2011.pdf (2.29 MB)
 
Language:Slovenian
Work type:Bachelor thesis/paper (mb11)
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:Diplomsko delo zajema predstavitev načrtovanja in implementacije programske rešitve za segmentiranje diplomskih del iz Digitalne knjižnice Univerze v Mariboru (DKUM). V delu smo najprej opisali področje procesiranja naravnega jezika in ujemanja vzorcev. Zatem smo opisali programsko rešitev. Predstavili smo postopek pridobitve čistega teksta iz dokumentov PDF, nato analizo zgradbe diplomskih nalog in njihovo segmentiranje. Podali smo tudi opis razvojnega okolja ter opisali težave in omejitve, na katere smo naleteli med razvojem programske rešitve. V zaključku smo podali nekaj sklepnih misli o rezultatih in možnostih nadaljnjega dela.
Keywords:segmentiranje besedila, procesiranje naravnega jezika, ujemanje vzorcev, regularni izrazi
Year of publishing:2011
Publisher:[M. Žerdin]
Source:Maribor
UDC:004.45:004.5(043.2)
COBISS_ID:15601430
NUK URN:URN:SI:UM:DK:FAHREQ4Y
Views:1785
Downloads:146
Categories:KTFMB - FERI

Secondary language

Language:English
Title:TEXT SEGMENTATION OF DIPLOMA WORKS FROM DIGITAL LIBRARY OF THE UNIVERSITY OF MARIBOR
Abstract:This thesis presents the design and implementation of a software solution for segmenting diploma theses available from the Digital Library of the University of Maribor (DKUM). First, we describe the theoretical background, which covers natural language processing and pattern matching. We then turn to the practical side of the solution, presenting the extraction of plain text from PDF documents, the analysis of thesis structure, and the segmentation itself. This part also describes the development environment and the problems and limitations we encountered during development. We conclude with some final thoughts on the results and on possible future work.
Keywords:text segmentation, natural language processing, pattern matching, regular expressions

