| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document

Title:PRIMERJAVA RAZLIČNIH ALGORITMOV ZA DOLOČANJE KOLOKACIJ MED BESEDAMI
Authors:Brodnjak, Dejan (Author)
Ojsteršek, Milan (Mentor) More about this mentor... New window
Files:.pdf VS_Brodnjak_Dejan_2011.pdf (1,48 MB)
 
Language:Slovenian
Work type:Bachelor thesis/paper (mb11)
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:Kolokacije so besedne zveze, ki se v besedilih pojavljajo pogosteje kot bi se po naključju. V diplomskem delu bomo spoznali njihov pomen in uporabo pri procesiranju besedil v slovenskem jeziku. Pogledali si bomo tudi korpus jos1M, ki ga bomo uporabljali kot vhod v algoritme za določanje kolokacij. Implementirali bomo dva algoritma za določanje kolokacij (frekvenčni in razpršeni). Z morfološkim filtriranjem bomo izrazili kolokacije. Na koncu bomo algoritma primerjali.
Keywords:procesiranje naravnega jezika, kolokacije, jos1M korpus
Year of publishing:2011
Publisher:[D. Brodnjak]
Source:Maribor
UDC:004.934.1'1(043.2)
COBISS_ID:15600662 Link is opened in a new window
NUK URN:URN:SI:UM:DK:APWULQAI
Views:1157
Downloads:77
Metadata:XML RDF-CHPDL DC-XML DC-RDF
Categories:KTFMB - FERI
:
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:AddThis
AddThis uses cookies that require your consent. Edit consent...

Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Secondary language

Language:English
Title:EVALUATION OF DIFFERENT ALGORITHMS FOR COLOCATION DETERMINATION
Abstract:Collocation defines a sequence of words or terms thatco-occure more often that would be expected by chance. We will explain a meaning of collocations and their usage in processing of Slovenian text. We also describe jos1M corpus which is used for input intoalgorithms for determining of collocations. Two different algorithms for the determination of collocations (frequency and the Mean and Variance algorithm) is implemented and compared in the practical part of the thesis.
Keywords:natural language processing, collocation, jos1M corpus


Comments

Leave comment

You have to log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica