Title:PRIMERJAVA RAZLIČNIH ALGORITMOV ZA DOLOČANJE KOLOKACIJ MED BESEDAMI
Authors:ID Brodnjak, Dejan (Author)
ID Ojsteršek, Milan (Mentor)
Files:VS_Brodnjak_Dejan_2011.pdf (1,48 MB)
MD5: D18FE8D84042E12C4F8B37D169D86D71
PID: 20.500.12556/dkum/95239835-296c-4918-bc21-58c4d87f2783
 
Language:Slovenian
Work type:Bachelor thesis/paper
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:Collocations are word combinations that occur in texts more often than they would by chance. In this thesis we examine their meaning and their use in processing Slovenian-language text. We also look at the jos1M corpus, which serves as input to the algorithms for determining collocations. We implement two algorithms for determining collocations (a frequency-based one and a dispersion-based one). We use morphological filtering to extract the collocations. Finally, we compare the two algorithms.
Keywords:natural language processing, collocations, jos1M corpus
Place of publishing:Maribor
Publisher:[D. Brodnjak]
Year of publishing:2011
PID:20.500.12556/DKUM-20646
UDC:004.934.1'1(043.2)
COBISS.SI-ID:15600662
NUK URN:URN:SI:UM:DK:APWULQAI
Publication date in DKUM:30.09.2011
Views:1991
Downloads:149
Metadata:XML DC-XML DC-RDF
Categories:KTFMB - FERI

Secondary language

Language:English
Title:EVALUATION OF DIFFERENT ALGORITHMS FOR COLOCATION DETERMINATION
Abstract:Collocations are sequences of words or terms that co-occur more often than would be expected by chance. We explain the meaning of collocations and their use in processing Slovenian text. We also describe the jos1M corpus, which is used as input to the algorithms for determining collocations. Two different algorithms for determining collocations (the frequency algorithm and the Mean and Variance algorithm) are implemented and compared in the practical part of the thesis.
Keywords:natural language processing, collocation, jos1M corpus
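
The two approaches named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the thesis: the function names, the toy English sentences, and the one-letter tags (A, N, V, P, S, D) are assumptions made for illustration, and they are not the jos1M tagset. The first function counts adjacent word pairs whose tag pattern passes a filter, which is one common reading of a frequency algorithm with morphological filtering. The second computes the mean and sample standard deviation of the signed distance between two words inside a window, which is the usual textbook form of a mean and variance method; a low deviation suggests a fixed relative position.

```python
from collections import Counter, defaultdict
from statistics import mean, stdev


def frequent_bigrams(sentences, allowed_patterns):
    """Count adjacent word pairs whose (tag1, tag2) pattern is allowed.

    sentences: list of sentences, each a list of (word, tag) tuples.
    allowed_patterns: set of (tag1, tag2) tuples that pass the filter.
    Returns (pair, count) tuples sorted by descending count.
    """
    counts = Counter()
    for sent in sentences:
        for (w1, t1), (w2, t2) in zip(sent, sent[1:]):
            if (t1, t2) in allowed_patterns:
                counts[(w1, w2)] += 1
    return counts.most_common()


def offset_stats(sentences, window=3, min_count=2):
    """Mean and sample standard deviation of signed offsets per word pair.

    For every word, each other word at most `window` positions away in the
    same sentence contributes one signed offset (negative means "before").
    Pairs seen fewer than `min_count` times are dropped, because a sample
    deviation needs at least two observations.
    """
    offsets = defaultdict(list)
    for sent in sentences:
        words = [word for word, _tag in sent]
        for i, w1 in enumerate(words):
            lo = max(0, i - window)
            hi = min(len(words), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    offsets[(w1, words[j])].append(j - i)
    return {
        pair: (mean(ds), stdev(ds))
        for pair, ds in offsets.items()
        if len(ds) >= min_count
    }


# Hypothetical toy data; the tags are illustrative placeholders only.
sentences = [
    [("she", "P"), ("knocked", "V"), ("on", "S"), ("the", "D"), ("door", "N")],
    [("he", "P"), ("knocked", "V"), ("at", "S"), ("the", "D"), ("door", "N")],
    [("strong", "A"), ("tea", "N")],
    [("strong", "A"), ("tea", "N"), ("helps", "V")],
]

if __name__ == "__main__":
    print(frequent_bigrams(sentences, {("A", "N")}))
    print(offset_stats(sentences)[("knocked", "door")])
```

In this sketch the frequency count only sees adjacent pairs, while the offset statistics can also pick up pairs such as "knocked ... door" that are separated by other words, which is the kind of difference a comparison of the two algorithms would examine.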

