Search the digital library catalog

1 - 10 / 58
1.
Influence of highly inflected word forms and acoustic background on the robustness of automatic speech recognition for human–computer interaction
Andrej Žgank, 2022, original scientific article

Description: Automatic speech recognition is essential for establishing natural communication with a human–computer interface. Speech recognition accuracy strongly depends on the complexity of language. Highly inflected word forms are a type of unit present in some languages. The acoustic background presents an additional important degradation factor influencing speech recognition accuracy. While the acoustic background has been studied extensively, the highly inflected word forms and their combined influence still present a major research challenge. Thus, a novel type of analysis is proposed, where a dedicated speech database comprised solely of highly inflected word forms is constructed and used for tests. Dedicated test sets with various acoustic backgrounds were generated and evaluated with the Slovenian UMB BN speech recognition system. The baseline word accuracy of 93.88% and 98.53% was reduced to as low as 23.58% and 15.14% for the various acoustic backgrounds. The analysis shows that the word accuracy degradation depends on and changes with the acoustic background type and level. The highly inflected word forms’ test sets without background decreased word accuracy from 93.3% to only 63.3% in the worst case. The impact of highly inflected word forms on speech recognition accuracy was reduced with the increased levels of acoustic background and was, in these cases, similar to the non-highly inflected test sets. The results indicate that alternative methods in constructing speech databases, particularly for the low-resourced Slovenian language, could be beneficial.
Keywords: human–computer interaction, automatic speech recognition, acoustic modeling, highly inflected word forms, acoustic background
Published in DKUM: 28.03.2025; Views: 0; Downloads: 2
.pdf Full text (1.12 MB)
This item has multiple files.

2.
Strategies for managing time and costs in speech corpus creation : insights from the Slovenian ARTUR corpus
Darinka Verdonik, Andreja Bizjak, Andrej Žgank, Mirjam Sepesy Maučec, Mitja Trojar, Jerneja Žganec Gros, Marko Bajec, Iztok Lebar Bajec, Simon Dobrišek, 2024, original scientific article

Description: Parliamentary debates represent an essential part of democratic discourse and provide insights into various socio-demographic and linguistic phenomena. Parliamentary corpora, which contain transcripts of parliamentary debates and extensive metadata, are an important resource for parliamentary discourse analysis and other research areas. This paper presents the Slovenian parliamentary corpus siParl, the latest version of which contains transcripts of plenary sessions and other legislative bodies of the Assembly of the Republic of Slovenia from 1990 to 2022, comprising more than 1 million speeches and 210 million words. We outline the development history of the corpus and also mention other initiatives that have been influenced by siParl (such as the Parla-CLARIN encoding and the ParlaMint corpora of European parliaments), present the corpus creation process, ranging from the initial data collection to the structural development and encoding of the corpus, and, given the growing influence of the ParlaMint corpora, compare siParl with the Slovenian ParlaMint-SI corpus. Finally, we discuss updates for the next version as well as the long-term development and enrichment of the siParl corpus.
Keywords: recording speech, transcribing speech, transcription guidelines, less-resourced language
Published in DKUM: 04.02.2025; Views: 0; Downloads: 8
.pdf Full text (1.09 MB)
This item has multiple files.

3.
Uvod v telekomunikacije : seminarske vaje [Introduction to Telecommunications: Seminar Exercises]
Andrej Žgank, 2023, other educational material

Keywords: telecommunications, introduction to signals, probability theory, signal analysis, exercises
Published in DKUM: 14.12.2023; Views: 520; Downloads: 34
.pdf Full text (741.26 KB)

4.
Acoustic Gender and Age Classification as an Aid to Human–Computer Interaction in a Smart Home Environment
Damjan Vlaj, Andrej Žgank, 2023, original scientific article

Description: The advanced smart home environment presents an important trend for the future of human wellbeing. One of the prerequisites for applying its rich functionality is the ability to differentiate between various user categories, such as gender, age, speakers, etc. We propose a model for an efficient acoustic gender and age classification system for human–computer interaction in a smart home. The objective was to improve acoustic classification without using high-complexity feature extraction. This was realized with pitch as an additional feature, combined with additional acoustic modeling approaches. In the first step, the classification is based on Gaussian mixture models. In the second step, two new procedures are introduced for gender and age classification. The first is based on the count of the frames with the speaker’s pitch values, and the second is based on the sum of the frames with pitch values belonging to a certain speaker. Since both procedures are based on pitch values, we have proposed a new, effective algorithm for pitch value calculation. In order to improve gender and age classification, we also incorporated speech segmentation with the proposed voice activity detection algorithm. We also propose a procedure that enables the quick adaptation of the classification algorithm to frequent smart home users. The proposed classification model with pitch values has improved the results in comparison with the baseline system.
Keywords: acoustic classification, acoustic signal processing, Gaussian mixture model, pitch analysis, smart home
Published in DKUM: 11.12.2023; Views: 471; Downloads: 23
.pdf Full text (2.07 MB)
This item has multiple files.

5.
6.
Analiza vpliva emocij pri vrednotenju kakovosti govora : magistrsko delo [Analysis of the Influence of Emotions on Speech Quality Assessment: Master's Thesis]
Maja Črešnjovnjak, 2023, master's thesis

Description: The master's thesis analyzes the influence of emotions on speech quality assessment. It presents objective and subjective analyses of how emotional audio recordings and speech codec degradations affect speech quality assessment. Special attention is also given to VoIP and to the simulation of packet loss in the transmission channel. The simulations were carried out on the Slovenian speech database Interface. The objective speech quality assessment included the PESQ, NISQA, and VISQOL metrics.
Keywords: VoIP, objective speech quality analysis, subjective speech quality analysis, audio, communication channel degradations
Published in DKUM: 13.10.2023; Views: 299; Downloads: 65
.pdf Full text (2.79 MB)

7.
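Prenova oddaljenega dostopa do poslovalnic s tehnologijo dinamičnega večtočkovnega navideznega zasebnega omrežja : magistrsko delo [Redesign of Remote Access to Branch Offices Using Dynamic Multipoint Virtual Private Network Technology: Master's Thesis]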
Nina Ožir, 2022, master's thesis

Description: The master's thesis presents the redesign of secure network access from branch offices to the central location of a large company, using Cisco equipment. The existing network was implemented as a site-to-site virtual private network (S2S VPN). The new method of secure remote access for an ever-growing number of branch offices was implemented with a scalable technology that is easy to maintain. This is dynamic multipoint virtual private network technology, which is also presented in detail in the thesis. The result is a unified and secure network that ensures uninterrupted, high-quality operation.
Keywords: network security, remote access, virtual private network, DMVPN
Published in DKUM: 18.05.2022; Views: 772; Downloads: 87
.pdf Full text (3.84 MB)

8.
Spoken corpus Gos VideoLectures 4.0
Darinka Verdonik, Tomaž Potočnik, Mirjam Sepesy Maučec, Tomaž Erjavec, Simona Majhenič, Andrej Žgank, 2019, complete scientific database of research data

Published in DKUM: 09.07.2020; Views: 6067; Downloads: 15
URL Link to file

9.
Govorni, dialoški in multimodalni jezikovni viri : pregled stanja [Speech, Dialogue, and Multimodal Language Resources: Overview of the Current State]
Darinka Verdonik, Andrej Žgank, Simona Majhenič, Izidor Mlakar, 2020, treatise, preliminary study, study

Keywords: multimodal language resources, language resources
Published in DKUM: 13.05.2020; Views: 1249; Downloads: 89
.pdf Full text (364.75 KB)
This item has multiple files.

10.
Zaznavanje varnostnih groženj v komunikacijskih omrežjih in ukrepanje ob njih [Detecting Security Threats in Communication Networks and Responding to Them]
Aljaž Gaber, 2020, master's thesis

Description: In this master's thesis, we worked with the ATP solution from Trend Micro, DDI and DDA. To carry out the work, we studied the procedures for using DDI and DDA in detail. We first described the capabilities and functions of DDI and DDA. We then set up a suitable test system, monitored the threats that appear in communication networks, and studied their behavior in a sandbox. In addition, we compared the effects of different types of malicious code and analyzed the response procedures for detected security incidents. The final part of the thesis presents proposals for improving information security, which we defined using the results of handling threats with DDI and DDA.
Keywords: information security, advanced network protection approaches, information threats, sandbox
Published in DKUM: 24.02.2020; Views: 1322; Downloads: 242
.pdf Full text (4.95 MB)
