| | SLO | ENG | Piškotki in zasebnost

Večja pisava | Manjša pisava

Izpis gradiva Pomoč

Naslov:Online speech/music segmentation based on the variance mean of filter bank energy
Avtorji:ID Kos, Marko (Avtor)
ID Grašič, Matej (Avtor)
ID Kačič, Zdravko (Avtor)
Datoteke:.pdf EURASIP_Journal_on_Advances_in_Signal_Processing_2009_Kos,_Grasic,_Kacic_Online_SpeechMusic_Segmentation_Based_on_the_Variance_Mean_of_F.pdf (1,49 MB)
MD5: 8B5048DCB865A849DDEFACB1DACE6F0B
 
URL https://asp-eurasipjournals.springeropen.com/articles/10.1155/2009/628570
 
Jezik:Angleški jezik
Vrsta gradiva:Znanstveno delo
Tipologija:1.01 - Izvirni znanstveni članek
Organizacija:FERI - Fakulteta za elektrotehniko, računalništvo in informatiko
Opis:This paper presents a novel feature for online speech/music segmentation basedon the variance mean of filter bank energy (VMFBE). The idea that encouraged the feature's construction is energy variation in a narrow frequency sub-band. The energy varies more rapidly, and to a greater extent for speech than for music. Therefore, an energy variance in such a sub-band isgreater for speech than for music. The radio broadcast database and the BNSIbroadcast news database were used for feature discrimination and segmentation ability evaluation. The calculation procedure of the VMFBE feature has 4 out of 6 steps in common with the MFCC feature calculation procedure. Therefore, it is a very convenient speech/music discriminator for use in real-time automatic speech recognition systems based on MFCC features, because valuable processing time can be saved, and computation load is only slightly increased. Analysis of the feature's speech/music discriminative ability shows an average error rate below 10% for radio broadcast material and it outperforms other features used for comparison, by more than 8%. The proposed feature as a stand-alone speech/music discriminator in a segmentation system achieves an overall accuracy of over 94% on radio broadcast material.
Ključne besede:online speech segmentation, algorithm, speech techniques
Status publikacije:Objavljeno
Verzija publikacije:Objavljena publikacija
Leto izida:2009
Št. strani:str. 1-13
Številčenje:Letn. 2009
PID:20.500.12556/DKUM-66441 Novo okno
ISSN:1687-6172
UDK:004.9
COBISS.SI-ID:13644822 Novo okno
DOI:10.1155/2009/628570 Novo okno
ISSN pri članku:1687-6172
NUK URN:URN:SI:UM:DK:GYTEKQ8Z
Datum objave v DKUM:26.06.2017
Število ogledov:1337
Število prenosov:443
Metapodatki:XML DC-XML DC-RDF
Področja:Ostalo
:
Kopiraj citat
  
Skupna ocena:(0 glasov)
Vaša ocena:Ocenjevanje je dovoljeno samo prijavljenim uporabnikom.
Objavi na:Bookmark and Share


Postavite miškin kazalec na naslov za izpis povzetka. Klik na naslov izpiše podrobnosti ali sproži prenos.

Gradivo je del revije

Naslov:EURASIP Journal on Advances in Signal Processing
Skrajšan naslov:EURASIP J. Adv. Signal Process.
Založnik:Springer
ISSN:1687-6172
COBISS.SI-ID:5849428 Novo okno

Licence

Licenca:CC BY 4.0, Creative Commons Priznanje avtorstva 4.0 Mednarodna
Povezava:http://creativecommons.org/licenses/by/4.0/deed.sl
Opis:To je standardna licenca Creative Commons, ki daje uporabnikom največ možnosti za nadaljnjo uporabo dela, pri čemer morajo navesti avtorja.
Začetek licenciranja:26.06.2017

Sekundarni jezik

Jezik:Slovenski jezik
Ključne besede:online govorne segmentacije, parametri govora, tehnike govora


Komentarji

Dodaj komentar

Za komentiranje se morate prijaviti.

Komentarji (0)
0 - 0 / 0
 
Ni komentarjev!

Nazaj
Logotipi partnerjev Univerza v Mariboru Univerza v Ljubljani Univerza na Primorskem Univerza v Novi Gorici