| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document Help

Title:Online speech/music segmentation based on the variance mean of filter bank energy
Authors:ID Kos, Marko (Author)
ID Grašič, Matej (Author)
ID Kačič, Zdravko (Author)
Files:.pdf EURASIP_Journal_on_Advances_in_Signal_Processing_2009_Kos,_Grasic,_Kacic_Online_SpeechMusic_Segmentation_Based_on_the_Variance_Mean_of_F.pdf (1,49 MB)
MD5: 8B5048DCB865A849DDEFACB1DACE6F0B
 
URL https://asp-eurasipjournals.springeropen.com/articles/10.1155/2009/628570
 
Language:English
Work type:Scientific work
Typology:1.01 - Original Scientific Article
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:This paper presents a novel feature for online speech/music segmentation basedon the variance mean of filter bank energy (VMFBE). The idea that encouraged the feature's construction is energy variation in a narrow frequency sub-band. The energy varies more rapidly, and to a greater extent for speech than for music. Therefore, an energy variance in such a sub-band isgreater for speech than for music. The radio broadcast database and the BNSIbroadcast news database were used for feature discrimination and segmentation ability evaluation. The calculation procedure of the VMFBE feature has 4 out of 6 steps in common with the MFCC feature calculation procedure. Therefore, it is a very convenient speech/music discriminator for use in real-time automatic speech recognition systems based on MFCC features, because valuable processing time can be saved, and computation load is only slightly increased. Analysis of the feature's speech/music discriminative ability shows an average error rate below 10% for radio broadcast material and it outperforms other features used for comparison, by more than 8%. The proposed feature as a stand-alone speech/music discriminator in a segmentation system achieves an overall accuracy of over 94% on radio broadcast material.
Keywords:online speech segmentation, algorithm, speech techniques
Publication status:Published
Publication version:Version of Record
Year of publishing:2009
Number of pages:str. 1-13
Numbering:Letn. 2009
PID:20.500.12556/DKUM-66441 New window
ISSN:1687-6172
UDC:004.9
ISSN on article:1687-6172
COBISS.SI-ID:13644822 New window
DOI:10.1155/2009/628570 New window
NUK URN:URN:SI:UM:DK:GYTEKQ8Z
Publication date in DKUM:26.06.2017
Views:1337
Downloads:449
Metadata:XML DC-XML DC-RDF
Categories:Misc.
:
Copy citation
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:EURASIP Journal on Advances in Signal Processing
Shortened title:EURASIP J. Adv. Signal Process.
Publisher:Springer
ISSN:1687-6172
COBISS.SI-ID:5849428 New window

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:26.06.2017

Secondary language

Language:Slovenian
Keywords:online govorne segmentacije, parametri govora, tehnike govora


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica