Title: | Online speech/music segmentation based on the variance mean of filter bank energy |
---|
Authors: | ID Kos, Marko (Author) ID Grašič, Matej (Author) ID Kačič, Zdravko (Author) |
Files: | EURASIP_Journal_on_Advances_in_Signal_Processing_2009_Kos,_Grasic,_Kacic_Online_SpeechMusic_Segmentation_Based_on_the_Variance_Mean_of_F.pdf (1,49 MB) MD5: 8B5048DCB865A849DDEFACB1DACE6F0B
https://asp-eurasipjournals.springeropen.com/articles/10.1155/2009/628570
|
---|
Language: | English |
---|
Work type: | Scientific work |
---|
Typology: | 1.01 - Original Scientific Article |
---|
Organization: | FERI - Faculty of Electrical Engineering and Computer Science
|
---|
Abstract: | This paper presents a novel feature for online speech/music segmentation basedon the variance mean of filter bank energy (VMFBE). The idea that encouraged the feature's construction is energy variation in a narrow frequency sub-band. The energy varies more rapidly, and to a greater extent for speech than for music. Therefore, an energy variance in such a sub-band isgreater for speech than for music. The radio broadcast database and the BNSIbroadcast news database were used for feature discrimination and segmentation ability evaluation. The calculation procedure of the VMFBE feature has 4 out of 6 steps in common with the MFCC feature calculation procedure. Therefore, it is a very convenient speech/music discriminator for use in real-time automatic speech recognition systems based on MFCC features, because valuable processing time can be saved, and computation load is only slightly increased. Analysis of the feature's speech/music discriminative ability shows an average error rate below 10% for radio broadcast material and it outperforms other features used for comparison, by more than 8%. The proposed feature as a stand-alone speech/music discriminator in a segmentation system achieves an overall accuracy of over 94% on radio broadcast material. |
---|
Keywords: | online speech segmentation, algorithm, speech techniques |
---|
Publication status: | Published |
---|
Publication version: | Version of Record |
---|
Year of publishing: | 2009 |
---|
Number of pages: | str. 1-13 |
---|
Numbering: | Letn. 2009 |
---|
PID: | 20.500.12556/DKUM-66441 |
---|
ISSN: | 1687-6172 |
---|
UDC: | 004.9 |
---|
ISSN on article: | 1687-6172 |
---|
COBISS.SI-ID: | 13644822 |
---|
DOI: | 10.1155/2009/628570 |
---|
NUK URN: | URN:SI:UM:DK:GYTEKQ8Z |
---|
Publication date in DKUM: | 26.06.2017 |
---|
Views: | 1337 |
---|
Downloads: | 449 |
---|
Metadata: | |
---|
Categories: | Misc.
|
---|
:
|
Copy citation |
---|
| | | Average score: | (0 votes) |
---|
Your score: | Voting is allowed only for logged in users. |
---|
Share: | |
---|
Hover the mouse pointer over a document title to show the abstract or click
on the title to get all document metadata. |