| | SLO | ENG | Cookies and privacy

Bigger font | Smaller font

Show document Help

Title:A comprehensive noise robust speech parameterization algorithm using wavelet packet decomposition-based denoising and speech feature representation techniques
Authors:ID Kotnik, Bojan (Author)
ID Kačič, Zdravko (Author)
Files:.pdf EURASIP_Journal_on_Advances_in_Signal_Processing_2007_Kotnik,_Kacic_A_Comprehensive_Noise_Robust_Speech_Parameterization_Algorithm_Using.pdf (984,48 KB)
MD5: B11834830317136189B6F90E818F907E
 
URL https://asp-eurasipjournals.springeropen.com/articles/10.1155/2007/64102
 
Language:English
Work type:Scientific work
Typology:1.01 - Original Scientific Article
Organization:FERI - Faculty of Electrical Engineering and Computer Science
Abstract:This paper concerns the problem of automatic speech recognition in noise-intense and adverse environments. The main goal of the proposed work is the definition, implementation, and evaluation of a novel noise robust speech signal parameterization algorithm. The proposed procedure is based on time-frequency speech signal representation using wavelet packet decomposition. A new modified soft thresholding algorithm based on time-frequency adaptive threshold determination was developed to efficiently reduce the level of additive noise in the input noisy speech signal. A two-stage Gaussian mixture model (GMM)-based classifier was developed to perform speech/nonspeech as well as voiced/unvoiced classification. The adaptive topology of the wavelet packet decomposition tree based on voiced/unvoiced detection was introduced to separately analyze voiced and unvoiced segments of the speech signal. The main feature vector consists of a combination of log-root compressed wavelet packet parameters, and autoregressive parameters. The final output feature vector is produced using a two-staged feature vector postprocessing procedure. In the experimental framework, the noisy speech databases Aurora 2 and Aurora 3 were applied together with corresponding standardized acoustical model training/testing procedures. The automatic speech recognition performance achieved using the proposed noise robust speech parameterization procedure was compared to the standardized mel-frequency cepstral coefficient (MFCC) feature extraction procedures ETSI ES 201 108 and ETSI ES 202 050.
Keywords:speech parametrization, algorithm, speech techniques
Publication status:Published
Publication version:Version of Record
Year of publishing:2007
Number of pages:str. 1-20
Numbering:Letn. 2007
PID:20.500.12556/DKUM-66439 New window
ISSN:1687-6172
UDC:004.9
ISSN on article:1687-6172
COBISS.SI-ID:11385622 New window
DOI:10.1155/2007/64102 New window
NUK URN:URN:SI:UM:DK:IYT9PUUY
Publication date in DKUM:26.06.2017
Views:1707
Downloads:406
Metadata:XML DC-XML DC-RDF
Categories:Misc.
:
Copy citation
  
Average score:(0 votes)
Your score:Voting is allowed only for logged in users.
Share:Bookmark and Share


Hover the mouse pointer over a document title to show the abstract or click on the title to get all document metadata.

Record is a part of a journal

Title:EURASIP Journal on Advances in Signal Processing
Shortened title:EURASIP J. Adv. Signal Process.
Publisher:Springer
ISSN:1687-6172
COBISS.SI-ID:5849428 New window

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:26.06.2017

Secondary language

Language:Slovenian
Keywords:algoritmi, parametri govora, tehnike govora


Comments

Leave comment

You must log in to leave a comment.

Comments (0)
0 - 0 / 0
 
There are no comments!

Back
Logos of partners University of Maribor University of Ljubljana University of Primorska University of Nova Gorica