Sökning: id:"swepub:oai:DiVA.org:kth-7235" >
HMM-based gain-mode...
-
Zhao, David YuhengKTH,Ljud- och bildbehandling
(författare)
HMM-based gain-modeling for enhancement of speech in noise
- Artikel/kapitelEngelska2007
Förlag, utgivningsår, omfång ...
Nummerbeteckningar
-
LIBRIS-ID:oai:DiVA.org:kth-7235
-
https://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-7235URI
-
https://doi.org/10.1109/TASL.2006.885256DOI
Kompletterande språkuppgifter
-
Språk:engelska
-
Sammanfattning på:engelska
Ingår i deldatabas
Klassifikation
-
Ämneskategori:ref swepub-contenttype
-
Ämneskategori:art swepub-publicationtype
Anmärkningar
-
QC 20100825
-
Accurate modeling and estimation of speech and noise gains facilitate good performance of speech. enhancement methods using data-driven prior models. In this paper, we propose a hidden Markov model (HMM)-based speech enhancement method using explicit gain modeling. Through the introduction of stochastic gain variables, energy variation in both speech and noise is explicitly modeled in a unified framework. The speech gain models the energy variations of the speech phones, typically due to differences in pronunciation and/or different vocalizations of individual speakers. The noise gain helps to improve the tracking of the time-varying energy of nonstationary noise. The expectationmaximization (EM) algorithm is used to perform offline estimation of the time-invariant model parameters. The time-varying model'parameters are estimated online using the recursive EM algorithm. The. proposed gain modeling techniques are applied to a novel Bayesian speech estimator, and the performance of the proposed enhancement method is evaluated through objective and subjective tests. The experimental results confirm the advantage of explicit gain modeling, particularly for nonstationary noise sources.
Ämnesord och genrebeteckningar
Biuppslag (personer, institutioner, konferenser, titlar ...)
-
Kleijn, BastiaanKTH,Ljud- och bildbehandling(Swepub:kth)u1bm0bvj
(författare)
-
KTHLjud- och bildbehandling
(creator_code:org_t)
Sammanhörande titlar
-
Ingår i:IEEE transactions on speech and audio processing15:3, s. 882-8921063-66761558-2353
Internetlänk
Hitta via bibliotek
Till lärosätets databas