Speech enhancement using a minimum mean-square error short-time spectral modulation magnitude estimator

Publication Type:Journal Article
Year of Publication:2012
Authors:Paliwal, Schwerin, Wójcicki
Journal:Speech Communication
Volume:54
Issue:2
Pagination:282 - 305
Date Published:Jan-02-2012
ISSN:01676393
Keywords:Analysis-modification-synthesis (AMS), MMSE short-time modulation magnitude estimator (MME), MMSE short-time spectral magnitude estimator (AME), Modulation domain, Modulation magnitude spectrum, Modulation spectrum, Speech enhancement
Abstract:

In this paper we investigate the enhancement of speech by applying MMSE short-time spectral magnitude estimation in the modu- lation domain. For this purpose, the traditional analysis-modification-synthesis framework is extended to include modulation domain processing. We compensate the noisy modulation spectrum for additive noise distortion by applying the MMSE short-time spectral mag- nitude estimation algorithm in the modulation domain. A number of subjective experiments were conducted. Initially, we determine the parameter values that maximise the subjective quality of stimuli enhanced using the MMSE modulation magnitude estimator. Next, we compare the quality of stimuli processed by the MMSE modulation magnitude estimator to those processed using the MMSE acoustic magnitude estimator and the modulation spectral subtraction method, and show that good improvement in speech quality is achieved through use of the proposed approach. Then we evaluate the effect of including speech presence uncertainty and log-domain processing on the quality of enhanced speech, and find that this method works better with speech uncertainty. Finally we compare the quality of speech enhanced using the MMSE modulation magnitude estimator (when used with speech presence uncertainty) with that enhanced using different acoustic domain MMSE magnitude estimator formulations, and those enhanced using different modulation domain based enhancement algorithms. Results of these tests show that the MMSE modulation magnitude estimator improves the quality of processed stimuli, without introducing musical noise or spectral smearing distortion. The proposed method is shown to have better noise suppres- sion than MMSE acoustic magnitude estimation, and improved speech quality compared to other modulation domain based enhance- ment methods considered.

URL:http://linkinghub.elsevier.com/retrieve/pii/S016763931100135X
DOI:10.1016/j.specom.2011.09.003
Short Title:Speech Communication
BioAcoustica ID: 
Non biological: 
Scratchpads developed and conceived by (alphabetical): Ed Baker, Katherine Bouton Alice Heaton Dimitris Koureas, Laurence Livermore, Dave Roberts, Simon Rycroft, Ben Scott, Vince Smith