Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR

**study tips** · 08-05-2013, 03:23 PM

Estimators of the Magnitude-Squared Spectrum and Methods for Incorporating SNR Uncertainty

.pdf

Estimators of the Magnitude.pdf (Size: 1.85 MB / Downloads: 44)

Abstract

Statistical estimators of the magnitude-squared
spectrum are derived based on the assumption that the magnitude-
squared spectrum of the noisy speech signal can be computed
as the sum of the (clean) signal and noise magnitude-squared
spectra. Maximum a posterior (MAP) and minimum mean square
error (MMSE) estimators are derived based on a Gaussian statistical
model. The gain function of the MAP estimator was found
to be identical to the gain function used in the ideal binary mask
(IdBM) that is widely used in computational auditory scene analysis
(CASA). As such, it was binary and assumed the value of 1 if
the local signal-to-noise ratio (SNR) exceeded 0 dB, and assumed
the value of 0 otherwise. By modeling the local instantaneous SNR
as an F-distributed random variable, soft masking methods were
derived incorporating SNR uncertainty. The soft masking method,
in particular, which weighted the noisy magnitude-squared spectrum
by the a priori probability that the local SNR exceeds 0 dB
was shown to be identical to the Wiener gain function. Results
indicated that the proposed estimators yielded significantly better
speech quality than the conventional minimum mean square error
spectral power estimators, in terms of yielding lower residual
noise and lower speech distortion.

INTRODUCTION

ANUMBER of estimators of the signal magnitude spectrum
have been proposed for speech enhancement (see
review in [1, Ch. 7]). The minimum mean square error (MMSE)
estimators [2], [3] of the magnitude spectrum, in particular, have
been found to perform consistently well, in terms of speech
quality, in a number of noisy conditions [4]. Several MMSE
estimators of the power spectrum [5]–[7] or more general the
th-power magnitude spectrum [8] have also been proposed. In
some applications such as speech coding [6], where the autocorrelation
coefficients might be needed, the optimal power-spectrum
estimator might be more useful than the magnitude estimator.

Maximum a Posterior (MAP) Estimator

The a posterior probability density (14) function is monotonic,
and when (expressed in dB) changes its sign, the density
changes its direction (increasing versus decreasing). This
simplifies the maximization a great deal.

Soft Masking by Incorporating a Priori SNR Uncertainty

Assuming independence between the clean speech and noise
magnitude-squared spectra, we can easily use (12) and (13) to
model the hypothesis probability given the a priori SNR . As
we do not use any other constraint or assumption, we refer to
this hypothesis probability as the a priori SNR uncertainty.

Soft Masking Based on Posteriori SNR Uncertainty

Clearly the above SMPR estimator did not incorporate information
about the noisy observations, as it relied solely on
a priori information about the instantaneous SNR . It is reasonable
to expect that a better estimator could be developed by
incorporating posteriori information about the SNR at each frequency
bin. In this case, we incorporate the assumption given in
(11) to compute the hypothesis probability, which is referred to
as a posteriori SNR uncertainty.

CONCLUSION

Statistical estimators of the magnitude-squared spectrum
were derived based on the assumption that the magnitude-
squared spectrum of the noisy speech signal can be
computed as the sum of the clean signal and noise magnitude-
squared spectrum. Aside from the two traditional
estimators, based on MAP and MMSE principles, two additional
soft masking methods were derived incorporating
SNR uncertainty. Overall, when compared to the conventional
MMSE spectral power estimators [6], [7]

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Investigation and Analysis of Inception Voltage and Field Distribution	seminar presentation	1	25,638,320	22-09-2017, 11:06 AM Last Post: jaseela123
	Spread Spectrum ppt	seminar post	1	1,108	19-09-2017, 03:41 PM Last Post: jaseela123
	Wind Energy and Production of Hydrogen and Electricity — Opportunities for Renewable	seminar tips	1	1,087	06-09-2017, 11:19 AM Last Post: jaseela123
	dynamic spectrum management	seminar tips	1	778	31-08-2017, 01:32 PM Last Post: jaseela123
	DESIGN AND PERFORMANCE ANALYSIS OF CSLA AND CLAA FOR 32-BIT UNSIGNED MULTIPLIER	dhanabhagya	0	560	13-02-2016, 04:29 PM Last Post: dhanabhagya
	DATA LOGGING TO COLLECT AND DISPLAY TEMPERATURE WITH TIME AND DAY	dhanabhagya	0	451	11-02-2016, 03:49 PM Last Post: dhanabhagya
	Energy Saving and Advanced Hybrid, Battery Electric, and Fuel Cell Vehicles	dhanabhagya	0	647	02-01-2016, 12:52 PM Last Post: dhanabhagya
	MODELING POWER SYSTEM LOAD USING INTELLIGENT METHODS SEMINAR REPORT	seminar code	0	544	04-09-2014, 11:19 AM Last Post: seminar code
	Implementation of 10T- SRAMs in 45-nm for Fast and Low power and comparing	seminar projects maker	0	514	12-06-2014, 10:40 AM Last Post: seminar projects maker
	Study the characteristics of 65nm STRAINED SILICON PMOS transistor incorporating	seminar projects maker	0	557	19-05-2014, 04:05 PM Last Post: seminar projects maker

Quick Reply
Message Type your reply to this message here. Disable Smilies	You have selected one or more posts to quote. Quote these posts now or deselect them.