We should note here that some of the methods covered in this chapter
do not compensate for channel distortion, but are designed purely for
removing the noise signal
. This is appropriate for improving
quality when it is assumed the channel was designed properly for clean
speech; note that our ears are insensitive to small phase
distortions or global spectral shifts. On the other hand,
compensation of the channel may be critical to improve robustness in
ASR systems. For example, the simple act of changing the recording
microphone can drastically affect recognition accuracy.
Note that the model excludes multiple-microphone systems (
is a
scalar), which employ beam-forming [1,2] and
noise cancellation [3] techniques. Beam-formers
adapt the gains of a microphone array to place nulls at the noise
source, while noise-cancellers assume the availability of a separate
reference to the noise signal. Although these methods can be extremely
effective, they are restrictive in their requirements on the
availability and placement of multiple microphones. This chapter
considers only single-microphone methods.