Search Spectral Audio Signal Processing
Would you like to be notified by email when Julius Orion Smith III publishes a new entry into his blog?
For the channel vocoder, we only need to determine the amplitude of the signal. One technique is to apply an envelope follower to each subband. Specifically, this can be done by rectification and subsequent low pass filtering. This produces an approximation of the energy in each subband. This approach has been applied in parametric speech models.
In the case of the Phase Vocoder, we need to determine both the amplitude and the phase of the signal in each subband. Under the assumption of no more than one varying sinusoid in each subband, we can represent the signal in each channel as
In order to determine these signals, it is helpful to express the
channel signal
in its complex ``analytic'' representation. We
will denote this by
. Ideally, the imaginary part of the
analytic signal is obtained from its real part using the Hilbert
transform:
Practical Hilbert transformers were covered in the previous lecture.
Hence,
We normally work in practice with instantaneous frequency deviation instead of phase:
Note that
is a narrowband signal centered about the channel
frequency
. It is common to heterodyne the channel
output signals to ``base band'' by shifting its spectrum by
so as to center the channel bandwidth about zero. This is
accomplished by modulating the analytic signal by
to get
For each of the subbands, we get data which looks like the following:
Once we have data in this form, we can compress it
There are many inherent problems with this technique:
Note: in some phase-vocoder applications, such as time-scale-modification and pitch shifting, the instantaneous frequencies of the channel signals are not explicitly computed. We will return to this topic after we have introduced the Weighted Overlap-Add (WOLA) method for short-time Fourier analysis, modification, and resynthesis.
