Sign in

username:

password:



Not a member?

Search Online Books



Search tips

Free Online Books



Chapters

Chapter Contents:

Search Spectral Audio Signal Processing

  

Book Index | Global Index


Would you like to be notified by email when Julius Orion Smith III publishes a new entry into his blog?

  

Cepstral Smoothing

The spectral envelope obtained by cepstral smoothing is defined as

$\displaystyle Y_m = \hbox{\sc DFT}[w \cdot \underbrace{\hbox{\sc DFT}^{-1}\log(X_m)}_{\hbox{complex cepstrum}}]
$

where $ w$ is a lowpass window in the cepstral domain, e.g.,

$\displaystyle w(n) = \left\{\begin{array}{ll}
1, & \vert n\vert< n_c \\ [5pt]
0.5, & \vert n\vert=n_c \\ [5pt]
0, & \vert n\vert>n_c \\
\end{array}\right.
$

  • The log-spectrum of $ X_m$ is thus lowpass filtered (the complex cepstrum of $ x$ is ``liftered'') to obtain a smooth spectral envelope.
  • For voice, $ n_c$ should be set below the pitch period in samples.
  • Cepstral coefficients are typically used in speech recognition.
  • For best results, use an audio-warped spectral axis, such as the ERB scale (Appendix E) or its early ancestor, the Mel frequency scale. Mel Frequency Cepstral Coefficients (MFCC) remain standard in speech recognition front ends.


Order a Hardcopy of Spectral Audio Signal Processing

Previous: Spectral Envelope Extraction
Next: Linear Prediction Spectral Envelope

written by Julius Orion Smith III
Julius Smith's background is in electrical engineering (BS Rice 1975, PhD Stanford 1983). He is presently Professor of Music and Associate Professor (by courtesy) of Electrical Engineering at Stanford's Center for Computer Research in Music and Acoustics (CCRMA), teaching courses and pursuing research related to signal processing applied to music and audio systems. See http://ccrma.stanford.edu/~jos/ for details.


Comments


No comments yet for this page


Add a Comment
You need to login before you can post a comment (best way to prevent spam). ( Not a member? )