Applied Speech and Audio Processing: With Matlab Examples

McLoughlin, Ian 2009

Applied Speech and Audio Processing is a MATLAB-based, one-stop resource that blends speech and hearing research in describing the key techniques of speech and audio processing. This practically oriented text provides MATLAB examples throughout to illustrate the concepts discussed and to give the reader hands-on experience with important techniques. Chapters on basic audio processing and the characteristics of speech and hearing lay the foundations of speech signal processing, which are built upon in subsequent sections explaining audio handling, coding, compression, and analysis techniques. The final chapter explores a number of advanced topics that use these techniques, including psychoacoustic modelling, a subject which underpins MP3 and related audio formats. With its hands-on nature and numerous MATLAB examples, this book is ideal for graduate students and practitioners working with speech or audio systems.


Why Read This Book

You will get a hands-on, example-rich introduction to speech and audio processing with MATLAB code you can run and adapt. The book links fundamentals (spectral analysis, STFT) to application areas such as feature extraction, perceptual coding and audio enhancement, so you can prototype and evaluate real algorithms quickly.

Who Will Benefit

Undergraduate and graduate students, and practicing engineers who build and prototype speech or audio signal-processing algorithms and want MATLAB-based, application-focused guidance.

Level: Intermediate

Prerequisites: Basic signals & systems and digital signal processing concepts (Fourier transform, sampling, filtering), plus basic proficiency in MATLAB.

Key Takeaways

  • Implement time- and frequency-domain analysis (STFT, spectrograms, windowing) in MATLAB
  • Extract common speech/audio features such as LPC coefficients and MFCCs and use them for analysis
  • Detect pitch and perform formant and temporal analysis of speech signals
  • Apply psychoacoustic concepts to simple perceptual audio coding and understand MP3-related ideas
  • Prototype basic audio coding/compression and evaluate perceptual quality
  • Design and test basic speech enhancement and audio-processing pipelines using MATLAB

Topics Covered

  1. Introduction and MATLAB basics for audio
  2. Physical and perceptual characteristics of sound and hearing
  3. Digital audio handling: sampling, quantization, and file formats
  4. Time-domain and frequency-domain analysis; DFT and windowing
  5. Short-time Fourier transform, spectrograms and time-frequency displays
  6. Speech production, acoustic phonetics, and voiced/unvoiced models
  7. Feature extraction: LPC, cepstrum, MFCCs, and pitch detection
  8. Audio coding and compression basics; perceptual coding principles
  9. Psychoacoustic modeling and its role in perceptual audio coding
  10. Audio enhancement, noise reduction and simple source separation
  11. Advanced topics and case studies; MATLAB examples and exercises
  12. Appendices: MATLAB code listings, datasets and implementation notes

Languages, Platforms & Tools

  • MATLAB
  • MATLAB Signal Processing Toolbox (assumed)
  • MATLAB Audio Toolbox (relevant examples)
  • Standard audio file formats (WAV, MP3 for examples)

How It Compares

More application- and MATLAB-oriented than classic theory texts like Rabiner & Schafer or Gold & Morgan; less depth in formal theory but much more hands-on prototyping material.
