speechcoding | regading PWI-ZFE speech coder

hi, i completed my M.tech in DSP. I did my project on " Implementation of Mixed Multi-Band PrototypeWaveform Interpolation-Zinc Function Excitation (MMBE-PWI-ZFE) Speech coder. here i used Zinc pulses i used for reconstructing the speech signal at the decoder instead of normal Dirac delta pulses". Here I got one doubt, " why can't we use pureSinc or Cosc signal". and I am giving the full details of ZINC function.please clarrify it . Zinc Basis Functions [Sukkar89] suggest that a signal representation based on "orthogonal function decomposition" can be done via a set of basis functions that have similar characteristics to the signal being modeled. The "zinc basis functions" are proposed to model LPC excitation signals (that are inherently band-limited and pulse-type). The zinc function is defined as: Z(t) = A Sinc(t) + B Cosc(t), where Sinc(t) = [sin(2*pi*fc*t)]/(2*pi*fc*t) and and Cosc(t)= [1 - cos(2*pi*fc*t)]/(2*pi*fc*t) here A,B and fc (cut-off frequency) are constants. Time shifted zinc functions: Z(t-k) = A Sinc(t-k) + B Cosc(t-k). Each Zinc function is itself composed of orthogonal functions Sinc(t-k) and Cosc(t-k), for any value of k. It can be proved the orthogonal set of (weighted) zinc functions can fully span the space of all band-limited signals. r(t) = sum{Z(t)} = sum { A Sinc(t-nT) + B Cosc(t-nT) }, where k=nT. An upsampled residual y(n) is computed. The goal is to optimally represent y(n) with a finite order zinc function model yz(n). A mean sq error L between y(n) and yz(n) is minimized to compute the parameters A and B giving an optimal p-order model corresponding to the P smallest values of L. It has been shown that for the same order, the zinc function model is superior to the Fourier series model (for voiced signals). Unvoiced signals are represented using white noise source. Zinc funtions are shown to closely model the perceptually important pitch pulses with a relatively low-order model. Several techniques are proposed to minimize the "roughness" heard in the synthetic speech, due to secondary pitch pulses in the excitation. thanks in advance. ravichandra Win TVs, Bikes, DVD players and more!Click onYahoo! India Promos

Hi ravi, One simple explanation of using both sinc and cosc functions to model a signal is that these two functions are respectively evn and odd function. Thus, sinc function is more suitable to model the even part of the signal and cosc the odd part. Using only one of them leads to the use of a much higher order to reach the same approximation in case when you use both functions. BTW, is it possible to have a copy of your project report ? Best regards

regading PWI-ZFE speech coder

Sign in

You might also like...

Search forums

Free PDF Downloads

Blogs - Hall of Fame

Discussion Groups

Quick Links

About DSPRelated.com

Social Networks

The Related Media Group