I'm interested in speech recognition. One of current problems seems to be requiring a "good" acoustic environment. I have "gut feel" that appropriate filtering of input would be useful. That got me thinking of restrictions on filtering methods. It also got me interested in DSP. I start with a basic presupposition that however humans recognize speech is, in some sense, the "best" way. That leads me to believe that any filtering should be constant group delay -- ie linear phase. I have an idea of what might be a useful frequency response - 4 or 5 'humps' corresponding to formant frequencies. If implemented as a FIR filter can it be 'linear phase'. Looking at a pdf discussing various windows, I think individual frequency responses similar to shape of a "Blackman Window" would be optimal. The individual pass band peaks would probably be at least a 1/2 octave apart. what would be relative advantages of implementing: 1. by adding outputs of individual filters 2. a single filter with appropriate frequency response Separate topic -- how many issues have I missed? BTW -- Remember the number one rule of education: A paragraph in a book doesn't give you a license to stop thinking. (seen on WEB somewhere)
questions raised by reading and thinking with possibly missing background
Started by ●December 10, 2005
Reply by ●December 10, 20052005-12-10
Richard Owlett wrote: ...> I start with a basic presupposition that however humans recognize speech > is, in some sense, the "best" way. That leads me to believe that any > filtering should be constant group delay -- ie linear phase.I don't see the connection between "human" and "constant group delay". Is there an unexamined (or at least unexpressed) assumption involved?> I have an idea of what might be a useful frequency response - 4 or 5 > 'humps' corresponding to formant frequencies.Formant frequencies vary among individuals, and we are very sensitive to such variations. (I can distinguish my identical twin sisters by voice or sneeze.)> If implemented as a FIR filter can it be 'linear phase'. > > Looking at a pdf discussing various windows, I think individual > frequency responses similar to shape of a "Blackman Window" would be > optimal. The individual pass band peaks would probably be at least a 1/2 > octave apart.How would you measure the frequency response of a window? (I don't mean to claim that you cant, provided other factors are specified.) Windows are typically applied to time-domain data.> what would be relative advantages of implementing: > 1. by adding outputs of individual filtersEasier to design?> 2. a single filter with appropriate frequency responseeasier to program?> Separate topic -- how many issues have I missed?I don't know. How many false assumptions have you made? At this stage, spell out your assumptions, even if it seems tedious.> BTW -- Remember the number one rule of education: A paragraph in a book > doesn't give you a license to stop thinking. > (seen on WEB somewhere)That's a good rule, but "the one rule"? Nah! Jerry -- Engineering is the art of making what you want from things you can get. �����������������������������������������������������������������������
Reply by ●December 11, 20052005-12-11
Jerry Avins wrote:> Richard Owlett wrote: > > ... > >> I start with a basic presupposition that however humans recognize >> speech is, in some sense, the "best" way. That leads me to believe >> that any filtering should be constant group delay -- ie linear phase. > > I don't see the connection between "human" and "constant group delay". > Is there an unexamined (or at least unexpressed) assumption involved?Yes I was explicitly assuming the connection. Maybe "presuming" would be a better word. But as it led to my question, we'll just take it as a given whether or not it is an accurate description of reality.> >> I have an idea of what might be a useful frequency response - 4 or 5 >> 'humps' corresponding to formant frequencies. > > Formant frequencies vary among individuals, and we are very sensitive to > such variations. (I can distinguish my identical twin sisters by voice > or sneeze.)Yes, and I was going to identify formant frequencies during training mode. Doing much violence to standard nomenclature, I was thinking in terms of "voice pass" and "interference stop" filters. The "why" and "reasonableness" is *OT* for my question. Please take as a given that I wish a passband filter with a particular arbitrary lumpy shape which is also linear phase.> >> If implemented as a FIR filter can it be 'linear phase'. >> >> Looking at a pdf discussing various windows, I think individual >> frequency responses similar to shape of a "Blackman Window" would be >> optimal. The individual pass band peaks would probably be at least a >> 1/2 octave apart. > > How would you measure the frequency response of a window? (I don't mean > to claim that you cant, provided other factors are specified.) Windows > are typically applied to time-domain data.Ahh, but that's why I said "the *SHAPE* of a 'Blackman Window' ". I was looking at various windows and their transforms. A DFT/IFT has no way of knowing that the numbers it feeds upon are in time or frequency domain.> >> what would be relative advantages of implementing: >> 1. by adding outputs of individual filters > > Easier to design? > >> 2. a single filter with appropriate frequency response > > easier to program?I'll try to rephrase in "domain neutral" terms. -- Then again that will cause more problems than it's worth ;{ [--- perhaps very relevant side issue If linear superposition applies in time/frequency domain, does it survive FT to frequency/time domain followed by IFT back to time/frequency domain? ---] I'll restate my problem. For arbitrary and unchangeable reasons I wish a filter defined in frequency domain to have certain characteristics. 1. it *shall* be linear phase 2. its passband is of arbitrary shape a. it can be treated as a whole b. it can be seen as linear superposition of a few simple terms So I repeat my basic question What would be relative advantages of implementing: 1. by adding outputs of individual filters 2. a single filter with appropriate frequency response> >> Separate topic -- how many issues have I missed? > > > I don't know. How many false assumptions have you made? At this stage, > spell out your assumptions, even if it seems tedious. > >> BTW -- Remember the number one rule of education: A paragraph in a >> book doesn't give you a license to stop thinking. >> (seen on WEB somewhere) > > > That's a good rule, but "the one rule"? Nah!It only said "number one rule", it did not say "the *only* rule" ;)
Reply by ●December 11, 20052005-12-11
Richard Owlett <rowlett@atlascomm.net> writes:> I'll restate my problem. > For arbitrary and unchangeable reasons I wish a filter defined in > frequency domain to have certain characteristics. > > 1. it *shall* be linear phase > 2. its passband is of arbitrary shape > a. it can be treated as a whole > b. it can be seen as linear superposition of a few simple terms > > So I repeat my basic question > What would be relative advantages of implementing: > 1. by adding outputs of individual filtersThis option might be nice if you're thinking along the lines of a graphic equalizer: being able to arbitrarily change the gain (volume) of a particular band of frequencies might be useful.> 2. a single filter with appropriate frequency responseThis might be good if you know that the response doesn't have to change much. Changing this sort of a filter on-the-fly, though, might be problematic. Ciao, Peter K.
Reply by ●December 11, 20052005-12-11
Richard Owlett wrote:> Jerry Avins wrote: > >> Richard Owlett wrote: >> >> ... >> >>> I start with a basic presupposition that however humans recognize >>> speech is, in some sense, the "best" way. That leads me to believe >>> that any filtering should be constant group delay -- ie linear phase. >> >> >> I don't see the connection between "human" and "constant group delay". >> Is there an unexamined (or at least unexpressed) assumption involved? > > > Yes I was explicitly assuming the connection. Maybe "presuming" would be > a better word. But as it led to my question, we'll just take it as a > given whether or not it is an accurate description of reality. > >> >>> I have an idea of what might be a useful frequency response - 4 or 5 >>> 'humps' corresponding to formant frequencies. >> >> >> Formant frequencies vary among individuals, and we are very sensitive >> to such variations. (I can distinguish my identical twin sisters by >> voice or sneeze.) > > > Yes, and I was going to identify formant frequencies during training > mode. Doing much violence to standard nomenclature, I was thinking in > terms of "voice pass" and "interference stop" filters. The "why" and > "reasonableness" is *OT* for my question. > > Please take as a given that I wish a passband filter with a particular > arbitrary lumpy shape which is also linear phase. > >> >>> If implemented as a FIR filter can it be 'linear phase'. >>> >>> Looking at a pdf discussing various windows, I think individual >>> frequency responses similar to shape of a "Blackman Window" would be >>> optimal. The individual pass band peaks would probably be at least a >>> 1/2 octave apart. >> >> >> How would you measure the frequency response of a window? (I don't >> mean to claim that you cant, provided other factors are specified.) >> Windows are typically applied to time-domain data. > > > Ahh, but that's why I said "the *SHAPE* of a 'Blackman Window' ". > I was looking at various windows and their transforms. > A DFT/IFT has no way of knowing that the numbers it feeds upon are in > time or frequency domain.The the differences between the shapes of filters is subtle. If those filters without steps at the ends, I find it difficult to distinguish a Blackman from Nuttall, Blackman-Harris, von Hann, and others. What distinguishing feature of Blackman attracts you?>>> what would be relative advantages of implementing: >>> 1. by adding outputs of individual filters >> >> >> Easier to design? >> >>> 2. a single filter with appropriate frequency response >> >> >> easier to program? > > > I'll try to rephrase in "domain neutral" terms. > -- Then again that will cause more problems than it's worth ;{ > > [--- perhaps very relevant side issue > If linear superposition applies in time/frequency domain, does > it survive FT to frequency/time domain followed by IFT back to > time/frequency domain? > ---]Yes> I'll restate my problem. > For arbitrary and unchangeable reasons I wish a filter defined in > frequency domain to have certain characteristics. > > 1. it *shall* be linear phase > 2. its passband is of arbitrary shape > a. it can be treated as a whole > b. it can be seen as linear superposition of a few simple terms > > So I repeat my basic question > What would be relative advantages of implementing: > 1. by adding outputs of individual filtersEasier to design?> 2. a single filter with appropriate frequency responseEasier to program? ... Jerry -- Engineering is the art of making what you want from things you can get. �����������������������������������������������������������������������
Reply by ●December 12, 20052005-12-12
Peter K. wrote:> Richard Owlett <rowlett@atlascomm.net> writes: > > >>I'll restate my problem. >>For arbitrary and unchangeable reasons I wish a filter defined in >>frequency domain to have certain characteristics. >> >>1. it *shall* be linear phase >>2. its passband is of arbitrary shape >> a. it can be treated as a whole >> b. it can be seen as linear superposition of a few simple terms >> >>So I repeat my basic question >>What would be relative advantages of implementing: >>1. by adding outputs of individual filters > > > This option might be nice if you're thinking along the lines of a > graphic equalizer: being able to arbitrarily change the gain (volume) > of a particular band of frequencies might be useful. > > >>2. a single filter with appropriate frequency response > > > This might be good if you know that the response doesn't have to > change much. Changing this sort of a filter on-the-fly, though, might > be problematic. > > Ciao, > > Peter K. > >It would be set once per "user".
Reply by ●December 12, 20052005-12-12
Jerry Avins wrote:> Richard Owlett wrote: > >> Jerry Avins wrote: >> >>> Richard Owlett wrote: >>> ... >>>> I start with a basic presupposition that however humans recognize >>>> speech is, in some sense, the "best" way. That leads me to believe >>>> that any filtering should be constant group delay -- ie linear phase. >>> >>> I don't see the connection between "human" and "constant group >>> delay". Is there an unexamined (or at least unexpressed) assumption >>> involved? >> >> Yes I was explicitly assuming the connection. Maybe "presuming" would >> be a better word. But as it led to my question, we'll just take it as >> a given whether or not it is an accurate description of reality. >> >>> >>>> I have an idea of what might be a useful frequency response - 4 or 5 >>>> 'humps' corresponding to formant frequencies. >>> >>> Formant frequencies vary among individuals, and we are very sensitive >>> to such variations. (I can distinguish my identical twin sisters by >>> voice or sneeze.) >> >> Yes, and I was going to identify formant frequencies during training >> mode. Doing much violence to standard nomenclature, I was thinking in >> terms of "voice pass" and "interference stop" filters. The "why" and >> "reasonableness" is *OT* for my question. >> >> Please take as a given that I wish a passband filter with a particular >> arbitrary lumpy shape which is also linear phase. >> >>> >>>> If implemented as a FIR filter can it be 'linear phase'. >>>> >>>> Looking at a pdf discussing various windows, I think individual >>>> frequency responses similar to shape of a "Blackman Window" would be >>>> optimal. The individual pass band peaks would probably be at least a >>>> 1/2 octave apart. >>> >>> How would you measure the frequency response of a window? (I don't >>> mean to claim that you cant, provided other factors are specified.) >>> Windows are typically applied to time-domain data. >> >> Ahh, but that's why I said "the *SHAPE* of a 'Blackman Window' ". >> I was looking at various windows and their transforms. >> A DFT/IFT has no way of knowing that the numbers it feeds upon are in >> time or frequency domain. > > > The the differences between the shapes of filters is subtle. If those > filters without steps at the ends, I find it difficult to distinguish a > Blackman from Nuttall, Blackman-Harris, von Hann, and others. What > distinguishing feature of Blackman attracts you?I have a pdf of unknown title ( got saved as Windows.pdf ) written by Craig Stuart Sapp <craig@ccrma.stanford.edu> 25 Feb 1997. I has a collection of various windows and their transforms. The particular Blackman window illustrated had a "nice" central lobe and all the residual lobes were of "uniform" shape and at least 60 dB down. *DARN YOU MR. AVINS* You just made me read rather than just look at pretty pictures ;{ The plot of the particular Blackman-Harris window had max side lobes another 20 dB down, but scale of drawing emphasized the side lobes near the central one. Transform of illustrated Hann window -- too much slop Transform of illustrated Hann-Poisson window has a "pleasing shape" with less "rejection" off central peak. I've been "hit over head with 2x4" on another issue. What a implications of all these being symmetric about some point. Obviously if I'm going to have "passband 1 of width a centered at freq b" and "passband 2 of width y centered at freq z" what strange effects will asymmetry have?> >>>> what would be relative advantages of implementing: >>>> 1. by adding outputs of individual filters >>> >>> Easier to design? >>> >>>> 2. a single filter with appropriate frequency response >>> >>> easier to program? >> >> I'll try to rephrase in "domain neutral" terms. >> -- Then again that will cause more problems than it's worth ;{ >> >> [--- perhaps very relevant side issue >> If linear superposition applies in time/frequency domain, does >> it survive FT to frequency/time domain followed by IFT back to >> time/frequency domain? >> ---] > > > Yes > >> I'll restate my problem. >> For arbitrary and unchangeable reasons I wish a filter defined in >> frequency domain to have certain characteristics. >> >> 1. it *shall* be linear phase >> 2. its passband is of arbitrary shape >> a. it can be treated as a whole >> b. it can be seen as linear superposition of a few simple terms >> >> So I repeat my basic question >> What would be relative advantages of implementing: >> 1. by adding outputs of individual filters > > > Easier to design? > >> 2. a single filter with appropriate frequency response > > > Easier to program? > > ... > > Jerry
Reply by ●December 12, 20052005-12-12
Richard Owlett wrote:> ... > That got me thinking ;<What are the *NECESSARY* conditions for a FIR filter of an arbitrary shape in the frequency domain to be "linear phase". One of the references I was reading stated that "a FIR filter would be 'linear phase' if its coefficients were symmetric about the middle coefficient." Is that a "sufficient" condition or a "necessary" condition? What implication does it have for the passband response?
Reply by ●December 12, 20052005-12-12
Richard Owlett <rowlett@atlascomm.net> writes:> Richard Owlett wrote: > >> ... >> That got me thinking ;< > > What are the *NECESSARY* conditions for a FIR filter of an arbitrary > shape in the frequency domain to be "linear phase". > > One of the references I was reading stated that "a FIR filter would be > 'linear phase' if its coefficients were symmetric about the middle > coefficient." > > Is that a "sufficient" condition or a "necessary" condition? > What implication does it have for the passband response?Hi Richard, It is a sufficient condition. A trivial example of an FIR filter that does not meet this condition but is still linear phase is the FIR given by h[0] = 0, h[1] = 0, and h[2] = 1. I've heard that a linear-phase filter has magnitude and phase responses that are Hilbert transforms of each other, but I've never been interested enough to investigate. -- % Randy Yates % "Rollin' and riding and slippin' and %% Fuquay-Varina, NC % sliding, it's magic." %%% 919-577-9882 % %%%% <yates@ieee.org> % 'Living' Thing', *A New World Record*, ELO http://home.earthlink.net/~yatescr
Reply by ●December 12, 20052005-12-12
Randy Yates <yates@ieee.org> writes:> [...] > I've heard that a linear-phase filter has magnitude and phase > responses that are Hilbert transforms of each other, but I've > never been interested enough to investigate.Sorry - correction!: Those are *minimum-phase* filters. -- % Randy Yates % "Watching all the days go by... %% Fuquay-Varina, NC % Who are you and who am I?" %%% 919-577-9882 % 'Mission (A World Record)', %%%% <yates@ieee.org> % *A New World Record*, ELO http://home.earthlink.net/~yatescr