Mithun
Perceptual coding is based on the frequency response of the ear.
Rather than getting a book, the best way to learn about this is the
net.
And if you do get a book, get the one by the acoustics guru J.F. Moore.
Please kindly check the name, as it has been a long time since I touched this
BIBLE. If I got the author's name wrong, a quick search should sort it out.
In the meantime, try a quick search on terms like temporal masking,
psychoacoustics and audio compression. Many audio compression algos are based on
these ideas, assuming you have a bandwidth ranging from 20 Hz to 20 kHz.
For speech, which is bandlimited to 3.4 kHz in narrowband and to 7 kHz in
wideband, you can use similar concepts.
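
To get a feel for how the ear's sensitivity varies across those bands, here
is a rough sketch using Terhardt's widely quoted approximation of the
threshold in quiet. The function name ath_db and the list of sample
frequencies are my own choices for illustration; a real codec uses a far more
detailed model.

```python
import math

def ath_db(freq_hz):
    """Approximate threshold in quiet (dB SPL), Terhardt's formula."""
    f = freq_hz / 1000.0  # frequency in kHz
    return (3.64 * f ** -0.8
            - 6.5 * math.exp(-0.6 * (f - 3.3) ** 2)
            + 1e-3 * f ** 4)

# Sample frequencies chosen for illustration: low end, the narrowband and
# wideband speech limits, and the top of the audio band.
for freq in (100, 1000, 3400, 7000, 16000):
    print(f"{freq:>6} Hz: {ath_db(freq):6.1f} dB SPL")
```

The curve dips to its lowest point around 3 to 4 kHz, which is where the ear
is most sensitive, and rises steeply towards both ends of the audio band.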
Basically, in a nutshell, perceptual coding means that you can afford to
quantize coarsely as long as the quantization noise stays below the level the
ear can detect (the masking threshold). In speech you have resonant
modes termed formants. Here you can typically afford more compression and
still get away with it, since the human ear does not perceive the difference.
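
As a minimal sketch of that idea, assuming a uniform quantizer that gains
roughly 6.02 dB of signal-to-noise ratio per bit, you can estimate how many
bits a band needs to keep its quantization noise under the mask. The function
name bits_needed and the band levels below are made up for illustration; they
are not taken from any real codec.

```python
import math

DB_PER_BIT = 6.02  # approximate SNR gain per bit for a uniform quantizer

def bits_needed(signal_db, mask_db):
    """Smallest bit count that keeps quantization noise at or below the mask."""
    smr = signal_db - mask_db  # signal-to-mask ratio in dB
    if smr <= 0:
        return 0  # the band is already masked, so nothing needs to be coded
    return math.ceil(smr / DB_PER_BIT)

# Hypothetical (signal level, masking threshold) pairs in dB per band.
bands = {
    "band A": (70.0, 52.0),
    "band B": (45.0, 40.0),
    "band C": (30.0, 35.0),
}
for name, (signal_db, mask_db) in bands.items():
    print(name, bits_needed(signal_db, mask_db), "bits")
```

The higher the masking threshold sits relative to the signal in a band, the
fewer bits that band needs, which is where the compression comes from.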
My friend Arijit is pretty much into this field and you can ask him for
more details; I am cc'ing his email here.
S.Jaisimha
Shree Jaisimha