• Recent
  • Compose
  • Select the "Compose" tab to start a new discussion

Bandwidth Extension

Started by seid...@yahoo.com in Speech Coding4 years ago

Dear Sir/Mrs My research field is speech enhancement and want to know about bandwidth extension more. If any one have comprehensive...

Dear Sir/Mrs My research field is speech enhancement and want to know about bandwidth extension more. If any one have comprehensive information about these methods please help me to solve my problem about codebook method. I can not convert my LSF's coefficients for narrow band speech to wideband speech LSF's. Regards. Seyed Farid Mousavipour


about G729 framing details

Started by renzhengshu3 in Speech Coding4 years ago 4 replies

HI, all I have a question on G729 framing details I used a G729 codec to encode a .wav file before encoding, it is about 300 KB ...

HI, all I have a question on G729 framing details I used a G729 codec to encode a .wav file before encoding, it is about 300 KB after encoding using G729 without VAD , the encoded G729 file is still about 300 KB, why isn't there any compression on bytes? after encoding using G729 with VAD, the encoded file is about 250KB, only 50 KB smaller? Besides, according to a pos...


Is there a better way to optimize IIR implementation in speech processing?

Started by jogg...@gmail.com in Speech Coding4 years ago

Hi, all Nowadays I work on the optimization of speech processing, but most of my previous work is about video codec, so I have little...

Hi, all Nowadays I work on the optimization of speech processing, but most of my previous work is about video codec, so I have little experience about speech processing. In speech processing, IIR filter is always employed. In IIR filter data dependency makes optimization using SIMD difficult. The following code is about IIR. I can't find a better way to optimize it. whi...


question about linear prediction of the chirp

Started by Farzane Ahmadi in Speech Coding4 years ago

Hi I am trying to measure the resonances of an arbitrary cavity. I use an ultrasonic signal x to excite the cavity and try to estimate the...

Hi I am trying to measure the resonances of an arbitrary cavity. I use an ultrasonic signal x to excite the cavity and try to estimate the resonances from the cavity output (y). x --> [cavity] --> y I use a chirp pulse train as input (x) to excite the cavity. http://www.mathworks.com/help/toolbox/signal/chirp.html Chirp has a flat spectrum so in frequency domain it has a uniform freq


NB-AMR seek implementation

Started by Padmanabha V Reddy in Speech Coding4 years ago 3 replies

Hi, I am finding that after seek the output of NB-AMR decoder is distorted. I have observed this behavior with 1. codecs from Google (as...

Hi, I am finding that after seek the output of NB-AMR decoder is distorted. I have observed this behavior with 1. codecs from Google (as part Android), as well as 2. in PC based decoder called AMRPlayer software(For this, I have actually dumped the input given to Google's decoder and fed it to AMRPlayer, thereby I'm simulating the seek but not doing the seek even though it has it). ...


ITU-T G.723.1 Speech Coder: A Matlab Implementation P. Kabal Department of Electrical & Computer Engineering McGill University

Started by lu dinh vu in Speech Coding4 years ago 1 reply

In document: ITU-T G.723.1 Speech Coder: A Matlab Implementation P. Kabal Department of Electrical & Computer Engineering McGill...

In document: ITU-T G.723.1 Speech Coder: A Matlab Implementation P. Kabal Department of Electrical & Computer Engineering McGill University -------------------HHP(z), ---Highpass Filter ?(page 5).? and in page 15, what is HF(z)? tks! LU DINH VU


VAD Decision from G.729B and AMR code

Started by tparis23 in Speech Coding4 years ago

Hello, I am implementing several standard VADs as a reference for my work in my internship on VAD. We've decided on G.729B and AMR. I have...

Hello, I am implementing several standard VADs as a reference for my work in my internship on VAD. We've decided on G.729B and AMR. I have the C code and have read the documentation for the algorithms and for the code, but I seem to be missing something. Where is the VAD speech/non-speech decision output for these programs? Is it in the bitstream, or do I need to modify the code myself ...


G722 Source Link

Started by mahesh_gb5 in Speech Coding4 years ago 1 reply

Dear All Please let me know where i can download the G722 source code if i use this is there any licensing issues. Thanks and reg's --Mahesh

Dear All Please let me know where i can download the G722 source code if i use this is there any licensing issues. Thanks and reg's --Mahesh


EVRC-B capable of word Mangling/ word replacement

Started by Kevin in Speech Coding4 years ago 1 reply

I came across a recording at the mobile of a land to mobile EVRC-B call Not on VZW. It had Screech Which was no surprise but also it decoded...

I came across a recording at the mobile of a land to mobile EVRC-B call Not on VZW. It had Screech Which was no surprise but also it decoded words that were not sent in the Harvard sentence source speech files! The word "write" was turned into ri.....ot and the word "cookie" became coochie! This would make it possible peoples words could be unknowingly miss understood or worse! Can 4GV mangl...


data format of PCM 16bit signed mono-channel and alignment(line-up) two PCM-16bit .wav files.

Started by nangergong in Speech Coding4 years ago 2 replies

Hi, all: I have some .wav files with format of PCM 16bit signed mono-channel from this...

Hi, all: I have some .wav files with format of PCM 16bit signed mono-channel from this link: https://ccrma.stanford.edu/courses/422/projects/WaveFormat/ I know each sample of these .wav files(except the header) can be converted to value -32768 to 32767 I want to time-align two .wav files with the format mentioned above. My steps are: 1) convert .wav files...


Sign up
or Sign in