Issue implementing Energy threshold algorithm for Voice Activity Detection

Started by kash...@gmail.com in Speech Recognition6 years ago

Hi all I am trying to implement the energy threshold algorithm for voice activity detection and not getting meaningful values for energy for...

Hi all I am trying to implement the energy threshold algorithm for voice activity detection and not getting meaningful values for energy for frames of size wL. wL = 1784 // about 40 ms ( const double decay_constant = 0.90 // some optimal value between 0 and 1 double prevrms = 1.0 // avoid DivideByZero double threshold = some optimal value after some experimentation for (int i = 0; i ...


voice command

Started by manw...@hotmail.com in Speech Recognition7 years ago 1 reply

Good day to everyone. I am new to speech processing and recognition. So the question may look elementary to many. I am trying to develop a small...

Good day to everyone. I am new to speech processing and recognition. So the question may look elementary to many. I am trying to develop a small system that would require the system just to recognize 6 words. What kind of feature extraction and pattern matching would be optimum for me? Anwer, Dhaka, Bangladesh


Hidden Markov model - zero probability problem

Started by pete...@st.fri.uniza.sk in Speech Recognition7 years ago

Hi, I am implementing a speech recognition system based on Hidden Markov models. I have finished the Baum-Welch algorithm for model training -...

Hi, I am implementing a speech recognition system based on Hidden Markov models. I have finished the Baum-Welch algorithm for model training - and tested it on some primitive models. I work with discrete HMM, so I am using a code book with 50 output symbols. For speech parametrization I am using 12 MFCC coefficients for every block of sound(512 sound samples - 30 ms). Now when I am trainin...


Re: Classifying 'tone' of voice during speech recognition

Started by Jeff Brower in Speech Recognition8 years ago 2 replies

Judith- > There is a lot of overlap of values(a confusion) in the > features you extract from the emotions.so classifying > emotions? &...

Judith- > There is a lot of overlap of values(a confusion) in the > features you extract from the emotions.so classifying > emotions? & tones is a big job. I'm not the original poster, but yes I agree 100%. -Jeff


Classifying 'tone' of voice during speech recognition

Started by ypar...@gmail.com in Speech Recognition8 years ago 1 reply

Hi, I am a newbie to speech recognition and DSP field. I am looking for a software library that will take an audio file and classify the 'tone'...

Hi, I am a newbie to speech recognition and DSP field. I am looking for a software library that will take an audio file and classify the 'tone' of voice in audio file to one of the few predetermined categories e.g. sad, angry, annoyed, happy, unknown, 'insufficient data' etc. In case of audio file having multiple tones I would like to know ranking e.g. 1-Angry (50%) 2-Annoyed (40%) 3-Sad (...


Re: Classifying 'tone' of voice during speech recognition

Started by Paresh Yadav in Speech Recognition8 years ago

Hi Jeff, Thanks for your response. I agree with you. I would love to be famous and might just do that ;-). This is not my domain at present...

Hi Jeff, Thanks for your response. I agree with you. I would love to be famous and might just do that ;-). This is not my domain at present and I have other things to worry about in addition to paying my bills with the current job. I like to work on crazy challenging problem and this seems just the kind of trouble I am usually looking for. Regards, Paresh On Mon, Apr 5, 2010 at 3:22 ...


Alphabet Recognition

Started by erik...@gmail.com in Speech Recognition8 years ago

(Sorry if this is double-post with my previous email, some issues). I am working on a small program for recognition the alphabet. The idea is...

(Sorry if this is double-post with my previous email, some issues). I am working on a small program for recognition the alphabet. The idea is to take in some sound sample, perform a FFT on the modulated curve, then compare the extreme points of magnitudes on the frequency plane, is this how its normally done? Since if i limit myself to the alphabet and the numbers, it should not be so many diff...


Need advices on speech recognition project

Started by nhan...@hotmail.com in Speech Recognition8 years ago 1 reply

Hello, I am an undergraduate student looking to do a senior project on speech recognition. The object of my project is to used speech recognition...

Hello, I am an undergraduate student looking to do a senior project on speech recognition. The object of my project is to used speech recognition system to control a device. My partner is working on designing this particular device and my part on this project is the speech recognition. So far on my research, I learned that speech recognition is related to dsp. It uses algorithms to recognized/anal...


removing silence/noise from speech signal

Started by unique0attitude in Speech Recognition8 years ago 1 reply

i m doing a project on speech recognition. 1) i've tried to determine the end-points of connected words and get the separate words. but the prob...

i m doing a project on speech recognition. 1) i've tried to determine the end-points of connected words and get the separate words. but the prob is that it is not removing complete silence portion. And in some cases it is considering even the noise portion as a seperate word. 2) What if we dont separate the words and take MFCCs of both words together as one matrix. will that be ok? Cn any1 he...


Speech recognition for a newbie

Started by ngolehung84 in Speech Recognition8 years ago 1 reply

Hi all, I'm very new with both DSP and Speech recognition. So could you please let me know: + What type of filters used to remove noise...

Hi all, I'm very new with both DSP and Speech recognition. So could you please let me know: + What type of filters used to remove noise since my input is taken in a noisy environment. + Some documents that discuss the frequency range of human voice? + Other types of filters? + Algorithms/techniques/models used to retrieve and match sample/pattern. These are all I think I need to wor...


Ask a Question to the DSPRelated community

To significantly increase your chances of receiving answers, please make sure to:

  1. Use a meaningful title
  2. Express your question clearly and well
  3. Do not use this forum to promote your product, service or business
  4. Write in clear, grammatical, correctly-spelled language
  5. Do not post content that violates a copyright