detecting the beat of a sound
hi guys i am making a character that can give the impression that it is speaking , so what i want to do is to use a microphone and when i speak in it the character starts to move his mouth to look like it is talking . i used morphing to make the face change , but my problem is that how i could detect the sound beat from microphone and know if someone is speaking or not to make a reaction for the lips of my character :( :( is there any tutorials ??
thanks for any help
Well, how realistic do you want it to look? The simplest solution would just to be to take a running average of the sound level of the last N samples and use that as the 'position' of the mouth (where 0 volume is mouth closed, full volume is mouth wide open). This will probably look fairly crappy, but it might be good enough for what you want.
The other extreme is detecting which phonetics are being spoken and playing back a mouth animation that matches that phonetic and transitions to the next phonetic (transitions could probably be done by blending the animations for neighbouring phenomes). This option is far from easy.....
The other extreme is detecting which phonetics are being spoken and playing back a mouth animation that matches that phonetic and transitions to the next phonetic (transitions could probably be done by blending the animations for neighbouring phenomes). This option is far from easy.....
"Voilà! In view, a humble vaudevillian veteran, cast vicariously as both victim and villain by the vicissitudes of Fate. This visage, no mere veneer of vanity, is a vestige of the vox populi, now vacant, vanished. However, this valorous visitation of a bygone vexation stands vivified, and has vowed to vanquish these venal and virulent vermin vanguarding vice and vouchsafing the violently vicious and voracious violation of volition. The only verdict is vengeance; a vendetta held as a votive, not in vain, for the value and veracity of such shall one day vindicate the vigilant and the virtuous. Verily, this vichyssoise of verbiage veers most verbose, so let me simply add that it's my very good honor to meet you and you may call me V.".....V
i don't want it to look very real just to sho a movement , but my question is how i could detect if someone is speaking or not ????? so if there is speaking the mouth will move randomly and if not it will stay closed .
thanks joanusdmentia
how i could detect if someone is speaking or not ?????
thanks joanusdmentia
You should look around at sound visualisation examples if you want to figure out if somebody's talking or not. Once you figure out how to display sounds as images, you'll even be able to pick out certain parts where the sounds are a little more specific, like they do in the Nullsoft Tiny Fullscreen visualisation plugin in Winamp (the scrolling voiceprint one to be exact). Even then it's still not going to be perfect, but it should still give you something better than a flapping mouth that doesn't follow the words.
Quote: Original post by Gorax
You should look around at sound visualisation examples if you want to figure out if somebody's talking or not. Once you figure out how to display sounds as images, you'll even be able to pick out certain parts where the sounds are a little more specific
this what i am searching for but i am not finding any things :( , i just want to know how to take the amplitude strength of the sound wave
Quote: Original post by joanusdmentia
Well, how realistic do you want it to look? The simplest solution would just to be to take a running average of the sound level of the last N samples and use that as the 'position' of the mouth (where 0 volume is mouth closed, full volume is mouth wide open). This will probably look fairly crappy, but it might be good enough for what you want.
The other extreme is detecting which phonetics are being spoken and playing back a mouth animation that matches that phonetic and transitions to the next phonetic (transitions could probably be done by blending the animations for neighbouring phenomes). This option is far from easy.....
I think that the question was answered correctly by joanusdmentia... and there's no need to to act like this... get samples from the microphone, then process it and display according to the processing this is the way to do it.
That processing can vary from checking the average volume of those samples to detecting the phonetics and transitions from the speech. This is audio processing, you'll get more info on google. For the phonetics and transitions in speech look for info on the speech recognition algorithms. Phonetics and transition would allow you to choose the right animation frames (or whatever used for animation) according to what was said.
JFF
This topic is closed to new replies.
Advertisement
Popular Topics
Advertisement
Recommended Tutorials
Advertisement