It's not quite so simple. You have to use fourier transforms to produce a spectrogram of the sound, identify consonant and vowel patterns (this is beyond me, but it involves identifying formant frequencies), identify pauses, calibrate to the user's voice, infer implied words (uming a certain language is being spoken)... It's very heavy stuff.
No comments:
Post a Comment