Audio-visual speech recognition
From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Speechgrl (talk | contribs) at 13:56, 27 September 2007 (→External links). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Audio-visual speech recognition (AVSR) is a technique that uses image processing, specifically lip reading, to help speech recognition systems recognize phones that the audio alone leaves ambiguous, or to decide between candidate phones of nearly equal probability.
The two systems, lip reading and speech recognition, each work separately; their results are then combined at the feature fusion stage.
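As an illustration only, the following minimal Python sketch shows one simple way the outputs of two separate recognizers could be combined to settle a near-tie. It assumes each recognizer already produces a probability for each candidate phone and merges them with a weighted log-linear rule. The function names (`fuse_scores`, `best_phone`), the `audio_weight` parameter, and all probability values are hypothetical and chosen for the example; they do not describe any particular AVSR system.

```python
import math


def fuse_scores(audio_probs, visual_probs, audio_weight=0.7):
    """Combine per-phone probabilities from two recognizers.

    Hypothetical helper: uses a weighted sum of log-probabilities,
    then renormalizes so the fused scores sum to 1.
    """
    visual_weight = 1.0 - audio_weight
    eps = 1e-12  # floor to avoid log(0) for missing or zero entries
    log_scores = {}
    for phone, p_audio in audio_probs.items():
        p_visual = visual_probs.get(phone, eps)
        log_scores[phone] = (
            audio_weight * math.log(max(p_audio, eps))
            + visual_weight * math.log(max(p_visual, eps))
        )
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(log_scores.values())
    unnormalized = {ph: math.exp(s - top) for ph, s in log_scores.items()}
    total = sum(unnormalized.values())
    return {ph: u / total for ph, u in unnormalized.items()}


def best_phone(probs):
    """Return the phone with the highest probability."""
    return max(probs, key=probs.get)


# Made-up scores: the audio stream is nearly undecided between /b/, /d/, /g/,
# while the lip-reading stream favours /b/ (lip closure is visible).
audio = {"b": 0.34, "d": 0.35, "g": 0.31}
visual = {"b": 0.80, "d": 0.10, "g": 0.10}

fused = fuse_scores(audio, visual)
print("audio only:", best_phone(audio))
print("fused:     ", best_phone(fused))
```

With these made-up numbers the audio stream alone narrowly prefers /d/, while the fused scores prefer /b/, because the visual stream strongly supports the phone produced with closed lips. The weight given to each stream is a design choice; a real system would typically tune it, for example giving the visual stream more weight when the audio is noisy.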
External links
- IBM Research - Audio Visual Speech Technologies
- SpeechTEK - speech technology education
- Speech Technology magazine - editorial content about the speech industry