Jump to content

Audio-visual speech recognition

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by SporkBot (talk | contribs) at 17:39, 23 November 2017 (Remove template per TFD outcome). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Audio visual speech recognition (AVSR) is a technique that uses image processing capabilities in lip reading to aid speech recognition systems in recognizing undeterministic phones or giving preponderance among near probability decisions.

Each system of lip reading and speech recognition works separately, then their results are mixed at the stage of feature fusion.

External links

IBM Research - Audio Visual Speech Technologies

This computational linguistics-related article is a stub. You can help Wikipedia by expanding it.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Audio-visual_speech_recognition&oldid=811745748"

Hidden category:

All stub articles