Jump to content

User:Icax0r/Maximum likelihood linear regression

From Wikipedia, the free encyclopedia
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

Maximum Likelihood Linear Regression (also known as MLLR) is a commonly-used technique in Speech recognition for speaker adaptation. Given a small amount of speech from a target speaker, the parameters of a speaker-independent model can be adjusted to maximize the likelihood of the target speaker's speech under the model.

Description

Parameter Tying

If not all of the distributions are observed in the training data, then the parameters may be tied - i.e. a single transformation matrix is applied to several distributions. Distributions can be clustered according to phonetic similarity, or automatic clustering methods may be used.

-- TODO -- math

Relation to Other Types of Adaptation

CMLLR

-- TODO -- doesn't rely on having a transcript.  and MLLR does??  use a GMM to estimate the tr

MAP Adaptation

-- TODO you need more data. --

References