Jump to content

Additive smoothing

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Touko vk (talk | contribs) at 10:04, 25 April 2008 (Additive smoothing). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

In the field of statistical language modeling and statistics additive smoothing is a technique used to smooth distribution representing for example occurrences of a word in a text.

The additively smoothed distribution is defined as:

where is a number typically between 0 and 1, is the group of all sample groups (for example different words) and is the number of all samples


Additive smoothing is sometimes referred as Lidstone smoothing.

References

  • SF Chen, J Goodman (1996). An empirical study of smoothing techniques for language modeling. Proceedings of the 34th annual meeting on Association for Computational Linguistics table of contents