Jump to content

Hierarchical Dirichlet process

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 80.3.173.20 (talk) at 12:25, 18 August 2012 (Clarified definition of HDP). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

In statistics, the hierarchical Dirichlet process is a nonparametric Bayesian approach to clustering grouped data. It uses a Dirichlet process for each group of data, with the Dirichlet processes for all groups sharing a base distribution which is itself drawn from a Dirichlet process. This method allows groups to share statistical strength via sharing of clusters across groups. The base distribution being drawn from a Dirichlet process is important, because draws from a Dirichlet process are atomic probability measures, and the atoms will appear in all group-level Dirichlet processes. Since each atom corresponds to a cluster, clusters are shared across all groups.[1]


References

  1. ^ Teh, Y. W.; Jordan, M. I.; Beal, M. J.; Blei, D. M. (2006). "Hierarchical Dirichlet Processes" (PDF). Journal of the American Statistical Association. 101: pp. 1566–1581.