Hierarchical Dirichlet process
Appearance
![]() | This article needs attention from an expert in Statistics. Please add a reason or a talk parameter to this template to explain the issue with the article.(February 2012) |
This article needs additional citations for verification. (February 2012) |
In statistics, the hierarchical Dirichlet process is a nonparametric Bayesian approach to clustering grouped data. It uses a Dirichlet process for each group of data, with the Dirichlet processes for all groups sharing a base distribution which is itself drawn from a Dirichlet process. This method allows groups to share statistical strength via sharing of clusters across groups. The base distribution being drawn from a Dirichlet process is important, because draws from a Dirichlet process are atomic probability measures, and the atoms will appear in all group-level Dirichlet processes. Since each atom corresponds to a cluster, clusters are shared across all groups.[1]
References
- ^ Teh, Y. W.; Jordan, M. I.; Beal, M. J.; Blei, D. M. (2006). "Hierarchical Dirichlet Processes" (PDF). Journal of the American Statistical Association. 101: pp. 1566–1581.