Jump to content

Dendrogram

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Manudouz (talk | contribs) at 13:45, 22 October 2018 (Clarifying the leading sentence.). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
Hierarchical clustering dendrogram of the Iris dataset (using R). Source

A dendrogram is a diagram representing a tree. This diagrammatic representation is frequently used in different contexts:

The name dendrogram derives from the two ancient greek words δένδρον (déndron), meaning "tree", and γράμμα (grámma), meaning "drawing, mathematical figure".[2][3]

Clustering example

For a clustering example, suppose this data is to be clustered using Euclidean distance as the distance metric.

Raw data

The hierarchical clustering dendrogram would be as such:

Traditional representation

The top row of nodes represents data (individual observations), and the remaining nodes represent the clusters to which the data belong, with the arrows representing the distance (dissimilarity).

The distance between merged clusters is monotone increasing with the level of the merger: the height of each node in the plot is proportional to the value of the intergroup dissimilarity between its two daughters (the top nodes representing individual observations are all plotted at zero height).

See also

References

  1. ^ Everitt, Brian (1998). Dictionary of Statistics. Cambridge, UK: Cambridge University Press. p. 96. ISBN 0-521-59346-8.
  2. ^ Bailly, Anatole (1981-01-01). Abrégé du dictionnaire grec français. Paris: Hachette. ISBN 2010035283. OCLC 461974285.
  3. ^ Bailly, Anatole. "Greek-french dictionary online". www.tabularium.be. Retrieved October 20, 2018.