Jump to content

Dot plot (statistics)

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 2601:1c0:5901:2127:bd2b:acda:c439:2f6f (talk) at 05:33, 2 November 2018 (Added content). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

This is stupid why do you even check Wikipedia

Dot plots

A dot plot of 50 random values from 0 to 9.

The dot plot as a representation of a distribution consists of group of data points plotted on a simple scale. Dot plots are used for continuous, quantitative, univariate data. Data points may be labelled if there are few of them.

Dot plots are one of the simplest statistical plots, and are suitable for small to moderate sized data sets. They are useful for highlighting clusters and gaps, as well as outliers. Their other advantage is the conservation of numerical information. When dealing with larger data sets (around 20–30 or more data points) the related stemplot, box plot or histogram may be more efficient, as dot plots may become too cluttered after this point. Dot plots may be distinguished from histograms in that dots are not spaced uniformly along the horizontal axis.

Although the plot appears to be simple, its computation and the statistical theory underlying it are not simple. The algorithm for computing a dot plot is closely related to kernel density estimation. The size chosen for the dots affects the appearance of the plot. Choice of dot size is equivalent to choosing the bandwidth for a kernel density estimate.

In the R programming language this type of plot is also referred to as a stripchart[1] or stripplot.[2]

Cleveland dot plots

Dot plot may also refer to plots of points that each belong to one of several categories. They are an alternative to bar charts or pie charts, and look somewhat like a horizontal bar chart where the bars are replaced by a dots at the values associated with each category. Compared to (vertical) bar charts and pie charts, Cleveland argues that dot plots allow more accurate interpretation of the graph by readers by making the labels easier to read, reducing non-data ink (or graph clutter) and supporting table look-up.

References

  1. ^ Peter Dalgaard. Introductory Statistics with R. Springer. ISBN 0-387-95475-9.
  2. ^ Paul Murrell (2005). R Graphics. Chapman & Hall/CRC. ISBN 1-58488-486-X.

Other references

  • Wild, C. and Seber, G. (2000) Chance Encounters: A First Course in Data Analysis and Inference John Wiley and Sons. ISBN 0-471-32936-3