Jump to content

Algorithms for calculating variance

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Dick Beldin (talk | contribs) at 03:39, 7 March 2001. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

back to Variance

Algorithms for Calculating Variance

The variance of a population is defined as the root mean squared deviation from the mean. That mouthful says the same as the formula below.

  • Variance = {(x1-µ)2 + (x2-µ)2 + ... + (xn-µ)2} / n; where µ is the Arithmetic Mean of the data set.

There is another formula for calculating variance which you may see. It uses the sum of all the data and the sum of the squares. The formula is:

  • Variance = [n{x12 + x22 + ... + xn2} - {x1 + x2 + ... + xn}2] / n2

This formula was introduced when the prevailing calculators made it much easier to sum squares and the raw data than to sum the squared deviations. Because this formula can result in loss of precision, it should no longer be recommended except for small exercises.


The method of calculation may be more easily understood from the table below where the mean is 8.

ixixi-mean(xi-mean)2
(deviation)(squared deviation)
15-39
27-11
3800
41024
51024
n=5sum=40018
  • mean = 40/5 = 8
  • variance = 18/5 = 3.6
  • standard deviation = 1.897366596101 or 1.9

Note that the column of deviations sums to zero. This is always the case. Note also that we round the standard deviation to one more than the number of significant digits in the mean.