Jump to content

Exploratory data analysis

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Cutler (talk | contribs) at 12:08, 31 December 2003. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Exploratory data anaysis. or EDA, is the name given by John Tukey to that part of statistical practice concerned with reviewing, communicating and using data where there is a low level of knowledge about its cause system.

Tukey held that too much emphasis in statistics was placed on evaluating and testing given hypotheses (confirmatory data analysis) and that the balance was in need of redressing in favour of using data to suggest hypotheses to test. In particular, confusion of the two types of analysis and employing them on the same set of data can lead to bias owing to the effect of testing effects suggested by the data.

The objectives of EDA are to:

The principle graphical tools used in EDA are:

The principle quantitative tools are:

Bibliography

  • Hoaglin, D C; Mosteller, F & Tukey, J W (Eds) (1985) Exploring Data Tables, Trends and Shapes ISBN 0471097764
  • Hoaglin, D C; Mosteller, F & Tukey, J W (Eds) (1983) Understanding Robust and Exploratory Data Analysis ISBN 0471097772
  • Tukey, J W (1977) Exploratory Data Analysis ISBN 0201076160
  • Velleman, P F & Hoaglin, D C (1981) Applications, Basics and Computing of Exploratory Data Analysis ISBN 087150409X