Jump to content

Data processing inequality

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 195.220.100.11 (talk) at 16:10, 27 September 2019. The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The Data processing inequality is an information theoretic concept which states that the information content of a signal cannot be increased via a local physical operation. This can be expressed concisely as 'post-processing cannot increase information'.[1] As explained by Kinney and Atwal, the DPI means that information is generally lost (never gained) when transmitted through a noisy channel.[2]

Definition

Let three random variables form the Markov chain , implying that the conditional distribution of depends only on and is conditionally independent of . Specifically, we have such a Markov chain if the joint probability mass function can be written as

In this setting, no processing of Y , deterministic or random, can increase the information that Y contains about X. Using the mutual information, this can be written as :

With the equality if and only if .[3]

See also

References

  1. ^ Beaudry, Normand (2012), "An intuitive proof of the data processing inequality", Quantum Information & Computation, 12 (5–6): 432–441, arXiv:1107.0740, Bibcode:2011arXiv1107.0740B
  2. ^ Kinney; Atwal (Mar 2014). "Equitability, mutual information, and the maximal information coefficient". Proc Natl Acad Sci U S A. 111 (9): 3354–9. arXiv:1301.7745. Bibcode:2014PNAS..111.3354K. doi:10.1073/pnas.1309933111. PMC 3948249. PMID 24550517.
  3. ^ Cover; Thomas (2012). Elements of information theory. John Wiley & Sons.