Jump to content

MA plot

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Talgalili (talk | contribs) at 15:36, 21 October 2011 (Examples: coloring the code). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

DNA microarrays consist of an arrayed series of thousands of microscopic spots of DNA which allow comparisons between two samples of RNA or DNA (target samples), which can provide data on relative gene expression levels (in the case of RNA) or gene copy number (for DNA). The data obtained from two-color DNA microarrays come in the form of fluorescent Red (Cy5) and Green (Cy3) dye intensities. One-color oligonucleotide arrays use only a single fluorescent dye.

Microarray data is often normalized within arrays to control for systematic biases in dye coupling and hybridization efficiencies, as well as other technical biases in the DNA probes and the print tip used to spot the array[1]. By minimizing these systematic variations, true biological differences can be found. To determine whether normalization is needed, one can plot Cy5 (G) intensities against Cy3 (R) intensities and see whether the slope of the line is around 1. An improved method, which is basically a scaled, 45 degree rotation of the R vs. G plot is an MA-plot[2]. The MA-plot is a plot of the distribution of the red/green intensity ratio ('M') plotted by the average intensity ('A'). M and A are defined by the following equations.

 

M is, therefore, the intensity ratio and A is the average intensity for a dot in the plot. MA plots are then used to visualize intensity-dependent ratio of raw microarray data. The MA plot uses M as the y-axis and A as the x-axis. The MA plot gives a quick overview of the distribution of the data. In many microarray gene expression experiments, the general assumption is that most of the genes would not see any change in their expression. Therefore the majority of the points on the y axis (M) would be located at 0, since Log(1) is 0. If this is not the case, then a normalization method such as LOESS should be applied to the data before statistical analysis.

The MA plot is the application of a Bland–Altman plot for visual representation of two channel DNA microarray gene expression data which has been transformed onto the M and A scale.

Packages

Several Bioconductor packages, for the R software, provide the facility for creating MA plots, these include affy (ma.plot, mva.pairs), limma (plotMA) and marray (maPlot)

Examples

library(affy)

if (require(affydata)) 
{
     data(Dilution)
}

y <- (exprs(Dilution)[, c("20B", "10A")])

x11()

ma.plot( rowMeans(log2(y)), log2(y[, 1])-log2(y[, 2]), cex=1 ) 

title("Dilutions Dataset (array 20B v 10A)")

library(preprocessCore)

#do a quantile normalization
x <- normalize.quantiles(y)

x11()

ma.plot( rowMeans(log2(x)), log2(x[, 1])-log2(x[, 2]), cex=1 ) 
title("Post Norm: Dilutions Dataset (array 20B v 10A)")


MA Plot for raw data, note of the distortion in the upper ranges.
Pre Normalization
MA Plot for quantile normalized data.
Post Normalization
MA Plots.

References

  1. ^ YH Yang, S Dudoit, P Luu, DM Lin, V Peng, J Ngai, TP Speed. (2002). Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Research vol. 30 (4) pp. e15.
  2. ^ Dudoit, S, Yang, YH, Callow, MJ, Speed, TP. (2002). Statistical methods for identifying differentially expressed genes in replicated cDNA microarray experiments. Stat. Sin. 12:1 111-139

See also