1. Home
  2. MAEBAYES procedure

MAEBAYES procedure

Modifies t-values by an empirical Bayes method (D.B. Baird).

Options

PRINT = string tokens What to print (estimates); default esti
PLOT = string tokens What to plot (phistograms, thistograms, pvalues, tvalues); default * i.e. nothing
DATATYPE = string token Type of data specified by the DATA parameter when it is a variate (means, tvalues); default tval
METHOD = string token Type of test to use to form probability values (twosided, greaterthan, lessthan); default twos
DEVICE = scalar Device number on which to plot the graphs
GRAPHICSFILE = text What graphics filename template to use to save the graphs; default *

Parameters

DATA = pointers or variates Pointers of variates or variates of means or t-values to be summarized
SD = variates Supplies standard deviations of the data when DATA is a variate of means or t-values
DF = variates or scalars Supplies degrees of freedom when DATA is a variate of means or t-values
SD0 = scalars Saves the estimated prior standard deviation
DF0 = scalars Saves the estimated number of degrees of freedom assigned to the prior standard deviation
TMODIFIED = variates Saves the modified t-values
SDMODIFIED = variates Saves the shrunken SD values
PMODIFIED = variates Saves the modified probability values

Description

In a microarray experiment, as hundreds and often thousands of probes are being processed in parallel, there is a loss of power if you consider the variation of each probe in isolation. If this parallelism is used between the genes to gain extra information on the variation of an individual probe, then more powerful tests of the level of differential expression of a probe can be obtained. To do this, a prior distribution of the standard deviations (or equivalently the variances) over the probes is assumed. In particular, it is assumed that the reciprocal of the variance, sp2, of each probe is distributed as a multiple of a chi-square distribution with d0 degrees of

freedom, i.e. 1/sp2 is distributed as 1/(d0 × s02) × Chisquare(d0).

If the parameters of this distribution, the prior degrees of freedom d0 and standard deviation s0 are estimated, more information can be gained on an individual probe, by shrinking it towards the prior by an amount that depends on the amount of information in the standard deviation sp of the probe (in this case its degrees of freedom dp). The modified standard deviation s~p is then given by:

s~p = √((d02 × s02 + dp2 × sp2) / (d0 + dp))

A modified t-test can then be performed using the modified standard deviation with d0 + dp degrees of freedom. The method can also produce the probability values for tests that the differential expression differs from zero. The METHOD option selects the type of test i.e. two-sided, or for values greater than or less than zero (the default is two-sided).

The DATA parameter can supply a pointer containing one variate per slide, with the probes in the same position within each variate. The means and standard deviations are then be calculated from the raw data. Alternatively, DATA can supply a variate containing means or t-values for each probe. The DATATYPE option should then indicate which of these has been given, the SD parameter should supply a variate containing the standard deviations for each probe, and the DF parameter should supply a variate with the numbers of degrees of freedom.

The estimated prior number of degrees of freedom d0 and standard deviation s0 can be saved, in scalars, by the D0 and S0 parameters. The TMODIFIED parameter can supply a variate to save the modified t-values, the SDMODIFIED parameter can save the shrunken SD values, and the PMODIFIED parameter can save the modified probability values.

By default, the estimates are printed, but this can be suppressed by setting option PRINT=*. The PLOT option controls what plots are produced, with settings:

    phistograms two histograms showing the modified and raw probabilities plotted on the same scale;
    thistograms two histograms showing the modified and raw t-values plotted on the same scale;
    pvalues a scatter plot of modified versus raw probabilities; and
    tvalues a scatter plot of modified versus t-values.

By default, nothing is plotted. You can use the DEVICE option to plot to a device other than the screen. The GRAPHICSFILE specifies then supplies a template for the file names.

Options: PRINT, PLOT, DATATYPE, METHOD, DEVICE, GRAPHICSFILE.

Parameters: DATA, SD, DF, SD0, DF0, TMODIFIED, SDMODIFIED, PMODIFIED.

Reference

Smyth, G.K. (2004). Linear models and empirical Bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3, No. 1, Article 3.

See also

Procedures: AFFYMETRIX, FDRBONFERRONI, FDRMIXTURE, MAANOVA, MABGCORRECT, MAREGRESSION, MARMA, MAROBUSTMEANS, MAVDIFFERENCE, MAVOLCANO, QNORMALIZE.

Commands for: Microarray data.

Example

CAPTION    'MAEBAYES example'; STYLE=meta
ENQUIRE    CHANNEL=-1; EXIST=check; NAME=\
           '%GENDIR%/Data/Microarrays/ApoAIKnockOutEffects.GSH'
IF check
  SPLOAD   '%GENDIR%/Data/Microarrays/ApoAIKnockOutEffects.GSH'
  MAEBAYES [PRINT=estimates; DATATYPE=means; PLOT=phistogram,tvalues;\ 
           METHOD=twosided] DATA=Cont[1]; SD=Res_SD; DF=DF;\ 
           TMODIFIED=ModTCont; SDMODIFIED=Mod_SD; PMODIFIED=Mod_PrCont
ELSE
  CAPTION  'Microarray example datasets have not been installed.'
ENDIF
Updated on March 7, 2019

Was this article helpful?