Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments

Verfasser / Beitragende:
[Gordon K Smyth]
Ort, Verlag, Jahr:
2004
Enthalten in:
Statistical Applications in Genetics and Molecular Biology, 3/1(2004-02-12), 1-25
Format:
Artikel (online)
ID: 37892575X
LEADER caa a22 4500
001 37892575X
003 CHVBK
005 20180305123617.0
007 cr unu---uuuuu
008 161128e20040212xx s 000 0 eng
024 7 0 |a 10.2202/1544-6115.1027  |2 doi 
035 |a (NATIONALLICENCE)gruyter-10.2202/1544-6115.1027 
100 1 |a Smyth  |D Gordon K.  |u Walter and Eliza Hall Institute 
245 1 0 |a Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments  |h [Elektronische Daten]  |c [Gordon K Smyth] 
520 3 |a The problem of identifying differentially expressed genes in designed microarray experiments is considered. Lonnstedt and Speed (2002) derived an expression for the posterior odds of differential expression in a replicated two-color experiment using a simple hierarchical parametric model. The purpose of this paper is to develop the hierarchical model of Lonnstedt and Speed (2002) into a practical approach for general microarray experiments with arbitrary numbers of treatments and RNA samples. The model is reset in the context of general linear models with arbitrary coefficients and contrasts of interest. The approach applies equally well to both single channel and two color microarray experiments. Consistent, closed form estimators are derived for the hyperparameters in the model. The estimators proposed have robust behavior even for small numbers of arrays and allow for incomplete data arising from spot filtering or spot quality weights. The posterior odds statistic is reformulated in terms of a moderated t-statistic in which posterior residual standard deviations are used in place of ordinary standard deviations. The empirical Bayes approach is equivalent to shrinkage of the estimated sample variances towards a pooled estimate, resulting in far more stable inference when the number of arrays is small. The use of moderated t-statistics has the advantage over the posterior odds that the number of hyperparameters which need to estimated is reduced; in particular, knowledge of the non-null prior for the fold changes are not required. The moderated t-statistic is shown to follow a t-distribution with augmented degrees of freedom. The moderated t inferential approach extends to accommodate tests of composite null hypotheses through the use of moderated F-statistics. The performance of the methods is demonstrated in a simulation study. Results are presented for two publicly available data sets. 
540 |a ©2011 Walter de Gruyter GmbH & Co. KG, Berlin/Boston 
690 7 |a Microarrays  |2 nationallicence 
690 7 |a Statistical Theory and Methods  |2 nationallicence 
690 7 |a microarrays  |2 nationallicence 
690 7 |a empirical Bayes  |2 nationallicence 
690 7 |a linear models  |2 nationallicence 
690 7 |a hyperparameters  |2 nationallicence 
690 7 |a differential expression  |2 nationallicence 
773 0 |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-02-12), 1-25  |q 3:1<1  |1 2004  |2 3  |o sagmb 
856 4 0 |u https://doi.org/10.2202/1544-6115.1027  |q text/html  |z Onlinezugriff via DOI 
908 |D 1  |a research article  |2 jats 
950 |B NATIONALLICENCE  |P 856  |E 40  |u https://doi.org/10.2202/1544-6115.1027  |q text/html  |z Onlinezugriff via DOI 
950 |B NATIONALLICENCE  |P 100  |E 1-  |a Smyth  |D Gordon K.  |u Walter and Eliza Hall Institute 
950 |B NATIONALLICENCE  |P 773  |E 0-  |t Statistical Applications in Genetics and Molecular Biology  |d De Gruyter  |g 3/1(2004-02-12), 1-25  |q 3:1<1  |1 2004  |2 3  |o sagmb 
900 7 |b CC0  |u http://creativecommons.org/publicdomain/zero/1.0  |2 nationallicence 
898 |a BK010053  |b XK010053  |c XK010000 
949 |B NATIONALLICENCE  |F NATIONALLICENCE  |b NL-gruyter