Separate-channel analysis of two-channel microarrays: recovering inter-spot information
Open Access
- 26 May 2013
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 14 (1) , 1-15
- https://doi.org/10.1186/1471-2105-14-165
Abstract
Two-channel (or two-color) microarrays are cost-effective platforms for comparative analysis of gene expression. They are traditionally analysed in terms of the log-ratios (M-values) of the two channel intensities at each spot, but this analysis does not use all the information available in the separate channel observations. Mixed models have been proposed to analyse intensities from the two channels as separate observations, but such models can be complex to use and the gain in efficiency over the log-ratio analysis is difficult to quantify. Mixed models yield test statistics for the null distributions can be specified only approximately, and some approaches do not borrow strength between genes. This article reformulates the mixed model to clarify the relationship with the traditional log-ratio analysis, to facilitate information borrowing between genes, and to obtain an exact distributional theory for the resulting test statistics. The mixed model is transformed to operate on the M-values and A-values (average log-expression for each spot) instead of on the log-expression values. The log-ratio analysis is shown to ignore information contained in the A-values. The relative efficiency of the log-ratio analysis is shown to depend on the size of the intraspot correlation. A new separate channel analysis method is proposed that assumes a constant intra-spot correlation coefficient across all genes. This approach permits the mixed model to be transformed into an ordinary linear model, allowing the data analysis to use a well-understood empirical Bayes analysis pipeline for linear modeling of microarray data. This yields statistically powerful test statistics that have an exact distributional theory. The log-ratio, mixed model and common correlation methods are compared using three case studies. The results show that separate channel analyses that borrow strength between genes are more powerful than log-ratio analyses. The common correlation analysis is the most powerful of all. The common correlation method proposed in this article for separate-channel analysis of two-channel microarray data is no more difficult to apply in practice than the traditional log-ratio analysis. It provides an intuitive and powerful means to conduct analyses and make comparisons that might otherwise not be possible.Keywords
This publication has 37 references indexed in Scilit:
- Rootstock-regulated gene expression patterns associated with fire blight resistance in appleBMC Genomics, 2012
- Comparative transcriptomics among floral organs of the basal eudicot Eschscholzia californica as reference for floral evolutionary developmental studiesGenome Biology, 2010
- Microarray analysis reveals transcriptional plasticity in the reef building coral Acropora milleporaMolecular Ecology, 2009
- Comparison of small n statistical tests of differential expression applied to microarraysBMC Bioinformatics, 2009
- Microarray background correction: maximum likelihood estimation for the normal-exponential convolutionBiostatistics, 2008
- Significance testing for small microarray experimentsStatistics in Medicine, 2005
- Improved statistical tests for differential gene expression by shrinking variance components estimatesBiostatistics, 2004
- Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray ExperimentsStatistical Applications in Genetics and Molecular Biology, 2004
- Analysis of Variance for Gene Expression Microarray DataJournal of Computational Biology, 2000
- Data Analysis Using Stein's Estimator and its GeneralizationsJournal of the American Statistical Association, 1975