Discovering gene expression patterns in time course microarray experiments by ANOVA-SCA

TitleDiscovering gene expression patterns in time course microarray experiments by ANOVA-SCA
Publication TypeJournal Article
Year of Publication2007
AuthorsNueda, MJ, Conesa, A, Westerhuis, JA, Hoefsloot, HC, Smilde, AK, Talon, M, Ferrer, A
KeywordsAlgorithms *Analysis of Variance Computational Biology/*methods Computer Simulation Data Interpretation; Genetic; Genetic Models; Statistical Gene Expression Profiling/*methods Models; Statistical Oligonucleotide Array Sequence Analysis/*methods Principal Component Analysis Time Factors Transcription

MOTIVATION: Designed microarray experiments are used to investigate the effects that controlled experimental factors have on gene expression and learn about the transcriptional responses associated with external variables. In these datasets, signals of interest coexist with varying sources of unwanted noise in a framework of (co)relation among the measured variables and with the different levels of the studied factors. Discovering experimentally relevant transcriptional changes require methodologies that take all these elements into account. RESULTS: In this work, we develop the application of the Analysis of variance-simultaneous component analysis (ANOVA-SCA) Smilde et al. Bioinformatics, (2005) to the analysis of multiple series time course microarray data as an example of multifactorial gene expression profiling experiments. We denoted this implementation as ASCA-genes. We show how the combination of ANOVA-modeling and a dimension reduction technique is effective in extracting targeted signals from data by-passing structural noise. The methodology is valuable for identifying main and secondary responses associated with the experimental factors and spotting relevant experimental conditions. We additionally propose a novel approach for gene selection in the context of the relation of individual transcriptional patterns to global gene expression signals. We demonstrate the methodology on both real and synthetic datasets. AVAILABILITY: ASCA-genes has been implemented in the statistical language R and is available at SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Nueda, Maria Jose Conesa, Ana Westerhuis, Johan A Hoefsloot, Huub C J Smilde, Age K Talon, Manuel Ferrer, Alberto Research Support, Non-U.S. Gov’t England Bioinformatics (Oxford, England) Bioinformatics. 2007 Jul 15;23(14):1792-800. Epub 2007 May 22.