A statistical framework for integrating two microarray data sets in differential expression analysis

Yinglei Lai, Sarah E. Eckenrode, Jin Xiong She

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

Background: Different microarray data sets can be collected for studying the same or similar diseases. We expect to achieve a more efficient analysis of differential expression if an efficient statistical method can be developed for integrating different microarray data sets. Although many statistical methods have been proposed for data integration, the genome-wide concordance of different data sets has not been well considered in the analysis. Results: Before considering data integration, it is necessary to evaluate the genome-wide concordance so that misleading results can be avoided. Based on the test results, different subsequent actions are suggested. The evaluation of genome-wide concordance and the data integration can be achieved based on the normal distribution based mixture models. Conclusion: The results from our simulation study suggest that misleading results can be generated if the genome-wide concordance issue is not appropriately considered. Our method provides a rigorous parametric solution. The results also show that our method is robust to certain model misspecification and is practically useful for the integrative analysis of differential expression.

Original languageEnglish (US)
Article numberS23
JournalBMC Bioinformatics
Volume10
Issue numberSUPPL. 1
DOIs
StatePublished - Jan 30 2009

ASJC Scopus subject areas

  • Structural Biology
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'A statistical framework for integrating two microarray data sets in differential expression analysis'. Together they form a unique fingerprint.

Cite this