Your browser doesn't support javascript.
loading
Alternative empirical Bayes models for adjusting for batch effects in genomic studies.
Zhang, Yuqing; Jenkins, David F; Manimaran, Solaiappan; Johnson, W Evan.
Afiliação
  • Zhang Y; Division of Computational Biomedicine, Boston University School of Medicine, 72 East Concord Street, Boston, 02118, MA, USA.
  • Jenkins DF; Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, 02215, MA, USA.
  • Manimaran S; Division of Computational Biomedicine, Boston University School of Medicine, 72 East Concord Street, Boston, 02118, MA, USA.
  • Johnson WE; Graduate Program in Bioinformatics, Boston University, 24 Cummington Mall, Boston, 02215, MA, USA.
BMC Bioinformatics ; 19(1): 262, 2018 07 13.
Article em En | MEDLINE | ID: mdl-30001694
ABSTRACT

BACKGROUND:

Combining genomic data sets from multiple studies is advantageous to increase statistical power in studies where logistical considerations restrict sample size or require the sequential generation of data. However, significant technical heterogeneity is commonly observed across multiple batches of data that are generated from different processing or reagent batches, experimenters, protocols, or profiling platforms. These so-called batch effects often confound true biological relationships in the data, reducing the power benefits of combining multiple batches, and may even lead to spurious results in some combined studies. Therefore there is significant need for effective methods and software tools that account for batch effects in high-throughput genomic studies.

RESULTS:

Here we contribute multiple methods and software tools for improved combination and analysis of data from multiple batches. In particular, we provide batch effect solutions for cases where the severity of the batch effects is not extreme, and for cases where one high-quality batch can serve as a reference, such as the training set in a biomarker study. We illustrate our approaches and software in both simulated and real data scenarios.

CONCLUSIONS:

We demonstrate the value of these new contributions compared to currently established approaches in the specified batch correction situations.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genômica Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genômica Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2018 Tipo de documento: Article