Your browser doesn't support javascript.
loading
Batch-normalization of cerebellar and medulloblastoma gene expression datasets utilizing empirically defined negative control genes.
Weishaupt, Holger; Johansson, Patrik; Sundström, Anders; Lubovac-Pilav, Zelmina; Olsson, Björn; Nelander, Sven; Swartling, Fredrik J.
Afiliação
  • Weishaupt H; Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden.
  • Johansson P; Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden.
  • Sundström A; Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden.
  • Lubovac-Pilav Z; Division for Biology and Bioinformatics, School of Bioscience, The Systems Biology Research Centre, University of Skövde, Skövde, Sweden.
  • Olsson B; Division for Biology and Bioinformatics, School of Bioscience, The Systems Biology Research Centre, University of Skövde, Skövde, Sweden.
  • Nelander S; Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden.
  • Swartling FJ; Department of Immunology, Genetics and Pathology, Science for Life Laboratory, Rudbeck Laboratory, Uppsala University, Uppsala, Sweden.
Bioinformatics ; 35(18): 3357-3364, 2019 09 15.
Article em En | MEDLINE | ID: mdl-30715209
ABSTRACT
MOTIVATION Medulloblastoma (MB) is a brain cancer predominantly arising in children. Roughly 70% of patients are cured today, but survivors often suffer from severe sequelae. MB has been extensively studied by molecular profiling, but often in small and scattered cohorts. To improve cure rates and reduce treatment side effects, accurate integration of such data to increase analytical power will be important, if not essential.

RESULTS:

We have integrated 23 transcription datasets, spanning 1350 MB and 291 normal brain samples. To remove batch effects, we combined the Removal of Unwanted Variation (RUV) method with a novel pipeline for determining empirical negative control genes and a panel of metrics to evaluate normalization performance. The documented approach enabled the removal of a majority of batch effects, producing a large-scale, integrative dataset of MB and cerebellar expression data. The proposed strategy will be broadly applicable for accurate integration of data and incorporation of normal reference samples for studies of various diseases. We hope that the integrated dataset will improve current research in the field of MB by allowing more large-scale gene expression analyses. AVAILABILITY AND IMPLEMENTATION The RUV-normalized expression data is available through the Gene Expression Omnibus (GEO; https//www.ncbi.nlm.nih.gov/geo/) and can be accessed via the GSE series number GSE124814. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Cerebelares / Meduloblastoma Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Cerebelares / Meduloblastoma Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article