Evaluation of two public genome references for chinese hamster ovary cells in the context of rna-seq based gene expression analysis.
Biotechnol Bioeng
; 114(7): 1603-1613, 2017 07.
Article
em En
| MEDLINE
| ID: mdl-28295162
RNA-Seq is a powerful transcriptomics tool for mammalian cell culture process development. Successful RNA-Seq data analysis requires a high quality reference for read mapping and gene expression quantification. Currently, there are two public genome references for Chinese hamster ovary (CHO) cells, the predominant mammalian cell line in the biopharmaceutical industry. In this study, we compared these two references by analyzing 60 RNA-Seq samples from a variety of CHO cell culture conditions. Among the 20,891 common genes in both references, we observed that 31.5% have more than 7.1% quantification differences, implying gene definition differences in the two references. We propose a framework to quantify this difference using two metrics, Consistency and Stringency, which account for the average quantification difference between the two references over all samples, and the sample-specific effect on the quantification result, respectively. These two metrics can be used to identify potential genes for future gene model improvement and to understand the reliability of differentially expressed genes identified by RNA-Seq data analysis. Before a more comprehensive genome reference for CHO cells emerges, the strategy proposed in this study can enable more robust transcriptome analysis from CHO cell RNA-Seq data. Biotechnol. Bioeng. 2017;114: 1603-1613. © 2017 Wiley Periodicals, Inc.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Genoma Humano
/
Análise de Sequência de RNA
/
Perfilação da Expressão Gênica
/
Transcriptoma
Tipo de estudo:
Prognostic_studies
Limite:
Animals
/
Humans
Idioma:
En
Revista:
Biotechnol Bioeng
Ano de publicação:
2017
Tipo de documento:
Article
País de publicação:
Estados Unidos