Your browser doesn't support javascript.
loading
Effects of subsampling on characteristics of RNA-seq data from triple-negative breast cancer patients.
Stupnikov, Alexey; Glazko, Galina V; Emmert-Streib, Frank.
Afiliación
  • Stupnikov A; Computational Biology and Machine Learning Laboratory, Faculty of Medicine, Health and Life Sciences, School of Medicine, Dentistry and Biomedical Sciences, Center for Cancer Research and Cell Biology, Queen's University Belfast, 97 Lisburn Road, Belfast, BT9 7JL, UK. astupnikov01@qub.ac.uk.
  • Glazko GV; Division of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA. gvglazko@uams.edu.
  • Emmert-Streib F; Computational Biology and Machine Learning Laboratory, Faculty of Medicine, Health and Life Sciences, School of Medicine, Dentistry and Biomedical Sciences, Center for Cancer Research and Cell Biology, Queen's University Belfast, 97 Lisburn Road, Belfast, BT9 7JL, UK. v@bio-complexity.com.
Chin J Cancer ; 34(10): 427-38, 2015 Aug 08.
Article en En | MEDLINE | ID: mdl-26253000
BACKGROUND: Data from RNA-seq experiments provide a wealth of information about the transcriptome of an organism. However, the analysis of such data is very demanding. In this study, we aimed to establish robust analysis procedures that can be used in clinical practice. METHODS: We studied RNA-seq data from triple-negative breast cancer patients. Specifically, we investigated the subsampling of RNA-seq data. RESULTS: The main results of our investigations are as follows: (1) the subsampling of RNA-seq data gave biologically realistic simulations of sequencing experiments with smaller sequencing depth but not direct scaling of count matrices; (2) the saturation of results required an average sequencing depth larger than 32 million reads and an individual sequencing depth larger than 46 million reads; and (3) for an abrogated feature selection, higher moments of the distribution of all expressed genes had a higher sensitivity for signal detection than the corresponding mean values. CONCLUSIONS: Our results reveal important characteristics of RNA-seq data that must be understood before one can apply such an approach to translational medicine.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN / Perfilación de la Expresión Génica / Neoplasias de la Mama Triple Negativas Límite: Humans Idioma: En Revista: Chin J Cancer Asunto de la revista: NEOPLASIAS Año: 2015 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: ARN / Perfilación de la Expresión Génica / Neoplasias de la Mama Triple Negativas Límite: Humans Idioma: En Revista: Chin J Cancer Asunto de la revista: NEOPLASIAS Año: 2015 Tipo del documento: Article