PRODUCTION OF A PRELIMINARY QUALITY CONTROL PIPELINE FOR SINGLE NUCLEI RNA-SEQ AND ITS APPLICATION IN THE ANALYSIS OF CELL TYPE DIVERSITY OF POST-MORTEM HUMAN BRAIN NEOCORTEX.

Aevermann, Brian; McCorrison, Jamison; Venepally, Pratap; Hodge, Rebecca; Bakken, Trygve; Miller, Jeremy; Novotny, Mark; Tran, Danny N; Diezfuertes, Francisco; Christiansen, Lena; Zhang, Fan; Steemers, Frank; Lasken, Roger S; Lein, E D; Schork, Nicholas; Scheuermann, Richard H.

Pac Symp Biocomput ; 22: 564-575, 2017.

Article in English | MEDLINE | ID: mdl-27897007

ABSTRACT

Next-generation sequencing of the RNA content of single cells or single nuclei (sc/nRNA-seq) has become a powerful approach to understanding the cellular complexity and diversity of multicellular organisms and environmental ecosystems. However, the procedure begins with a relatively small amount of starting material, which pushes the limits of the laboratory procedures required. Careful sample quality control (QC) is therefore essential to reduce the impact of technical noise and sample bias on downstream analysis applications. Here we present a preliminary framework for sample-level quality control based on the collection of a series of quantitative laboratory and data metrics, which are used as features for the construction of QC classification models using random forest machine learning approaches. We applied this initial framework to a dataset comprising 2272 single nuclei RNA-seq results and determined that ~79% of samples were of high quality. Removing the poor-quality samples from downstream analysis was found to improve the cell type clustering results. In addition, this approach identified quantitative features related to the proportion of unique or duplicate reads and the proportion of reads remaining after quality trimming as useful features for pass/fail classification. The construction and use of classification models for the identification of poor-quality samples provides an objective and scalable approach to sc/nRNA-seq quality control.
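
The pass/fail classification idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it uses a toy ensemble of bootstrapped decision stumps as a heavily simplified stand-in for a random forest, and it runs on synthetic data. The feature names, the distributions, the assumption that higher values indicate better quality, and all function names are hypothetical and chosen only for illustration.

```python
import random
from collections import Counter

# Hypothetical per-sample QC features, loosely modelled on the two feature
# families named in the abstract (illustrative names, not the paper's).
FEATURES = ["unique_read_fraction", "post_trim_read_fraction"]


def make_synthetic_samples(n, rng):
    """Generate toy (features, passed) pairs; nothing here is real data."""
    samples = []
    for _ in range(n):
        passed = rng.random() < 0.79  # toy class balance only
        centre = 0.8 if passed else 0.4  # assumed: passing samples score higher
        row = [min(1.0, max(0.0, rng.gauss(centre, 0.08))) for _ in FEATURES]
        samples.append((row, passed))
    return samples


def fit_stump(data, feature):
    """Pick the threshold on one feature that minimises training errors.

    The stump predicts "pass" when the feature value is >= the threshold.
    """
    best_threshold, best_errors = 0.0, len(data) + 1
    for row, _ in data:
        threshold = row[feature]
        errors = sum((r[feature] >= threshold) != label for r, label in data)
        if errors < best_errors:
            best_threshold, best_errors = threshold, errors
    return feature, best_threshold


def fit_forest(data, n_trees, rng):
    """Train each stump on a bootstrap resample and one randomly chosen feature."""
    forest = []
    for _ in range(n_trees):
        bootstrap = [rng.choice(data) for _ in data]
        feature = rng.randrange(len(FEATURES))
        forest.append(fit_stump(bootstrap, feature))
    return forest


def predict(forest, row):
    """Majority vote over all stumps; True means the sample passes QC."""
    votes = Counter(row[feature] >= threshold for feature, threshold in forest)
    return votes.most_common(1)[0][0]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
train = make_synthetic_samples(200, rng)
held_out = make_synthetic_samples(100, rng)
forest = fit_forest(train, n_trees=25, rng=rng)  # odd count avoids tied votes
accuracy = sum(predict(forest, row) == label for row, label in held_out) / len(held_out)
print(f"held-out accuracy on synthetic data: {accuracy:.2f}")
```

In practice a full random forest implementation with deeper trees and feature-importance estimates would be the natural choice; the stumps here only show how bootstrap resampling, random feature selection, and majority voting combine into a pass/fail decision.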

Subjects

High-Throughput Nucleotide Sequencing/statistics & numerical data; Neocortex/cytology; Neocortex/metabolism; RNA, Nuclear/genetics; Sequence Analysis, RNA/statistics & numerical data; Autopsy; Bias; Cell Nucleus/genetics; Computational Biology; Databases, Nucleic Acid; Decision Trees; High-Throughput Nucleotide Sequencing/standards; Humans; Machine Learning; Quality Control; Sequence Analysis, RNA/standards; Single-Cell Analysis; Software
