Pesquisa | BVS Aleitamento Materno

1.

Live-seq enables temporal transcriptomic recording of single cells.

Chen, Wanze; Guillaume-Gentil, Orane; Rainer, Pernille Yde; Gäbelein, Christoph G; Saelens, Wouter; Gardeux, Vincent; Klaeger, Amanda; Dainese, Riccardo; Zachara, Magda; Zambelli, Tomaso; Vorholt, Julia A; Deplancke, Bart.

Nature ; 608(7924): 733-740, 2022 08.

Artigo em Inglês | MEDLINE | ID: mdl-35978187

RESUMO

Single-cell transcriptomics (scRNA-seq) has greatly advanced our ability to characterize cellular heterogeneity1. However, scRNA-seq requires lysing cells, which impedes further molecular or functional analyses on the same cells. Here, we established Live-seq, a single-cell transcriptome profiling approach that preserves cell viability during RNA extraction using fluidic force microscopy2,3, thus allowing to couple a cell's ground-state transcriptome to its downstream molecular or phenotypic behaviour. To benchmark Live-seq, we used cell growth, functional responses and whole-cell transcriptome read-outs to demonstrate that Live-seq can accurately stratify diverse cell types and states without inducing major cellular perturbations. As a proof of concept, we show that Live-seq can be used to directly map a cell's trajectory by sequentially profiling the transcriptomes of individual macrophages before and after lipopolysaccharide (LPS) stimulation, and of adipose stromal cells pre- and post-differentiation. In addition, we demonstrate that Live-seq can function as a transcriptomic recorder by preregistering the transcriptomes of individual macrophages that were subsequently monitored by time-lapse imaging after LPS exposure. This enabled the unsupervised, genome-wide ranking of genes on the basis of their ability to affect macrophage LPS response heterogeneity, revealing basal Nfkbia expression level and cell cycle state as important phenotypic determinants, which we experimentally validated. Thus, Live-seq can address a broad range of biological questions by transforming scRNA-seq from an end-point to a temporal analysis approach.

Assuntos

Sobrevivência Celular , Perfilação da Expressão Gênica , Macrófagos , RNA-Seq , Análise de Célula Única , Transcriptoma , Tecido Adiposo/citologia , Ciclo Celular/efeitos dos fármacos , Ciclo Celular/genética , Diferenciação Celular , Perfilação da Expressão Gênica/métodos , Perfilação da Expressão Gênica/normas , Genoma/efeitos dos fármacos , Genoma/genética , Lipopolissacarídeos/imunologia , Lipopolissacarídeos/farmacologia , Macrófagos/citologia , Macrófagos/efeitos dos fármacos , Macrófagos/imunologia , Macrófagos/metabolismo , Inibidor de NF-kappaB alfa/genética , Especificidade de Órgãos , Fenótipo , RNA/genética , RNA/isolamento & purificação , RNA-Seq/métodos , RNA-Seq/normas , Reprodutibilidade dos Testes , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas , Análise de Célula Única/métodos , Células Estromais/citologia , Células Estromais/metabolismo , Fatores de Tempo , Transcriptoma/genética

2.

scMRMA: single cell multiresolution marker-based annotation.

Li, Jia; Sheng, Quanhu; Shyr, Yu; Liu, Qi.

Nucleic Acids Res ; 50(2): e7, 2022 01 25.

Artigo em Inglês | MEDLINE | ID: mdl-34648021

RESUMO

Single-cell RNA sequencing has become a powerful tool for identifying and characterizing cellular heterogeneity. One essential step to understanding cellular heterogeneity is determining cell identities. The widely used strategy predicts identities by projecting cells or cell clusters unidirectionally against a reference to find the best match. Here, we develop a bidirectional method, scMRMA, where a hierarchical reference guides iterative clustering and deep annotation with enhanced resolutions. Taking full advantage of the reference, scMRMA greatly improves the annotation accuracy. scMRMA achieved better performance than existing methods in four benchmark datasets and successfully revealed the expansion of CD8 T cell populations in squamous cell carcinoma after anti-PD-1 treatment.

Assuntos

Biomarcadores , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única , Software , Algoritmos , Análise por Conglomerados , Biologia Computacional/normas , Bases de Dados Genéticas , Perfilação da Expressão Gênica/normas , Humanos , Anotação de Sequência Molecular , Reprodutibilidade dos Testes , Análise de Sequência de RNA/normas , Análise de Célula Única/métodos

3.

Quo vadis microRNAs?

Fromm, Bastian; Keller, Andreas; Yang, Xiaozeng; Friedlander, Marc R; Peterson, Kevin J; Griffiths-Jones, Sam.

Trends Genet ; 36(7): 461-463, 2020 07.

Artigo em Inglês | MEDLINE | ID: mdl-32544447

RESUMO

Since 2002, published miRNAs have been collected and named by the online repository miRBase. However, with 11 000 annual publications this has become challenging. Recently, four specialized miRNA databases were published, addressing particular needs for diverse scientific communities. This development provides major opportunities for the future of miRNA annotation and nomenclature.

Assuntos

Bases de Dados de Ácidos Nucleicos , Regulação da Expressão Gênica , MicroRNAs/genética , Anotação de Sequência Molecular/normas , Análise de Sequência de RNA/normas , Software , Genômica , Humanos

4.

TERA-Seq: true end-to-end sequencing of native RNA molecules for transcriptome characterization.

Ibrahim, Fadia; Oppelt, Jan; Maragkakis, Manolis; Mourelatos, Zissimos.

Nucleic Acids Res ; 49(20): e115, 2021 11 18.

Artigo em Inglês | MEDLINE | ID: mdl-34428294

RESUMO

Direct sequencing of single, native RNA molecules through nanopores has a strong potential to transform research in all aspects of RNA biology and clinical diagnostics. The existing platform from Oxford Nanopore Technologies is unable to sequence the very 5' ends of RNAs and is limited to polyadenylated molecules. Here, we develop True End-to-end RNA Sequencing (TERA-Seq), a platform that addresses these limitations, permitting more thorough transcriptome characterization. TERA-Seq describes both poly- and non-polyadenylated RNA molecules and accurately identifies their native 5' and 3' ends by ligating uniquely designed adapters that are sequenced along with the transcript. We find that capped, full-length mRNAs in human cells show marked variation of poly(A) tail lengths at the single molecule level. We report prevalent capping downstream of canonical transcriptional start sites in otherwise fully spliced and polyadenylated molecules. We reveal RNA processing and decay at single molecule level and find that mRNAs decay cotranslationally, often from their 5' ends, while frequently retaining poly(A) tails. TERA-Seq will prove useful in many applications where true end-to-end direct sequencing of single, native RNA molecules and their isoforms is desirable.

Assuntos

RNA Mensageiro/genética , Análise de Sequência de RNA/métodos , Transcriptoma , Células HeLa , Humanos , Poliadenilação , Splicing de RNA , RNA Mensageiro/química , RNA Mensageiro/metabolismo , Análise de Sequência de RNA/normas

5.

Data denoising with transfer learning in single-cell transcriptomics.

Wang, Jingshu; Agarwal, Divyansh; Huang, Mo; Hu, Gang; Zhou, Zilu; Ye, Chengzhong; Zhang, Nancy R.

Nat Methods ; 16(9): 875-878, 2019 09.

Artigo em Inglês | MEDLINE | ID: mdl-31471617

RESUMO

Single-cell RNA sequencing (scRNA-seq) data are noisy and sparse. Here, we show that transfer learning across datasets remarkably improves data quality. By coupling a deep autoencoder with a Bayesian model, SAVER-X extracts transferable gene-gene relationships across data from different labs, varying conditions and divergent species, to denoise new target datasets.

Assuntos

Neoplasias da Mama/metabolismo , Biologia Computacional/métodos , Leucócitos Mononucleares/metabolismo , Análise de Sequência de RNA/normas , Análise de Célula Única/métodos , Linfócitos T/metabolismo , Transcriptoma , Animais , Teorema de Bayes , Feminino , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Camundongos , Análise de Sequência de RNA/métodos

6.

Recurrent functional misinterpretation of RNA-seq data caused by sample-specific gene length bias.

Mandelboum, Shir; Manber, Zohar; Elroy-Stein, Orna; Elkon, Ran.

PLoS Biol ; 17(11): e3000481, 2019 11.

Artigo em Inglês | MEDLINE | ID: mdl-31714939

RESUMO

Data normalization is a critical step in RNA sequencing (RNA-seq) analysis, aiming to remove systematic effects from the data to ensure that technical biases have minimal impact on the results. Analyzing numerous RNA-seq datasets, we detected a prevalent sample-specific length effect that leads to a strong association between gene length and fold-change estimates between samples. This stochastic sample-specific effect is not corrected by common normalization methods, including reads per kilobase of transcript length per million reads (RPKM), Trimmed Mean of M values (TMM), relative log expression (RLE), and quantile and upper-quartile normalization. Importantly, we demonstrate that this bias causes recurrent false positive calls by gene-set enrichment analysis (GSEA) methods, thereby leading to frequent functional misinterpretation of the data. Gene sets characterized by markedly short genes (e.g., ribosomal protein genes) or long genes (e.g., extracellular matrix genes) are particularly prone to such false calls. This sample-specific length bias is effectively removed by the conditional quantile normalization (cqn) and EDASeq methods, which allow the integration of gene length as a sample-specific covariate. Consequently, using these normalization methods led to substantial reduction in GSEA false results while retaining true ones. In addition, we found that application of gene-set tests that take into account gene-gene correlations attenuates false positive rates caused by the length bias, but statistical power is reduced as well. Our results advocate the inspection and correction of sample-specific length biases as default steps in RNA-seq analysis pipelines and reiterate the need to account for intergene correlations when performing gene-set enrichment tests to lessen false interpretation of transcriptomic data.

Assuntos

RNA/química , Análise de Sequência de RNA/normas , Animais , Viés , Bases de Dados Genéticas , Conjuntos de Dados como Assunto , Humanos , Camundongos , Transcriptoma

7.

mirnaQC: a webserver for comparative quality control of miRNA-seq data.

Aparicio-Puerta, Ernesto; Gómez-Martín, Cristina; Giannoukakos, Stavros; Medina, José María; Marchal, Juan Antonio; Hackenberg, Michael.

Nucleic Acids Res ; 48(W1): W262-W267, 2020 07 02.

Artigo em Inglês | MEDLINE | ID: mdl-32484556

RESUMO

Although miRNA-seq is extensively used in many different fields, its quality control is frequently restricted to a PhredScore-based filter. Other important quality related aspects like microRNA yield, the fraction of putative degradation products (such as rRNA fragments) or the percentage of adapter-dimers are hard to assess using absolute thresholds. Here we present mirnaQC, a webserver that relies on 34 quality parameters to assist in miRNA-seq quality control. To improve their interpretability, quality attributes are ranked using a reference distribution obtained from over 36 000 publicly available miRNA-seq datasets. Accepted input formats include FASTQ and SRA accessions. The results page contains several sections that deal with putative technical artefacts related to library preparation, sequencing, contamination or yield. Different visualisations, including PCA and heatmaps, are available to help users identify underlying issues. Finally, we show the usefulness of this approach by analysing two publicly available datasets and discussing the different quality issues that can be detected using mirnaQC.

Assuntos

MicroRNAs/química , Análise de Sequência de RNA/normas , Software , Artefatos , Feminino , Humanos , MicroRNAs/metabolismo , Controle de Qualidade , Neoplasias do Colo do Útero/genética

8.

High resolution annotation of zebrafish transcriptome using long-read sequencing.

Nudelman, German; Frasca, Antonio; Kent, Brandon; Sadler, Kirsten C; Sealfon, Stuart C; Walsh, Martin J; Zaslavsky, Elena.

Genome Res ; 28(9): 1415-1425, 2018 09.

Artigo em Inglês | MEDLINE | ID: mdl-30061115

RESUMO

With the emergence of zebrafish as an important model organism, a concerted effort has been made to study its transcriptome. This effort is limited, however, by gaps in zebrafish annotation, which are especially pronounced concerning transcripts dynamically expressed during zygotic genome activation (ZGA). To date, short-read sequencing has been the principal technology for zebrafish transcriptome annotation. In part because these sequence reads are too short for assembly methods to resolve the full complexity of the transcriptome, the current annotation is rudimentary. By providing direct observation of full-length transcripts, recently refined long-read sequencing platforms can dramatically improve annotation coverage and accuracy. Here, we leveraged the SMRT platform to study the transcriptome of zebrafish embryos before and after ZGA. Our analysis revealed additional novelty and complexity in the zebrafish transcriptome, identifying 2539 high-confidence novel transcripts that originated from previously unannotated loci and 1835 high-confidence new isoforms in previously annotated genes. We validated these findings using a suite of computational approaches including structural prediction, sequence homology, and functional conservation analyses, as well as by confirmatory transcript quantification with short-read sequencing data. Our analyses provided insight into new homologs and paralogs of functionally important proteins and noncoding RNAs, isoform switching occurrences, and different classes of novel splicing events. Several novel isoforms representing distinct splicing events were validated through PCR experiments, including the discovery and validation of a novel 8-kb transcript spanning multiple mir-430 elements, an important driver of early development. Our study provides a significantly improved zebrafish transcriptome annotation resource.

Assuntos

Anotação de Sequência Molecular , Transcriptoma , Peixe-Zebra/genética , Animais , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas , Homologia de Sequência do Ácido Nucleico

9.

Bias, robustness and scalability in single-cell differential expression analysis.

Soneson, Charlotte; Robinson, Mark D.

Nat Methods ; 15(4): 255-261, 2018 04.

Artigo em Inglês | MEDLINE | ID: mdl-29481549

RESUMO

Many methods have been used to determine differential gene expression from single-cell RNA (scRNA)-seq data. We evaluated 36 approaches using experimental and synthetic data and found considerable differences in the number and characteristics of the genes that are called differentially expressed. Prefiltering of lowly expressed genes has important effects, particularly for some of the methods developed for bulk RNA-seq data analysis. However, we found that bulk RNA-seq analysis methods do not generally perform worse than those developed specifically for scRNA-seq. We also present conquer, a repository of consistently processed, analysis-ready public scRNA-seq data sets that is aimed at simplifying method evaluation and reanalysis of published results. Each data set provides abundance estimates for both genes and transcripts, as well as quality control and exploratory analysis reports.

Assuntos

Biologia Computacional/métodos , Regulação da Expressão Gênica/fisiologia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Perfilação da Expressão Gênica/métodos , RNA/genética , Análise de Sequência de RNA/normas , Software

10.

A general LC-MS-based RNA sequencing method for direct analysis of multiple-base modifications in RNA mixtures.

Zhang, Ning; Shi, Shundi; Jia, Tony Z; Ziegler, Ashley; Yoo, Barney; Yuan, Xiaohong; Li, Wenjia; Zhang, Shenglong.

Nucleic Acids Res ; 47(20): e125, 2019 11 18.

Artigo em Inglês | MEDLINE | ID: mdl-31504795

RESUMO

A complete understanding of the structural and functional potential of RNA requires understanding of chemical modifications and non-canonical bases; this in turn requires advances in current sequencing methods to be able to sequence not only canonical ribonucleotides, but at the same time directly sequence these non-standard moieties. Here, we present the first direct and modification type-independent RNA sequencing method via introduction of a 2-dimensional hydrophobic end-labeling strategy into traditional mass spectrometry-based sequencing (2D HELS MS Seq) to allow de novo sequencing of RNA mixtures and enhance sample usage efficiency. Our method can directly read out the complete sequence, while identifying, locating, and quantifying base modifications accurately in both single and mixed RNA samples containing multiple different modifications at single-base resolution. Our method can also quantify stoichiometry/percentage of modified RNA versus its canonical counterpart RNA, simulating a real biological sample where modifications exist but may not be 100% at a particular site in the RNA. This method is a critical step towards fully sequencing real complex cellular RNA samples of any type and containing any modification type and can also be used in the quality control of modified therapeutic RNAs.

Assuntos

Espectrometria de Massas/métodos , Processamento Pós-Transcricional do RNA , RNA/química , Análise de Sequência de RNA/métodos , Animais , Humanos , Espectrometria de Massas/normas , RNA/genética , RNA/metabolismo , Sensibilidade e Especificidade , Análise de Sequência de RNA/normas

11.

iSeqQC: a tool for expression-based quality control in RNA sequencing.

Kumar, Gaurav; Ertel, Adam; Feldman, George; Kupper, Joan; Fortina, Paolo.

BMC Bioinformatics ; 21(1): 56, 2020 Feb 13.

Artigo em Inglês | MEDLINE | ID: mdl-32054449

RESUMO

BACKGROUND: Quality Control in any high-throughput sequencing technology is a critical step, which if overlooked can compromise an experiment and the resulting conclusions. A number of methods exist to identify biases during sequencing or alignment, yet not many tools exist to interpret biases due to outliers. RESULTS: Hence, we developed iSeqQC, an expression-based QC tool that detects outliers either produced due to variable laboratory conditions or due to dissimilarity within a phenotypic group. iSeqQC implements various statistical approaches including unsupervised clustering, agglomerative hierarchical clustering and correlation coefficients to provide insight into outliers. It can be utilized through command-line (Github: https://github.com/gkumar09/iSeqQC) or web-interface (http://cancerwebpa.jefferson.edu/iSeqQC). A local shiny installation can also be obtained from github (https://github.com/gkumar09/iSeqQC). CONCLUSION: iSeqQC is a fast, light-weight, expression-based QC tool that detects outliers by implementing various statistical approaches.

Assuntos

Perfilação da Expressão Gênica/normas , Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de RNA/normas , Software , Análise por Conglomerados , Humanos , Controle de Qualidade

12.

Dual indexed library design enables compatibility of in-Drop single-cell RNA-sequencing with exAMP chemistry sequencing platforms.

Southard-Smith, Austin N; Simmons, Alan J; Chen, Bob; Jones, Angela L; Ramirez Solano, Marisol A; Vega, Paige N; Scurrah, Cherie' R; Zhao, Yue; Brenan, Michael J; Xuan, Jiekun; Shrubsole, Martha J; Porter, Ely B; Chen, Xi; Brenan, Colin J H; Liu, Qi; Quigley, Lauren N M; Lau, Ken S.

BMC Genomics ; 21(1): 456, 2020 Jul 02.

Artigo em Inglês | MEDLINE | ID: mdl-32616006

RESUMO

BACKGROUND: The increasing demand of single-cell RNA-sequencing (scRNA-seq) experiments, such as the number of experiments and cells queried per experiment, necessitates higher sequencing depth coupled to high data quality. New high-throughput sequencers, such as the Illumina NovaSeq 6000, enables this demand to be filled in a cost-effective manner. However, current scRNA-seq library designs present compatibility challenges with newer sequencing technologies, such as index-hopping, and their ability to generate high quality data has yet to be systematically evaluated. RESULTS: Here, we engineered a dual-indexed library structure, called TruDrop, on top of the inDrop scRNA-seq platform to solve these compatibility challenges, such that TruDrop libraries and standard Illumina libraries can be sequenced alongside each other on the NovaSeq. On scRNA-seq libraries, we implemented a previously-documented countermeasure to the well-described problem of index-hopping, demonstrated significant improvements in base-calling accuracy on the NovaSeq, and provided an example of multiplexing twenty-four scRNA-seq libraries simultaneously. We showed favorable comparisons in transcriptional diversity of TruDrop compared with prior inDrop libraries. CONCLUSIONS: Our approach enables cost-effective, high throughput generation of sequencing data with high quality, which should enable more routine use of scRNA-seq technologies.

Assuntos

Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Animais , Humanos , Camundongos , Alinhamento de Sequência , Análise de Sequência de RNA/normas , Análise de Célula Única/normas

13.

Development of a relevant strategy using de novo transcriptome assembly method for transcriptome comparisons between Muscovy and common duck species and their reciprocal inter-specific mule and hinny hybrids fed ad libitum and overfed.

Liu, Xi; Hérault, Frédéric; Diot, Christian; Corre, Erwan.

BMC Genomics ; 21(1): 687, 2020 Oct 02.

Artigo em Inglês | MEDLINE | ID: mdl-33008290

RESUMO

BACKGROUND: Common Pekin and Muscovy ducks and their intergeneric hinny and mule hybrids have different abilities for fatty liver production. RNA-Seq analyses from the liver of these different genetic types fed ad libitum or overfed would help to identify genes with different response to overfeeding between them. However RNA-seq analyses from different species and comparison is challenging. The goal of this study was develop a relevant strategy for transcriptome analysis and comparison between different species. RESULTS: Transcriptomes were first assembled with a reference-based approach. Important mapping biases were observed when heterologous mapping were conducted on common duck reference genome, suggesting that this reference-based strategy was not suited to compare the four different genetic types. De novo transcriptome assemblies were then performed using Trinity and Oases. Assemblies of transcriptomes were not relevant when more than a single genetic type was considered. Finally, single genetic type transcriptomes were assembled with DRAP in a mega-transcriptome. No bias was observed when reads from the different genetic types were mapped on this mega-transcriptome and differences in gene expression between the four genetic types could be identified. CONCLUSIONS: Analyses using both reference-based and de novo transcriptome assemblies point out a good performance of the de novo approach for the analysis of gene expression in different species. It also allowed the identification of differences in responses to overfeeding between Pekin and Muscovy ducks and hinny and mule hybrids.

Assuntos

Patos/genética , Perfilação da Expressão Gênica/veterinária , Fígado/metabolismo , Análise de Sequência de RNA/veterinária , Transcriptoma , Animais , Patos/fisiologia , Fígado Gorduroso/genética , Fígado Gorduroso/veterinária , Perfilação da Expressão Gênica/métodos , Perfilação da Expressão Gênica/normas , Hibridização Genética , Doenças das Aves Domésticas/genética , Padrões de Referência , Análise de Sequência de RNA/métodos , Análise de Sequência de RNA/normas

14.

Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data.

Lun, Aaron T L; Calero-Nieto, Fernando J; Haim-Vilmovsky, Liora; Göttgens, Berthold; Marioni, John C.

Genome Res ; 27(11): 1795-1806, 2017 11.

Artigo em Inglês | MEDLINE | ID: mdl-29030468

RESUMO

By profiling the transcriptomes of individual cells, single-cell RNA sequencing provides unparalleled resolution to study cellular heterogeneity. However, this comes at the cost of high technical noise, including cell-specific biases in capture efficiency and library generation. One strategy for removing these biases is to add a constant amount of spike-in RNA to each cell and to scale the observed expression values so that the coverage of spike-in transcripts is constant across cells. This approach has previously been criticized as its accuracy depends on the precise addition of spike-in RNA to each sample. Here, we perform mixture experiments using two different sets of spike-in RNA to quantify the variance in the amount of spike-in RNA added to each well in a plate-based protocol. We also obtain an upper bound on the variance due to differences in behavior between the two spike-in sets. We demonstrate that both factors are small contributors to the total technical variance and have only minor effects on downstream analyses, such as detection of highly variable genes and clustering. Our results suggest that scaling normalization using spike-in transcripts is reliable enough for routine use in single-cell RNA sequencing data analyses.

Assuntos

Análise de Sequência de RNA/normas , Análise de Célula Única/normas , Algoritmos , Animais , Linhagem Celular , Perfilação da Expressão Gênica/normas , Regulação da Expressão Gênica , Camundongos , Reprodutibilidade dos Testes

15.

Power analysis of single-cell RNA-sequencing experiments.

Svensson, Valentine; Natarajan, Kedar Nath; Ly, Lam-Ha; Miragaia, Ricardo J; Labalette, Charlotte; Macaulay, Iain C; Cvejic, Ana; Teichmann, Sarah A.

Nat Methods ; 14(4): 381-387, 2017 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-28263961

RESUMO

Single-cell RNA sequencing (scRNA-seq) has become an established and powerful method to investigate transcriptomic cell-to-cell variation, thereby revealing new cell types and providing insights into developmental processes and transcriptional stochasticity. A key question is how the variety of available protocols compare in terms of their ability to detect and accurately quantify gene expression. Here, we assessed the protocol sensitivity and accuracy of many published data sets, on the basis of spike-in standards and uniform data processing. For our workflow, we developed a flexible tool for counting the number of unique molecular identifiers (https://github.com/vals/umis/). We compared 15 protocols computationally and 4 protocols experimentally for batch-matched cell populations, in addition to investigating the effects of spike-in molecular degradation. Our analysis provides an integrated framework for comparing scRNA-seq protocols.

Assuntos

Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Animais , Células-Tronco Embrionárias/fisiologia , Congelamento , Camundongos , Poli A , RNA Mensageiro , Sensibilidade e Especificidade , Análise de Sequência de RNA/normas , Análise de Sequência de RNA/estatística & dados numéricos , Análise de Célula Única/normas , Análise de Célula Única/estatística & dados numéricos , Fluxo de Trabalho

16.

SCnorm: robust normalization of single-cell RNA-seq data.

Bacher, Rhonda; Chu, Li-Fang; Leng, Ning; Gasch, Audrey P; Thomson, James A; Stewart, Ron M; Newton, Michael; Kendziorski, Christina.

Nat Methods ; 14(6): 584-586, 2017 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-28418000

RESUMO

The normalization of RNA-seq data is essential for accurate downstream inference, but the assumptions upon which most normalization methods are based are not applicable in the single-cell setting. Consequently, applying existing normalization methods to single-cell RNA-seq data introduces artifacts that bias downstream analyses. To address this, we introduce SCnorm for accurate and efficient normalization of single-cell RNA-seq data.

Assuntos

Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/normas , RNA/genética , Análise de Sequência de RNA/normas , Análise de Célula Única/normas , Transcriptoma/genética , Interpretação Estatística de Dados , Valores de Referência , Software

17.

Normalizing single-cell RNA sequencing data: challenges and opportunities.

Vallejos, Catalina A; Risso, Davide; Scialdone, Antonio; Dudoit, Sandrine; Marioni, John C.

Nat Methods ; 14(6): 565-571, 2017 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-28504683

RESUMO

Single-cell transcriptomics is becoming an important component of the molecular biologist's toolkit. A critical step when analyzing data generated using this technology is normalization. However, normalization is typically performed using methods developed for bulk RNA sequencing or even microarray data, and the suitability of these methods for single-cell transcriptomics has not been assessed. We here discuss commonly used normalization approaches and illustrate how these can produce misleading results. Finally, we present alternative approaches and provide recommendations for single-cell RNA sequencing users.

Assuntos

Algoritmos , Sequenciamento de Nucleotídeos em Larga Escala/normas , RNA/genética , Análise de Sequência de RNA/normas , Análise de Célula Única/normas , Transcriptoma/genética , Interpretação Estatística de Dados , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Valores de Referência

18.

Identification and removal of sequencing artifacts produced by mispriming during reverse transcription in multiple RNA-seq technologies.

Shivram, Haridha; Iyer, Vishwanath R.

RNA ; 24(9): 1266-1274, 2018 09.

Artigo em Inglês | MEDLINE | ID: mdl-29950518

RESUMO

The quality of RNA sequencing data relies on specific priming by the primer used for reverse transcription (RT-primer). Nonspecific annealing of the RT-primer to the RNA template can generate reads with incorrect cDNA ends and can cause misinterpretation of data (RT mispriming). This kind of artifact in RNA-seq based technologies is underappreciated and currently no adequate tools exist to computationally remove them from published data sets. We show that mispriming can occur with as little as two bases of complementarity at the 3' end of the primer followed by intermittent regions of complementarity. We also provide a computational pipeline that identifies cDNA reads produced from RT mispriming, allowing users to filter them out from any aligned data set. Using this analysis pipeline, we identify thousands of mispriming events in a dozen published data sets from diverse technologies including short RNA-seq, total/mRNA-seq, HITS-CLIP, and GRO-seq. We further show how RT mispriming can lead to misinterpretation of data. In addition to providing a solution to computationally remove RT-misprimed reads, we also propose an experimental solution to completely avoid RT-mispriming by performing RNA-seq using thermostable group II intron derived reverse transcriptase (TGIRT-seq).

Assuntos

Reação em Cadeia da Polimerase Via Transcriptase Reversa/normas , Análise de Sequência de RNA/normas , Artefatos , Linhagem Celular Tumoral , Biologia Computacional/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Humanos , Sondas RNA/metabolismo , Reação em Cadeia da Polimerase Via Transcriptase Reversa/métodos , Transcrição Reversa , Análise de Sequência de RNA/métodos

19.

Low-Bias RNA Sequencing of the HIV-2 Genome from Blood Plasma.

James, Katherine L; de Silva, Thushan I; Brown, Katherine; Whittle, Hilton; Taylor, Stephen; McVean, Gilean; Esbjörnsson, Joakim; Rowland-Jones, Sarah L.

J Virol ; 93(1)2019 01 01.

Artigo em Inglês | MEDLINE | ID: mdl-30333167

RESUMO

Accurate determination of the genetic diversity present in the HIV quasispecies is critical for the development of a preventative vaccine: in particular, little is known about viral genetic diversity for the second type of HIV, HIV-2. A better understanding of HIV-2 biology is relevant to the HIV vaccine field because a substantial proportion of infected people experience long-term viral control, and prior HIV-2 infection has been associated with slower HIV-1 disease progression in coinfected subjects. The majority of traditional and next-generation sequencing methods have relied on target amplification prior to sequencing, introducing biases that may obscure the true signals of diversity in the viral population. Additionally, target enrichment through PCR requires a priori sequence knowledge, which is lacking for HIV-2. Therefore, a target enrichment free method of library preparation would be valuable for the field. We applied an RNA shotgun sequencing (RNA-Seq) method without PCR amplification to cultured viral stocks and patient plasma samples from HIV-2-infected individuals. Libraries generated from total plasma RNA were analyzed with a two-step pipeline: (i) de novo genome assembly, followed by (ii) read remapping. By this approach, whole-genome sequences were generated with a 28× to 67× mean depth of coverage. Assembled reads showed a low level of GC bias, and comparison of the genome diversities at the intrahost level showed low diversity in the accessory gene vpx in all patients. Our study demonstrates that RNA-Seq is a feasible full-genome de novo sequencing method for blood plasma samples collected from HIV-2-infected individuals.IMPORTANCE An accurate picture of viral genetic diversity is critical for the development of a globally effective HIV vaccine. However, sequencing strategies are often complicated by target enrichment prior to sequencing, introducing biases that can distort variant frequencies, which are not easily corrected for in downstream analyses. Additionally, detailed a priori sequence knowledge is needed to inform robust primer design when employing PCR amplification, a factor that is often lacking when working with tropical diseases localized in developing countries. Previous work has demonstrated that direct RNA shotgun sequencing (RNA-Seq) can be used to circumvent these issues for hepatitis C virus (HCV) and norovirus. We applied RNA-Seq to total RNA extracted from HIV-2 blood plasma samples, demonstrating the applicability of this technique to HIV-2 and allowing us to generate a dynamic picture of genetic diversity over the whole genome of HIV-2 in the context of low-bias sequencing.

Assuntos

Infecções por HIV/virologia , HIV-2/genética , RNA Viral/sangue , Análise de Sequência de RNA/métodos , África Ocidental , Viés , Feminino , Genoma Viral , Infecções por HIV/sangue , HIV-2/classificação , Humanos , Masculino , Filogenia , Quase-Espécies , Análise de Sequência de RNA/normas

20.

A complete statistical model for calibration of RNA-seq counts using external spike-ins and maximum likelihood theory.

Athanasiadou, Rodoniki; Neymotin, Benjamin; Brandt, Nathan; Wang, Wei; Christiaen, Lionel; Gresham, David; Tranchina, Daniel.

PLoS Comput Biol ; 15(3): e1006794, 2019 03.

Artigo em Inglês | MEDLINE | ID: mdl-30856174

RESUMO

A fundamental assumption, common to the vast majority of high-throughput transcriptome analyses, is that the expression of most genes is unchanged among samples and that total cellular RNA remains constant. As the number of analyzed experimental systems increases however, different independent studies demonstrate that this assumption is often violated. We present a calibration method using RNA spike-ins that allows for the measurement of absolute cellular abundance of RNA molecules. We apply the method to pooled RNA from cell populations of known sizes. For each transcript, we compute a nominal abundance that can be converted to absolute by dividing by a scale factor determined in separate experiments: the yield coefficient of the transcript relative to that of a reference spike-in measured with the same protocol. The method is derived by maximum likelihood theory in the context of a complete statistical model for sequencing counts contributed by cellular RNA and spike-ins. The counts are based on a sample from a fixed number of cells to which a fixed population of spike-in molecules has been added. We illustrate and evaluate the method with applications to two global expression data sets, one from the model eukaryote Saccharomyces cerevisiae, proliferating at different growth rates, and differentiating cardiopharyngeal cell lineages in the chordate Ciona robusta. We tested the method in a technical replicate dilution study, and in a k-fold validation study.

Assuntos

Funções Verossimilhança , Modelos Estatísticos , Análise de Sequência de RNA/normas , Animais , Calibragem , Ciona/embriologia , Ciona/genética , Expressão Gênica , Genes Fúngicos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , RNA Fúngico/genética , Saccharomyces cerevisiae/genética

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA