1.
Nat Commun ; 15(1): 3972, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38730241

ABSTRACT

The advancement of Long-Read Sequencing (LRS) techniques has significantly increased the length of sequencing to several kilobases, thereby facilitating the identification of alternative splicing events and isoform expressions. Recently, numerous computational tools for isoform detection using long-read sequencing data have been developed. Nevertheless, there remains a deficiency in comparative studies that systematically evaluate the performance of these tools, which are implemented with different algorithms, under various simulations that encompass potential influencing factors. In this study, we conducted a benchmark analysis of thirteen methods implemented in nine tools capable of identifying isoform structures from long-read RNA-seq data. We evaluated their performance using simulated data, which represented diverse sequencing platforms generated by an in-house simulator, RNA sequins (sequencing spike-ins) data, as well as experimental data. Our findings demonstrate IsoQuant as a highly effective tool for isoform detection with LRS, with Bambu and StringTie2 also exhibiting strong performance. These results offer valuable guidance for future research on alternative splicing analysis and the ongoing improvement of tools for isoform detection using LRS data.


Subjects
Algorithms , Alternative Splicing , Messenger RNA , RNA Sequence Analysis , Humans , Messenger RNA/genetics , Messenger RNA/analysis , RNA Sequence Analysis/methods , RNA Isoforms/genetics , Software , Computational Biology/methods , High-Throughput Nucleotide Sequencing/methods , Protein Isoforms/genetics
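
A benchmark such as the one in record 1 usually scores each tool by comparing the isoform structures it reports against a known reference. The sketch below assumes, purely for illustration, that an isoform is represented by its intron chain and counted as correct only on an exact match. The abstract does not state which metrics the authors used, and the function name and toy coordinates are hypothetical.

```python
def score_detection(detected, reference):
    """Return (precision, recall, F1) for exact-match isoform detection.

    Assumption: each isoform is a hashable intron chain, i.e. a tuple of
    (start, end) pairs. This is an illustrative convention, not the
    scoring scheme of the benchmarked study.
    """
    detected, reference = set(detected), set(reference)
    true_positives = len(detected & reference)
    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if true_positives == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical toy data: four reference isoforms, three detected ones.
reference = {
    ((100, 200), (300, 400)),
    ((100, 200), (350, 400)),
    ((100, 250),),
    ((500, 600),),
}
detected = {
    ((100, 200), (300, 400)),  # matches the reference
    ((100, 200), (350, 400)),  # matches the reference
    ((120, 250),),             # shifted splice site, counted as a false positive
}

precision, recall, f1 = score_detection(detected, reference)
print(f"precision={precision:.3f} recall={recall:.3f} F1={f1:.3f}")
```

Exact matching is the strictest choice; a real comparison might tolerate a few bases of slack at splice junctions, which would change the counts.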
2.
Nat Commun ; 11(1): 5148, 2020 Oct 13.
Article in English | MEDLINE | ID: mdl-33051455

ABSTRACT

The differences in transcription start sites (TSS) and transcription end sites (TES) among gene isoforms can affect the stability, localization, and translation efficiency of mRNA. Gene isoforms allow a single gene diverse functions across different cell types, and isoform dynamics allow different functions over time. However, methods to efficiently identify and quantify RNA isoforms genome-wide in single cells are still lacking. Here, we introduce single cell RNA Cap And Tail sequencing (scRCAT-seq), a method to demarcate the boundaries of isoforms based on short-read sequencing, with higher efficiency and lower cost than existing long-read sequencing methods. In conjunction with machine learning algorithms, scRCAT-seq demarcates RNA transcripts with unprecedented accuracy. We identified hundreds of previously uncharacterized transcripts and thousands of alternative transcripts for known genes, revealed cell-type specific isoforms for various cell types across different species, and generated a cell atlas of isoform dynamics during the development of retinal cones.


Subjects
RNA Caps/genetics , RNA Isoforms/genetics , RNA Sequence Analysis/methods , Genetic Transcription , Humans , RNA Sequence Analysis/economics , Single-Cell Analysis , Species Specificity , Transcription Initiation Site
3.
Hum Mutat ; 35(10): 1249-59, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25066652

ABSTRACT

Assessment of the functional consequences of variants near splice sites is a major challenge in the diagnostic laboratory. To address this issue, we created expression minigenes (EMGs) to determine the RNA and protein products generated by splice site variants (n = 10) implicated in cystic fibrosis (CF). Experimental results were compared with the splicing predictions of eight in silico tools. EMGs containing the full-length Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) coding sequence and flanking intron sequences generated wild-type transcript and fully processed protein in Human Embryonic Kidney (HEK293) and CF bronchial epithelial (CFBE41o-) cells. Quantification of variant-induced aberrant mRNA isoforms was concordant using fragment analysis and pyrosequencing. The splicing patterns of c.1585-1G>A and c.2657+5G>A were comparable to those reported in primary cells from individuals bearing these variants. Bioinformatics predictions were consistent with experimental results for 9/10 variants (MES), 8/10 variants (NNSplice), and 7/10 variants (SSAT and Sroogle). Programs that estimate the consequences of mis-splicing predicted 11/16 (HSF and ASSEDA) and 10/16 (Fsplice and SplicePort) experimentally observed mRNA isoforms. EMGs provide a robust experimental approach for clinical interpretation of splice site variants and refinement of in silico tools.


Subjects
Computer Simulation , Genetic Techniques , RNA Isoforms/genetics , RNA Splicing , Cell Line , Cystic Fibrosis/genetics , Cystic Fibrosis/metabolism , Cystic Fibrosis Transmembrane Conductance Regulator/genetics , Humans , Mutation , RNA Isoforms/analysis , RNA Splice Sites/genetics
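
The concordance counts reported in the abstract of record 3 are easier to compare as percentages. The sketch below only restates the fractions given in the abstract; it assumes that where two tools share a figure (for example "11/16 (HSF and ASSEDA)"), each tool achieved that figure individually. The variable and function names are hypothetical.

```python
from fractions import Fraction

# Splice-site predictions consistent with experiment (out of 10 variants),
# as reported in the abstract.
variant_concordance = {
    "MES": Fraction(9, 10),
    "NNSplice": Fraction(8, 10),
    "SSAT": Fraction(7, 10),
    "Sroogle": Fraction(7, 10),
}

# Experimentally observed mRNA isoforms that were predicted (out of 16).
# Assumption: tools reported together each reached the shared count.
isoform_concordance = {
    "HSF": Fraction(11, 16),
    "ASSEDA": Fraction(11, 16),
    "Fsplice": Fraction(10, 16),
    "SplicePort": Fraction(10, 16),
}


def as_percent(table):
    """Convert a {tool: Fraction} table to {tool: percentage as float}."""
    return {tool: float(frac * 100) for tool, frac in table.items()}


variant_pct = as_percent(variant_concordance)
isoform_pct = as_percent(isoform_concordance)

for tool, pct in {**variant_pct, **isoform_pct}.items():
    print(f"{tool:<10} {pct:.2f}%")
```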
4.
PLoS One ; 7(1): e29175, 2012.
Article in English | MEDLINE | ID: mdl-22238592

ABSTRACT

With the recent advances in high-throughput RNA sequencing (RNA-Seq), biologists are able to measure transcription with unprecedented precision. One problem that can now be tackled is that of isoform quantification: here one tries to reconstruct the abundances of isoforms of a gene. We have developed a statistical solution for this problem, based on analyzing a set of RNA-Seq reads, and a practical implementation, available from archive.gersteinlab.org/proj/rnaseq/IQSeq, in a tool we call IQSeq (Isoform Quantification in next-generation Sequencing). Here, we present the theoretical results on which IQSeq is based, and then use both simulated and real datasets to illustrate various applications of the tool. In order to measure the accuracy of an isoform-quantification result, one would try to estimate the average variance of the estimated isoform abundances for each gene (based on resampling the RNA-seq reads), and IQSeq has a particularly fast algorithm (based on the Fisher Information Matrix) for calculating this, achieving a speedup of ~500 times compared to brute-force resampling. IQSeq also calculates an information theoretic measure of overall transcriptome complexity to describe isoform abundance for a whole experiment. IQSeq has many features that are particularly useful in RNA-Seq experimental design, allowing one to optimally model the integration of different sequencing technologies in a cost-effective way. In particular, the IQSeq formalism integrates the analysis of different sample (i.e. read) sets generated from different technologies within the same statistical framework. It also supports a generalized statistical partial-sample-generation function to model the sequencing process. This allows one to have a modular, "plugin-able" read-generation function to support the particularities of the many evolving sequencing technologies.


Subjects
High-Throughput Nucleotide Sequencing/methods , RNA Isoforms/analysis , Software , Algorithms , Animals , Annelida/embryology , Annelida/genetics , Computational Biology/economics , Computational Biology/methods , Computer Simulation , Nonmammalian Embryo , Developmental Gene Expression Regulation , High-Throughput Nucleotide Sequencing/economics , Humans , Likelihood Functions , Biological Models , Theoretical Models , RNA Isoforms/genetics , Systems Integration , T Cell Transcription Factor 1/genetics , Transcriptome/genetics , Transcriptome/physiology
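
The isoform quantification problem described in record 4 can be illustrated with a generic expectation-maximization (EM) estimate of isoform fractions from reads that may be compatible with more than one isoform. This is a minimal sketch under simplifying assumptions (binary read-isoform compatibility, no isoform length correction, uniform starting values); it is not IQSeq's formalism or code, and the function name and toy counts are hypothetical.

```python
def estimate_isoform_fractions(compat, n_iter=200):
    """Estimate isoform fractions with a plain EM loop.

    compat: list of rows, one per read; compat[i][j] is 1 if read i could
    have come from isoform j, else 0. Assumption: compatibility is binary
    and isoform lengths are ignored, which a real tool would not do.
    """
    n_iso = len(compat[0])
    theta = [1.0 / n_iso] * n_iso
    for _ in range(n_iter):
        expected = [0.0] * n_iso
        for row in compat:
            # E-step: split each read across its compatible isoforms
            # in proportion to the current fraction estimates.
            weights = [t * c for t, c in zip(theta, row)]
            total = sum(weights)
            if total == 0:
                continue  # read compatible with no isoform; skip it
            for j, w in enumerate(weights):
                expected[j] += w / total
        # M-step: renormalize expected read counts into fractions.
        assigned = sum(expected)
        theta = [e / assigned for e in expected]
    return theta


# Hypothetical toy gene with two isoforms, A and B:
# 6 reads unique to A, 2 unique to B, 4 compatible with both.
reads = [[1, 0]] * 6 + [[0, 1]] * 2 + [[1, 1]] * 4
fractions = estimate_isoform_fractions(reads)
print([round(f, 4) for f in fractions])
```

In this toy case the ambiguous reads carry no information about the split, so the estimate settles at the ratio set by the unique reads alone (6 to 2).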