Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 983
Filtrar
Mais filtros

Tipo de documento
Intervalo de ano de publicação
1.
Mol Cell ; 82(7): 1372-1382.e4, 2022 04 07.
Artigo em Inglês | MEDLINE | ID: mdl-35240057

RESUMO

Fundamental aspects of DNA replication, such as the anatomy of replication stall sites, how replisomes are influenced by gene transcription, and whether the progression of sister replisomes is coordinated, are poorly understood. Available techniques do not allow the precise mapping of the positions of individual replisomes on chromatin. We have developed a method called Replicon-seq that entails the excision of full-length replicons by controlled nuclease cleavage at replication forks. Replicons are sequenced using Nanopore, which provides a single-molecule readout of long DNA. Using Replicon-seq, we found that sister replisomes function autonomously and yet progress through chromatin with remarkable consistency. Replication forks that encounter obstacles pause for a short duration but rapidly resume synthesis. The helicase Rrm3 plays a critical role both in mitigating the effect of protein barriers and with facilitating efficient termination. Replicon-seq provides a high-resolution means of defining how individual replisomes move across the genome.


Assuntos
DNA Helicases , Replicação do DNA , Cromatina/genética , Cromossomos/metabolismo , DNA Helicases/genética , DNA Helicases/metabolismo
2.
Mol Cell ; 81(10): 2135-2147.e5, 2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-33713597

RESUMO

Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is currently a global pandemic. CoVs are known to generate negative subgenomes (subgenomic RNAs [sgRNAs]) through transcription-regulating sequence (TRS)-dependent template switching, but the global dynamic landscapes of coronaviral subgenomes and regulatory rules remain unclear. Here, using next-generation sequencing (NGS) short-read and Nanopore long-read poly(A) RNA sequencing in two cell types at multiple time points after infection with SARS-CoV-2, we identified hundreds of template switches and constructed the dynamic landscapes of SARS-CoV-2 subgenomes. Interestingly, template switching could occur in a bidirectional manner, with diverse SARS-CoV-2 subgenomes generated from successive template-switching events. The majority of template switches result from RNA-RNA interactions, including seed and compensatory modes, with terminal pairing status as a key determinant. Two TRS-independent template switch modes are also responsible for subgenome biogenesis. Our findings reveal the subgenome landscape of SARS-CoV-2 and its regulatory features, providing a molecular basis for understanding subgenome biogenesis and developing novel anti-viral strategies.


Assuntos
COVID-19 , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , RNA Viral , SARS-CoV-2 , Animais , COVID-19/genética , COVID-19/metabolismo , Células CACO-2 , Chlorocebus aethiops , Humanos , RNA Viral/genética , RNA Viral/metabolismo , SARS-CoV-2/genética , SARS-CoV-2/metabolismo , Células Vero
3.
Genes Dev ; 35(13-14): 1005-1019, 2021 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-34168039

RESUMO

N6-methyladenosine (m6A) is an abundant internal RNA modification, influencing transcript fate and function in uninfected and virus-infected cells. Installation of m6A by the nuclear RNA methyltransferase METTL3 occurs cotranscriptionally; however, the genomes of some cytoplasmic RNA viruses are also m6A-modified. How the cellular m6A modification machinery impacts coronavirus replication, which occurs exclusively in the cytoplasm, is unknown. Here we show that replication of SARS-CoV-2, the agent responsible for the COVID-19 pandemic, and a seasonal human ß-coronavirus HCoV-OC43, can be suppressed by depletion of METTL3 or cytoplasmic m6A reader proteins YTHDF1 and YTHDF3 and by a highly specific small molecule METTL3 inhibitor. Reduction of infectious titer correlates with decreased synthesis of viral RNAs and the essential nucleocapsid (N) protein. Sites of m6A modification on genomic and subgenomic RNAs of both viruses were mapped by methylated RNA immunoprecipitation sequencing (meRIP-seq). Levels of host factors involved in m6A installation, removal, and recognition were unchanged by HCoV-OC43 infection; however, nuclear localization of METTL3 and cytoplasmic m6A readers YTHDF1 and YTHDF2 increased. This establishes that coronavirus RNAs are m6A-modified and host m6A pathway components control ß-coronavirus replication. Moreover, it illustrates the therapeutic potential of targeting the m6A pathway to restrict coronavirus reproduction.


Assuntos
Coronavirus Humano OC43/fisiologia , Processamento Pós-Transcricional do RNA/genética , SARS-CoV-2/fisiologia , Replicação Viral/genética , Adenosina/análogos & derivados , Adenosina/genética , Adenosina/metabolismo , Linhagem Celular , Infecções por Coronavirus/metabolismo , Infecções por Coronavirus/virologia , Regulação da Expressão Gênica/efeitos dos fármacos , Interações Hospedeiro-Patógeno/efeitos dos fármacos , Humanos , Metiltransferases/antagonistas & inibidores , Metiltransferases/metabolismo , Proteínas do Nucleocapsídeo , RNA Viral/metabolismo , Proteínas de Ligação a RNA/metabolismo , Replicação Viral/efeitos dos fármacos
4.
Mol Cell ; 77(5): 985-998.e8, 2020 03 05.
Artigo em Inglês | MEDLINE | ID: mdl-31839405

RESUMO

Understanding how splicing events are coordinated across numerous introns in metazoan RNA transcripts requires quantitative analyses of transient RNA processing events in living cells. We developed nanopore analysis of co-transcriptional processing (nano-COP), in which nascent RNAs are directly sequenced through nanopores, exposing the dynamics and patterns of RNA splicing without biases introduced by amplification. Long nano-COP reads reveal that, in human and Drosophila cells, splicing occurs after RNA polymerase II transcribes several kilobases of pre-mRNA, suggesting that metazoan splicing transpires distally from the transcription machinery. Inhibition of the branch-site recognition complex SF3B rapidly diminished global co-transcriptional splicing. We found that splicing order does not strictly follow the order of transcription and is associated with cis-acting elements, alternative splicing, and RNA-binding factors. Further, neighboring introns in human cells tend to be spliced concurrently, implying that splicing of these introns occurs cooperatively. Thus, nano-COP unveils the organizational complexity of RNA processing.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , Precursores de RNA/metabolismo , Splicing de RNA , RNA Mensageiro/metabolismo , Análise de Sequência de RNA/métodos , Transcriptoma , Animais , Proteínas de Drosophila/genética , Proteínas de Drosophila/metabolismo , Drosophila melanogaster , Humanos , Íntrons , Células K562 , Cinética , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , Precursores de RNA/genética , Fatores de Processamento de RNA/genética , Fatores de Processamento de RNA/metabolismo , RNA Mensageiro/genética , Transcrição Gênica
5.
Trends Genet ; 39(9): 649-671, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37230864

RESUMO

Long-read sequencing (LRS) technologies have provided extremely powerful tools to explore genomes. While in the early years these methods suffered technical limitations, they have recently made significant progress in terms of read length, throughput, and accuracy and bioinformatics tools have strongly improved. Here, we aim to review the current status of LRS technologies, the development of novel methods, and the impact on genomics research. We will explore the most impactful recent findings made possible by these technologies focusing on high-resolution sequencing of genomes and transcriptomes and the direct detection of DNA and RNA modifications. We will also discuss how LRS methods promise a more comprehensive understanding of human genetic variation, transcriptomics, and epigenetics for the coming years.


Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genômica/métodos , Análise de Sequência de DNA/métodos , Biologia Computacional , Perfilação da Expressão Gênica/métodos
6.
Am J Hum Genet ; 110(8): 1229-1248, 2023 08 03.
Artigo em Inglês | MEDLINE | ID: mdl-37541186

RESUMO

Despite advances in clinical genetic testing, including the introduction of exome sequencing (ES), more than 50% of individuals with a suspected Mendelian condition lack a precise molecular diagnosis. Clinical evaluation is increasingly undertaken by specialists outside of clinical genetics, often occurring in a tiered fashion and typically ending after ES. The current diagnostic rate reflects multiple factors, including technical limitations, incomplete understanding of variant pathogenicity, missing genotype-phenotype associations, complex gene-environment interactions, and reporting differences between clinical labs. Maintaining a clear understanding of the rapidly evolving landscape of diagnostic tests beyond ES, and their limitations, presents a challenge for non-genetics professionals. Newer tests, such as short-read genome or RNA sequencing, can be challenging to order, and emerging technologies, such as optical genome mapping and long-read DNA sequencing, are not available clinically. Furthermore, there is no clear guidance on the next best steps after inconclusive evaluation. Here, we review why a clinical genetic evaluation may be negative, discuss questions to be asked in this setting, and provide a framework for further investigation, including the advantages and disadvantages of new approaches that are nascent in the clinical sphere. We present a guide for the next best steps after inconclusive molecular testing based upon phenotype and prior evaluation, including when to consider referral to research consortia focused on elucidating the underlying cause of rare unsolved genetic disorders.


Assuntos
Exoma , Testes Genéticos , Humanos , Exoma/genética , Análise de Sequência de DNA , Fenótipo , Sequenciamento do Exoma , Doenças Raras
7.
Am J Hum Genet ; 110(5): 863-879, 2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37146589

RESUMO

Deleterious mutations in the X-linked gene encoding ornithine transcarbamylase (OTC) cause the most common urea cycle disorder, OTC deficiency. This rare but highly actionable disease can present with severe neonatal onset in males or with later onset in either sex. Individuals with neonatal onset appear normal at birth but rapidly develop hyperammonemia, which can progress to cerebral edema, coma, and death, outcomes ameliorated by rapid diagnosis and treatment. Here, we develop a high-throughput functional assay for human OTC and individually measure the impact of 1,570 variants, 84% of all SNV-accessible missense mutations. Comparison to existing clinical significance calls, demonstrated that our assay distinguishes known benign from pathogenic variants and variants with neonatal onset from late-onset disease presentation. This functional stratification allowed us to identify score ranges corresponding to clinically relevant levels of impairment of OTC activity. Examining the results of our assay in the context of protein structure further allowed us to identify a 13 amino acid domain, the SMG loop, whose function appears to be required in human cells but not in yeast. Finally, inclusion of our data as PS3 evidence under the current ACMG guidelines, in a pilot reclassification of 34 variants with complete loss of activity, would change the classification of 22 from variants of unknown significance to clinically actionable likely pathogenic variants. These results illustrate how large-scale functional assays are especially powerful when applied to rare genetic diseases.


Assuntos
Hiperamonemia , Doença da Deficiência de Ornitina Carbomoiltransferase , Ornitina Carbamoiltransferase , Humanos , Substituição de Aminoácidos , Hiperamonemia/etiologia , Hiperamonemia/genética , Mutação de Sentido Incorreto/genética , Ornitina Carbamoiltransferase/genética , Doença da Deficiência de Ornitina Carbomoiltransferase/genética , Doença da Deficiência de Ornitina Carbomoiltransferase/diagnóstico , Doença da Deficiência de Ornitina Carbomoiltransferase/terapia
8.
RNA ; 30(8): 955-966, 2024 Jul 16.
Artigo em Inglês | MEDLINE | ID: mdl-38777382

RESUMO

The long noncoding RNA TERRA is transcribed from telomeres in virtually all eukaryotes with linear chromosomes. In humans, TERRA transcription is driven in part by promoters comprising CpG dinucleotide-rich repeats of 29 bp repeats, believed to be present in half of the subtelomeres. Thus far, TERRA expression has been analyzed mainly using molecular biology-based approaches that only generate partial and somehow biased results. Here, we present a novel experimental pipeline to study human TERRA based on long-read sequencing (TERRA ONTseq). By applying TERRA ONTseq to different cell lines, we show that the vast majority of human telomeres produce TERRA and that the cellular levels of TERRA transcripts vary according to their chromosomes of origin. Using TERRA ONTseq, we also identified regions containing TERRA transcription start sites (TSSs) in more than half of human subtelomeres. TERRA TSS regions are generally found immediately downstream from 29 bp repeat-related sequences, which appear to be more widespread than previously estimated. Finally, we isolated a novel TERRA promoter from the highly expressed subtelomere of the long arm of Chromosome 7. With the development of TERRA ONTseq, we provide a refined picture of human TERRA biogenesis and expression and we equip the scientific community with an invaluable tool for future studies.


Assuntos
Regiões Promotoras Genéticas , RNA Longo não Codificante , Telômero , Sítio de Iniciação de Transcrição , Transcriptoma , Humanos , Telômero/genética , Telômero/metabolismo , RNA Longo não Codificante/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos
9.
Trends Genet ; 38(10): 987-988, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35643778

RESUMO

Claussin et al. introduce Replicon-seq, a new genome-wide DNA sequencing technology that monitors the progression of individual replisomes at high resolution in vivo.


Assuntos
Replicação do DNA , Replicon , DNA , DNA Helicases/metabolismo , Replicon/genética
10.
Trends Genet ; 38(3): 246-257, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34711425

RESUMO

Nanopore sequencing provides signal data corresponding to the nucleotide motifs sequenced. Through machine learning-based methods, these signals are translated into long-read sequences that overcome the read size limit of short-read sequencing. However, analyzing the raw nanopore signal data provides many more opportunities beyond just sequencing genomes and transcriptomes: algorithms that use machine learning approaches to extract biological information from these signals allow the detection of DNA and RNA modifications, the estimation of poly(A) tail length, and the prediction of RNA secondary structures. In this review, we discuss how developments in machine learning methodologies contributed to more accurate basecalling and lower error rates, and how these methods enable new biological discoveries. We argue that direct nanopore sequencing of DNA and RNA provides a new dimensionality for genomics experiments and highlight challenges and future directions for computational approaches to extract the additional information provided by nanopore signal data.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , Algoritmos , Genômica , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Aprendizado de Máquina , Análise de Sequência de DNA/métodos
11.
RNA ; 29(6): 847-861, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-36854608

RESUMO

Ligation by plant and fungal RNA ligases yields an internal 2'-phosphate group on each RNA ligation product. In budding yeast, this covalent mark occurs at the splice junction of two targets of ligation: intron-containing tRNAs and the messenger RNA HAC1 The repertoire of RNA molecules repaired by RNA ligation has not been explored due to a lack of unbiased approaches for identifying RNA ligation products. Here, we define several unique signals produced by 2'-phosphorylated RNAs during nanopore sequencing. A 2'-phosphate at the splice junction of HAC1 mRNA inhibits 5' → 3' degradation, enabling detection of decay intermediates in yeast RNA repair mutants by nanopore sequencing. During direct RNA sequencing, intact 2'-phosphorylated RNAs on HAC1 and tRNAs produce diagnostic changes in nanopore current properties and base calling features, including stalls produced as the modified RNA translocates through the nanopore motor protein. These approaches enable directed studies to identify novel RNA repair events in other contexts.


Assuntos
Sequenciamento por Nanoporos , Fosforilação , RNA , Saccharomyces cerevisiae , RNA/genética , RNA/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA de Transferência/genética , RNA de Transferência/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
12.
RNA ; 29(8): 1255-1273, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37192814

RESUMO

Ribosomal RNA (rRNA) maturation in archaea is a complex multistep process that requires well-defined endo- and exoribonuclease activities to generate fully mature linear rRNAs. However, technical challenges prevented detailed mapping of rRNA processing steps and a systematic analysis of rRNA maturation pathways across the tree of life. In this study, we used long-read (PCR)-cDNA and direct RNA nanopore-based sequencing to study rRNA maturation in three archaeal model organisms, namely the Euryarchaea Haloferax volcanii and Pyrococcus furiosus and the Crenarchaeon Sulfolobus acidocaldarius Compared to standard short-read protocols, nanopore sequencing facilitates simultaneous readout of 5'- and 3'-positions, which is required for the classification of rRNA processing intermediates. More specifically, we (i) accurately detect and describe rRNA maturation stages by analysis of terminal read positions of cDNA reads and thereupon (ii) explore the stage-dependent installation of the KsgA-mediated dimethylations in H. volcanii using base-calling and signal characteristics of direct RNA reads. Due to the single-molecule sequencing capacity of nanopore sequencing, we could detect hitherto unknown intermediates with high confidence, revealing details about the maturation of archaea-specific circular rRNA intermediates. Taken together, our study delineates common principles and unique features of rRNA processing in euryarchaeal and crenarchaeal representatives, thereby significantly expanding our understanding of rRNA maturation pathways in archaea.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , RNA Ribossômico/genética , RNA , Archaea/genética , DNA Complementar , Análise de Sequência de RNA
13.
RNA ; 29(8): 1099-1107, 2023 08.
Artigo em Inglês | MEDLINE | ID: mdl-37137666

RESUMO

RT-PCR and northern blots have long been used to study RNA isoforms usage for single genes. Recent advancements in long-read sequencing have yielded unprecedented information about the usage and abundance of these RNA isoforms. However, visualization of long-read sequencing data remains challenging due to the high information density. To alleviate these issues, we have developed NanoBlot, an open-source R-package that generates northern blot and RT-PCR-like images from long-read sequencing data. NanoBlot requires aligned, positionally sorted and indexed BAM files. Plotting is based around ggplot2 and is easily customizable. Advantages of NanoBlot include a robust system for designing probes to visualize isoforms including excluding reads based on the presence or absence of a specified region, an elegant solution to representing isoforms with continuous variations in length, and the ability to overlay multiple genes in the same plot using different colors. We present examples of nanoblots compared to actual northern blot data. In addition to traditional gel-like images, the NanoBlot package can also output other visualizations such as violin plots and 3'-RACE-like plots focused on 3'-end isoforms visualization. The use of the NanoBlot package should provide a simple answer to some of the challenges of visualizing long-read RNA-sequencing data.


Assuntos
Isoformas de RNA , RNA , RNA/genética , Isoformas de RNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Isoformas de Proteínas/genética , Processamento Alternativo , Perfilação da Expressão Gênica/métodos , Transcriptoma
14.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38189540

RESUMO

Nanopore sequencers can enrich or deplete the targeted DNA molecules in a library by reversing the voltage across individual nanopores. However, it requires substantial computational resources to achieve rapid operations in parallel at read-time sequencing. We present a deep learning framework, NanoDeep, to overcome these limitations by incorporating convolutional neural network and squeeze and excitation. We first showed that the raw squiggle derived from native DNA sequences determines the origin of microbial and human genomes. Then, we demonstrated that NanoDeep successfully classified bacterial reads from the pooled library with human sequence and showed enrichment for bacterial sequence compared with routine nanopore sequencing setting. Further, we showed that NanoDeep improves the sequencing efficiency and preserves the fidelity of bacterial genomes in the mock sample. In addition, NanoDeep performs well in the enrichment of metagenome sequences of gut samples, showing its potential applications in the enrichment of unknown microbiota. Our toolkit is available at https://github.com/lysovosyl/NanoDeep.


Assuntos
Aprendizado Profundo , Sequenciamento por Nanoporos , Nanoporos , Humanos , Biblioteca Gênica , Genoma Bacteriano
15.
Mol Syst Biol ; 20(7): 767-798, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38755290

RESUMO

Static gene expression programs have been extensively characterized in stem cells and mature human cells. However, the dynamics of RNA isoform changes upon cell-state-transitions during cell differentiation, the determinants and functional consequences have largely remained unclear. Here, we established an improved model for human neurogenesis in vitro that is amenable for systems-wide analyses of gene expression. Our multi-omics analysis reveals that the pronounced alterations in cell morphology correlate strongly with widespread changes in RNA isoform expression. Our approach identifies thousands of new RNA isoforms that are expressed at distinct differentiation stages. RNA isoforms mainly arise from exon skipping and the alternative usage of transcription start and polyadenylation sites during human neurogenesis. The transcript isoform changes can remodel the identity and functions of protein isoforms. Finally, our study identifies a set of RNA binding proteins as a potential determinant of differentiation stage-specific global isoform changes. This work supports the view of regulated isoform changes that underlie state-transitions during neurogenesis.


Assuntos
Diferenciação Celular , Neurogênese , Neurônios , Isoformas de RNA , Humanos , Neurogênese/genética , Diferenciação Celular/genética , Isoformas de RNA/genética , Isoformas de RNA/metabolismo , Neurônios/metabolismo , Neurônios/citologia , Processamento Alternativo , Proteínas de Ligação a RNA/metabolismo , Proteínas de Ligação a RNA/genética , Isoformas de Proteínas/metabolismo , Isoformas de Proteínas/genética , Éxons/genética
16.
Mass Spectrom Rev ; 43(1): 5-38, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-36052666

RESUMO

The discovery of RNA silencing has revealed that non-protein-coding sequences (ncRNAs) can cover essential roles in regulatory networks and their malfunction may result in severe consequences on human health. These findings have prompted a general reassessment of the significance of RNA as a key player in cellular processes. This reassessment, however, will not be complete without a greater understanding of the distribution and function of the over 170 variants of the canonical ribonucleotides, which contribute to the breathtaking structural diversity of natural RNA. This review surveys the analytical approaches employed for the identification, characterization, and detection of RNA posttranscriptional modifications (rPTMs). The merits of analyzing individual units after exhaustive hydrolysis of the initial biopolymer are outlined together with those of identifying their position in the sequence of parent strands. Approaches based on next generation sequencing and mass spectrometry technologies are covered in depth to provide a comprehensive view of their respective merits. Deciphering the epitranscriptomic code will require not only mapping the location of rPTMs in the various classes of RNAs, but also assessing the variations of expression levels under different experimental conditions. The fact that no individual platform is currently capable of meeting all such demands implies that it will be essential to capitalize on complementary approaches to obtain the desired information. For this reason, the review strived to cover the broadest possible range of techniques to provide readers with the fundamental elements necessary to make informed choices and design the most effective possible strategy to accomplish the task at hand.


Assuntos
Processamento Pós-Transcricional do RNA , RNA , Humanos , RNA/genética , Análise de Sequência de RNA/métodos
17.
Semin Cell Dev Biol ; 127: 155-165, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-34838434

RESUMO

It is well established that DNA base modifications play a key role in gene regulation during development and in response to environmental stress. This type of epigenetic control of development and environmental responses has been intensively studied over the past few decades. Similar to DNA, various RNA species also undergo modifications that play important roles in, for example, RNA splicing, protein translation, and the avoidance of immune surveillance by host. More than 160 different types of RNA modifications have been identified. In addition to base modifications, RNA modification also involves splicing of pre-mRNAs, leading to as many as tens of transcript isoforms from a single pre-RNA, especially in higher organisms. However, the function, prevalence and distribution of RNA modifications are poorly understood. The lack of a suitable method for the reliable identification of RNA modifications constitutes a significant challenge to studying their functions. This review focuses on the technologies that enable de novo identification of RNA base modifications and the alternatively spliced mRNA transcripts.


Assuntos
Processamento Alternativo , Splicing de RNA , Processamento Alternativo/genética , Isoformas de Proteínas/metabolismo , RNA/genética , RNA/metabolismo , Precursores de RNA/genética , Precursores de RNA/metabolismo , Splicing de RNA/genética , RNA Mensageiro/genética
18.
BMC Genomics ; 25(1): 189, 2024 Feb 17.
Artigo em Inglês | MEDLINE | ID: mdl-38368357

RESUMO

BACKGROUND: CRISPR-Cas9 technology has advanced in vivo gene therapy for disorders like hemophilia A, notably through the successful targeted incorporation of the F8 gene into the Alb locus in hepatocytes, effectively curing this disorder in mice. However, thoroughly evaluating the safety and specificity of this therapy is essential. Our study introduces a novel methodology to analyze complex insertion sequences at the on-target edited locus, utilizing barcoded long-range PCR, CRISPR RNP-mediated deletion of unedited alleles, magnetic bead-based long amplicon enrichment, and nanopore sequencing. RESULTS: We identified the expected F8 insertions and various fragment combinations resulting from the in vivo linearization of the double-cut plasmid donor. Notably, our research is the first to document insertions exceeding ten kbp. We also found that a small proportion of these insertions were derived from sources other than donor plasmids, including Cas9-sgRNA plasmids, genomic DNA fragments, and LINE-1 elements. CONCLUSIONS: Our study presents a robust method for analyzing the complexity of on-target editing, particularly for in vivo long insertions, where donor template integration can be challenging. This work offers a new tool for quality control in gene editing outcomes and underscores the importance of detailed characterization of edited genomic sequences. Our findings have significant implications for enhancing the safety and effectiveness of CRISPR-Cas9 gene therapy in treating various disorders, including hemophilia A.


Assuntos
Hemofilia A , Sequenciamento por Nanoporos , Camundongos , Animais , Sistemas CRISPR-Cas , RNA Guia de Sistemas CRISPR-Cas , Hemofilia A/genética , Hemofilia A/terapia , Edição de Genes/métodos , DNA
19.
BMC Genomics ; 25(1): 528, 2024 May 28.
Artigo em Inglês | MEDLINE | ID: mdl-38807060

RESUMO

BACKGROUND: Direct RNA sequencing (dRNA-seq) on the Oxford Nanopore Technologies (ONT) platforms can produce reads covering up to full-length gene transcripts, while containing decipherable information about RNA base modifications and poly-A tail lengths. Although many published studies have been expanding the potential of dRNA-seq, its sequencing accuracy and error patterns remain understudied. RESULTS: We present the first comprehensive evaluation of sequencing accuracy and characterisation of systematic errors in dRNA-seq data from diverse organisms and synthetic in vitro transcribed RNAs. We found that for sequencing kits SQK-RNA001 and SQK-RNA002, the median read accuracy ranged from 87% to 92% across species, and deletions significantly outnumbered mismatches and insertions. Due to their high abundance in the transcriptome, heteropolymers and short homopolymers were the major contributors to the overall sequencing errors. We also observed systematic biases across all species at the levels of single nucleotides and motifs. In general, cytosine/uracil-rich regions were more likely to be erroneous than guanines and adenines. By examining raw signal data, we identified the underlying signal-level features potentially associated with the error patterns and their dependency on sequence contexts. While read quality scores can be used to approximate error rates at base and read levels, failure to detect DNA adapters may be a source of errors and data loss. By comparing distinct basecallers, we reason that some sequencing errors are attributable to signal insufficiency rather than algorithmic (basecalling) artefacts. Lastly, we generated dRNA-seq data using the latest SQK-RNA004 sequencing kit released at the end of 2023 and found that although the overall read accuracy increased, the systematic errors remain largely identical compared to the previous kits. CONCLUSIONS: As the first systematic investigation of dRNA-seq errors, this study offers a comprehensive overview of reproducible error patterns across diverse datasets, identifies potential signal-level insufficiency, and lays the foundation for error correction methods.


Assuntos
Sequenciamento por Nanoporos , Análise de Sequência de RNA , Análise de Sequência de RNA/métodos , Sequenciamento por Nanoporos/métodos , Nanoporos , Humanos , Animais , RNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos
20.
BMC Genomics ; 25(1): 679, 2024 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-38978005

RESUMO

BACKGROUND: Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks. RESULTS: We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct. CONCLUSIONS: Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.


Assuntos
Benchmarking , Surtos de Doenças , Genoma Bacteriano , Nanoporos , Sequenciamento por Nanoporos/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Salmonella enterica/genética , Salmonella enterica/isolamento & purificação , Humanos , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA