Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Nat Rev Genet ; 25(7): 460-475, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38366034

RESUMO

Short tandem repeats (STRs) are highly polymorphic sequences throughout the human genome that are composed of repeated copies of a 1-6-bp motif. Over 1 million variable STR loci are known, some of which regulate gene expression and influence complex traits, such as height. Moreover, variants in at least 60 STR loci cause genetic disorders, including Huntington disease and fragile X syndrome. Accurately identifying and genotyping STR variants is challenging, in particular mapping short reads to repetitive regions and inferring expanded repeat lengths. Recent advances in sequencing technology and computational tools for STR genotyping from sequencing data promise to help overcome this challenge and solve genetically unresolved cases and the 'missing heritability' of polygenic traits. Here, we compare STR genotyping methods, analytical tools and their applications to understand the effect of STR variation on health and disease. We identify emergent opportunities to refine genotyping and quality-control approaches as well as to integrate STRs into variant-calling workflows and large cohort analyses.


Assuntos
Genoma Humano , Repetições de Microssatélites , Humanos , Repetições de Microssatélites/genética , Análise de Sequência de DNA/métodos , Técnicas de Genotipagem/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genótipo
2.
Genome Res ; 34(5): 778-783, 2024 06 25.
Artigo em Inglês | MEDLINE | ID: mdl-38692839

RESUMO

In silico simulation of high-throughput sequencing data is a technique used widely in the genomics field. However, there is currently a lack of effective tools for creating simulated data from nanopore sequencing devices, which measure DNA or RNA molecules in the form of time-series current signal data. Here, we introduce Squigulator, a fast and simple tool for simulation of realistic nanopore signal data. Squigulator takes a reference genome, a transcriptome, or read sequences, and generates corresponding raw nanopore signal data. This is compatible with basecalling software from Oxford Nanopore Technologies (ONT) and other third-party tools, thereby providing a useful substrate for development, testing, debugging, validation, and optimization at every stage of a nanopore analysis workflow. The user may generate data with preset parameters emulating specific ONT protocols or noise-free "ideal" data, or they may deterministically modify a range of experimental variables and/or noise parameters to shape the data to their needs. We present a brief example of Squigulator's use, creating simulated data to model the degree to which different parameters impact the accuracy of ONT basecalling and downstream variant detection. This analysis reveals new insights into the nature of ONT data and basecalling algorithms. We provide Squigulator as an open-source tool for the nanopore community.


Assuntos
Sequenciamento por Nanoporos , Software , Sequenciamento por Nanoporos/métodos , Simulação por Computador , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Nanoporos , Humanos , Genômica/métodos , Análise de Sequência de DNA/métodos , Algoritmos
3.
Bioinformatics ; 40(8)2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-39137136

RESUMO

MOTIVATION: Nanopore sequencing current signal data can be 'basecalled' into sequence information or analysed directly, with the capacity to identify diverse molecular features, such as DNA/RNA base modifications and secondary structures. However, raw signal data is large and complex, and there is a need for improved visualization strategies to facilitate signal analysis, exploration and tool development. RESULTS: Squigualiser (Squiggle visualiser) is a toolkit for intuitive, interactive visualization of sequence-aligned signal data, which currently supports both DNA and RNA sequencing data from Oxford Nanopore Technologies instruments. Squigualiser is compatible with a wide range of alternative signal-alignment software packages and enables visualization of both signal-to-read and signal-to-reference aligned data at single-base resolution. Squigualiser generates an interactive signal browser view (HTML file), in which the user can navigate across a genome/transcriptome region and customize the display. Multiple independent reads are integrated into a 'signal pileup' format and different datasets can be displayed as parallel tracks. Although other methods exist, Squigualiser provides the community with a software package purpose-built for raw signal data visualization, incorporating a range of new and existing features into a unified platform. AVAILABILITY AND IMPLEMENTATION: Squigualiser is an open-source package under an MIT licence: https://github.com/hiruna72/squigualiser. The software was developed using Python 3.8 and can be installed with pip or bioconda or executed directly using prebuilt binaries provided with each release.


Assuntos
Sequenciamento por Nanoporos , Software , Sequenciamento por Nanoporos/métodos , Análise de Sequência de DNA/métodos , Alinhamento de Sequência/métodos , Análise de Sequência de RNA/métodos
4.
Genet Med ; 26(5): 101076, 2024 05.
Artigo em Inglês | MEDLINE | ID: mdl-38258669

RESUMO

PURPOSE: Genome sequencing (GS)-specific diagnostic rates in prospective tightly ascertained exome sequencing (ES)-negative intellectual disability (ID) cohorts have not been reported extensively. METHODS: ES, GS, epigenetic signatures, and long-read sequencing diagnoses were assessed in 74 trios with at least moderate ID. RESULTS: The ES diagnostic yield was 42 of 74 (57%). GS diagnoses were made in 9 of 32 (28%) ES-unresolved families. Repeated ES with a contemporary pipeline on the GS-diagnosed families identified 8 of 9 single-nucleotide variations/copy-number variations undetected in older ES, confirming a GS-unique diagnostic rate of 1 in 32 (3%). Episignatures contributed diagnostic information in 9% with GS corroboration in 1 of 32 (3%) and diagnostic clues in 2 of 32 (6%). A genetic etiology for ID was detected in 51 of 74 (69%) families. Twelve candidate disease genes were identified. Contemporary ES followed by GS cost US$4976 (95% CI: $3704; $6969) per diagnosis and first-line GS at a cost of $7062 (95% CI: $6210; $8475) per diagnosis. CONCLUSION: Performing GS only in ID trios would be cost equivalent to ES if GS were available at $2435, about a 60% reduction from current prices. This study demonstrates that first-line GS achieves higher diagnostic rate than contemporary ES but at a higher cost.


Assuntos
Sequenciamento do Exoma , Exoma , Deficiência Intelectual , Humanos , Deficiência Intelectual/genética , Deficiência Intelectual/diagnóstico , Masculino , Feminino , Exoma/genética , Sequenciamento do Exoma/economia , Estudos de Coortes , Testes Genéticos/economia , Testes Genéticos/métodos , Sequenciamento Completo do Genoma/economia , Criança , Genoma Humano/genética , Variações do Número de Cópias de DNA/genética , Polimorfismo de Nucleotídeo Único/genética , Pré-Escolar
5.
Cerebellum ; 2024 May 18.
Artigo em Inglês | MEDLINE | ID: mdl-38760634

RESUMO

The hereditary cerebellar ataxias (HCAs) are rare, progressive neurologic disorders caused by variants in many different genes. Inheritance may follow autosomal dominant, autosomal recessive, X-linked or mitochondrial patterns. The list of genes associated with adult-onset cerebellar ataxia is continuously growing, with several new genes discovered in the last few years. This includes short-tandem repeat (STR) expansions in RFC1, causing cerebellar ataxia, neuropathy, vestibular areflexia syndrome (CANVAS), FGF14-GAA causing spinocerebellar ataxia type 27B (SCA27B), and THAP11. In addition, the genetic basis for SCA4, has recently been identified as a STR expansion in ZFHX3. Given the large and growing number of genes, and different gene variant types, the approach to diagnostic testing for adult-onset HCA can be complex. Testing methods include targeted evaluation of STR expansions (e.g. SCAs, Friedreich ataxia, fragile X-associated tremor/ataxia syndrome, dentatorubral-pallidoluysian atrophy), next generation sequencing for conventional variants, which may include targeted gene panels, whole exome, or whole genome sequencing, followed by various potential additional tests. This review proposes a diagnostic approach for clinical testing, highlights the challenges with current testing technologies, and discusses future advances which may overcome these limitations. Implementing long-read sequencing has the potential to transform the diagnostic approach in HCA, with the overall aim to improve the diagnostic yield.

6.
J Peripher Nerv Syst ; 29(2): 262-274, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38860315

RESUMO

BACKGROUND: Loss-of-function variants in MME (membrane metalloendopeptidase) are a known cause of recessive Charcot-Marie-Tooth Neuropathy (CMT). A deep intronic variant, MME c.1188+428A>G (NM_000902.5), was identified through whole genome sequencing (WGS) of two Australian families with recessive inheritance of axonal CMT using the seqr platform. MME c.1188+428A>G was detected in a homozygous state in Family 1, and in a compound heterozygous state with a known pathogenic MME variant (c.467del; p.Pro156Leufs*14) in Family 2. AIMS: We aimed to determine the pathogenicity of the MME c.1188+428A>G variant through segregation and splicing analysis. METHODS: The splicing impact of the deep intronic MME variant c.1188+428A>G was assessed using an in vitro exon-trapping assay. RESULTS: The exon-trapping assay demonstrated that the MME c.1188+428A>G variant created a novel splice donor site resulting in the inclusion of an 83 bp pseudoexon between MME exons 12 and 13. The incorporation of the pseudoexon into MME transcript is predicted to lead to a coding frameshift and premature termination codon (PTC) in MME exon 14 (p.Ala397ProfsTer47). This PTC is likely to result in nonsense mediated decay (NMD) of MME transcript leading to a pathogenic loss-of-function. INTERPRETATION: To our knowledge, this is the first report of a pathogenic deep intronic MME variant causing CMT. This is of significance as deep intronic variants are missed using whole exome sequencing screening methods. Individuals with CMT should be reassessed for deep intronic variants, with splicing impacts being considered in relation to the potential pathogenicity of variants.


Assuntos
Doença de Charcot-Marie-Tooth , Metaloendopeptidases , Splicing de RNA , Adulto , Feminino , Humanos , Masculino , Doença de Charcot-Marie-Tooth/genética , Íntrons , Metaloendopeptidases/genética , Mutação , Linhagem
7.
Gigascience ; 132024 01 02.
Artigo em Inglês | MEDLINE | ID: mdl-38608279

RESUMO

BACKGROUND: As adoption of nanopore sequencing technology continues to advance, the need to maintain large volumes of raw current signal data for reanalysis with updated algorithms is a growing challenge. Here we introduce slow5curl, a software package designed to streamline nanopore data sharing, accessibility, and reanalysis. RESULTS: Slow5curl allows a user to fetch a specified read or group of reads from a raw nanopore dataset stored on a remote server, such as a public data repository, without downloading the entire file. Slow5curl uses an index to quickly fetch specific reads from a large dataset in SLOW5/BLOW5 format and highly parallelized data access requests to maximize download speeds. Using all public nanopore data from the Human Pangenome Reference Consortium (>22 TB), we demonstrate how slow5curl can be used to quickly fetch and reanalyze raw signal reads corresponding to a set of target genes from each individual in large cohort dataset (n = 91), minimizing the time, egress costs, and local storage requirements for their reanalysis. CONCLUSIONS: We provide slow5curl as a free, open-source package that will reduce frictions in data sharing for the nanopore community: https://github.com/BonsonW/slow5curl.


Assuntos
Sequenciamento por Nanoporos , Nanoporos , Humanos , Algoritmos , Disseminação de Informação , Registros
8.
Nat Commun ; 15(1): 1977, 2024 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-38438347

RESUMO

DNA methylation (5mC) is a repressive gene regulatory mark widespread in vertebrate genomes, yet the developmental dynamics in which 5mC patterns are established vary across species. While mammals undergo two rounds of global 5mC erasure, teleosts, for example, exhibit localized maternal-to-paternal 5mC remodeling. Here, we studied 5mC dynamics during the embryonic development of sea lamprey, a jawless vertebrate which occupies a critical phylogenetic position as the sister group of the jawed vertebrates. We employed 5mC quantification in lamprey embryos and tissues, and discovered large-scale maternal-to-paternal epigenome remodeling that affects ~30% of the embryonic genome and is predominantly associated with partially methylated domains. We further demonstrate that sequences eliminated during programmed genome rearrangement (PGR), are hypermethylated in sperm prior to the onset of PGR. Our study thus unveils important insights into the evolutionary origins of vertebrate 5mC reprogramming, and how this process might participate in diverse developmental strategies.


Assuntos
Epigenoma , Petromyzon , Feminino , Animais , Masculino , Filogenia , Sêmen , Desenvolvimento Embrionário/genética , Mamíferos
9.
Nat Commun ; 15(1): 2480, 2024 Mar 20.
Artigo em Inglês | MEDLINE | ID: mdl-38509097

RESUMO

The expression of genes encompasses their transcription into mRNA followed by translation into protein. In recent years, next-generation sequencing and mass spectrometry methods have profiled DNA, RNA and protein abundance in cells. However, there are currently no reference standards that are compatible across these genomic, transcriptomic and proteomic methods, and provide an integrated measure of gene expression. Here, we use synthetic biology principles to engineer a multi-omics control, termed pREF, that can act as a universal molecular standard for next-generation sequencing and mass spectrometry methods. The pREF sequence encodes 21 synthetic genes that can be in vitro transcribed into spike-in mRNA controls, and in vitro translated to generate matched protein controls. The synthetic genes provide qualitative controls that can measure sensitivity and quantitative accuracy of DNA, RNA and peptide detection. We demonstrate the use of pREF in metagenome DNA sequencing and RNA sequencing experiments and evaluate the quantification of proteins using mass spectrometry. Unlike previous spike-in controls, pREF can be independently propagated and the synthetic mRNA and protein controls can be sustainably prepared by recipient laboratories using common molecular biology techniques. Together, this provides a universal synthetic standard able to integrate genomic, transcriptomic and proteomic methods.


Assuntos
DNA , Proteômica , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , DNA/genética , Genômica , RNA
10.
medRxiv ; 2024 Jul 03.
Artigo em Inglês | MEDLINE | ID: mdl-39006414

RESUMO

Spinocerebellar ataxia 27B (SCA27B) is a common autosomal dominant ataxia caused by an intronic GAA•TTC repeat expansion in FGF14 . Neuropathological studies have shown that neuronal loss is largely restricted to the cerebellum. Although the repeat locus is highly unstable during intergenerational transmission, it remains unknown whether it exhibits cerebral mosaicism and progressive instability throughout life. We conducted an analysis of the FGF14 GAA•TTC repeat somatic instability across 156 serial blood samples from 69 individuals, fibroblasts, induced pluripotent stem cells, and post-mortem brain tissues from six controls and six patients with SCA27B, alongside methylation profiling using targeted long-read sequencing. Peripheral tissues exhibited minimal somatic instability, which did not significantly change over periods of more than 20 years. In post-mortem brains, the GAA•TTC repeat was remarkably stable across all regions, except in the cerebellar hemispheres and vermis. The levels of somatic expansion in the cerebellar hemispheres and vermis were, on average, 3.15 and 2.72 times greater relative to other examined brain regions, respectively. Additionally, levels of somatic expansion in the brain increased with repeat length and tissue expression of FGF14 . We found no significant difference in methylation of wild-type and expanded FGF14 alleles in post-mortem cerebellar hemispheres between patients and controls. In conclusion, our study revealed that the FGF14 GAA•TTC repeat exhibits a cerebellar-specific expansion bias, which may explain the pure and late-onset cerebellar involvement in SCA27B.

11.
Nat Genet ; 56(7): 1366-1370, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38937606

RESUMO

The factors driving or preventing pathological expansion of tandem repeats remain largely unknown. Here, we assessed the FGF14 (GAA)·(TTC) repeat locus in 2,530 individuals by long-read and Sanger sequencing and identified a common 5'-flanking variant in 70.34% of alleles analyzed (3,463/4,923) that represents the phylogenetically ancestral allele and is present on all major haplotypes. This common sequence variation is present nearly exclusively on nonpathogenic alleles with fewer than 30 GAA-pure triplets and is associated with enhanced stability of the repeat locus upon intergenerational transmission and increased Fiber-seq chromatin accessibility.


Assuntos
Alelos , Fatores de Crescimento de Fibroblastos , Fatores de Crescimento de Fibroblastos/genética , Fatores de Crescimento de Fibroblastos/metabolismo , Humanos , Haplótipos , Variação Genética , Loci Gênicos
12.
Sci Adv ; 10(27): eadl1197, 2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-38959305

RESUMO

Pancreatic ductal adenocarcinoma (PDAC) is characterized by increasing fibrosis, which can enhance tumor progression and spread. Here, we undertook an unbiased temporal assessment of the matrisome of the highly metastatic KPC (Pdx1-Cre, LSL-KrasG12D/+, LSL-Trp53R172H/+) and poorly metastatic KPflC (Pdx1-Cre, LSL-KrasG12D/+, Trp53fl/+) genetically engineered mouse models of pancreatic cancer using mass spectrometry proteomics. Our assessment at early-, mid-, and late-stage disease reveals an increased abundance of nidogen-2 (NID2) in the KPC model compared to KPflC, with further validation showing that NID2 is primarily expressed by cancer-associated fibroblasts (CAFs). Using biomechanical assessments, second harmonic generation imaging, and birefringence analysis, we show that NID2 reduction by CRISPR interference (CRISPRi) in CAFs reduces stiffness and matrix remodeling in three-dimensional models, leading to impaired cancer cell invasion. Intravital imaging revealed improved vascular patency in live NID2-depleted tumors, with enhanced response to gemcitabine/Abraxane. In orthotopic models, NID2 CRISPRi tumors had less liver metastasis and increased survival, highlighting NID2 as a potential PDAC cotarget.


Assuntos
Carcinoma Ductal Pancreático , Neoplasias Pancreáticas , Proteômica , Animais , Humanos , Camundongos , Proteínas de Ligação ao Cálcio/metabolismo , Proteínas de Ligação ao Cálcio/genética , Fibroblastos Associados a Câncer/metabolismo , Fibroblastos Associados a Câncer/patologia , Carcinoma Ductal Pancreático/metabolismo , Carcinoma Ductal Pancreático/patologia , Carcinoma Ductal Pancreático/genética , Moléculas de Adesão Celular , Linhagem Celular Tumoral , Desoxicitidina/análogos & derivados , Desoxicitidina/farmacologia , Modelos Animais de Doenças , Fibrose , Gencitabina , Neoplasias Pancreáticas/metabolismo , Neoplasias Pancreáticas/patologia , Neoplasias Pancreáticas/genética , Proteômica/métodos
13.
Nat Commun ; 15(1): 6327, 2024 Jul 27.
Artigo em Inglês | MEDLINE | ID: mdl-39068203

RESUMO

Oculopharyngodistal myopathy (OPDM) is an inherited myopathy manifesting with ptosis, dysphagia and distal weakness. Pathologically it is characterised by rimmed vacuoles and intranuclear inclusions on muscle biopsy. In recent years CGG • CCG repeat expansion in four different genes were identified in OPDM individuals in Asian populations. None of these have been found in affected individuals of non-Asian ancestry. In this study we describe the identification of CCG expansions in ABCD3, ranging from 118 to 694 repeats, in 35 affected individuals across eight unrelated OPDM families of European ancestry. ABCD3 transcript appears upregulated in fibroblasts and skeletal muscle from OPDM individuals, suggesting a potential role of over-expression of CCG repeat containing ABCD3 transcript in progressive skeletal muscle degeneration. The study provides further evidence of the role of non-coding repeat expansions in unsolved neuromuscular diseases and strengthens the association between the CGG • CCG repeat motif and a specific pattern of muscle weakness.


Assuntos
Músculo Esquelético , Expansão das Repetições de Trinucleotídeos , População Branca , Humanos , Masculino , Feminino , Adulto , Expansão das Repetições de Trinucleotídeos/genética , Pessoa de Meia-Idade , População Branca/genética , Músculo Esquelético/patologia , Transportadores de Cassetes de Ligação de ATP/genética , Miopatias Congênitas Estruturais/genética , Miopatias Congênitas Estruturais/patologia , Linhagem , Idoso , Adulto Jovem , Fibroblastos/metabolismo , Fibroblastos/patologia , Debilidade Muscular/genética , Debilidade Muscular/patologia , Adolescente , Distrofias Musculares
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA