Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 64
Filtrar
1.
J Peripher Nerv Syst ; 2024 Jun 11.
Artículo en Inglés | MEDLINE | ID: mdl-38860315

RESUMEN

BACKGROUND: Loss-of-function variants in MME (membrane metalloendopeptidase) are a known cause of recessive Charcot-Marie-Tooth Neuropathy (CMT). A deep intronic variant, MME c.1188+428A>G (NM_000902.5), was identified through whole genome sequencing (WGS) of two Australian families with recessive inheritance of axonal CMT using the seqr platform. MME c.1188+428A>G was detected in a homozygous state in Family 1, and in a compound heterozygous state with a known pathogenic MME variant (c.467del; p.Pro156Leufs*14) in Family 2. AIMS: We aimed to determine the pathogenicity of the MME c.1188+428A>G variant through segregation and splicing analysis. METHODS: The splicing impact of the deep intronic MME variant c.1188+428A>G was assessed using an in vitro exon-trapping assay. RESULTS: The exon-trapping assay demonstrated that the MME c.1188+428A>G variant created a novel splice donor site resulting in the inclusion of an 83 bp pseudoexon between MME exons 12 and 13. The incorporation of the pseudoexon into MME transcript is predicted to lead to a coding frameshift and premature termination codon (PTC) in MME exon 14 (p.Ala397ProfsTer47). This PTC is likely to result in nonsense mediated decay (NMD) of MME transcript leading to a pathogenic loss-of-function. INTERPRETATION: To our knowledge, this is the first report of a pathogenic deep intronic MME variant causing CMT. This is of significance as deep intronic variants are missed using whole exome sequencing screening methods. Individuals with CMT should be reassessed for deep intronic variants, with splicing impacts being considered in relation to the potential pathogenicity of variants.

2.
Genome Res ; 2024 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-38692839

RESUMEN

In silico simulation of high-throughput sequencing data is a technique used widely in the genomics field. However, there is currently a lack of effective tools for creating simulated data from nanopore sequencing devices, which measure DNA or RNA molecules in the form of time-series current signal data. Here, we introduce Squigulator, a fast and simple tool for simulation of realistic nanopore signal data. Squigulator takes a reference genome, a transcriptome, or read sequences, and generates corresponding raw nanopore signal data. This is compatible with basecalling software from Oxford Nanopore Technologies (ONT) and other third-party tools, thereby providing a useful substrate for development, testing, debugging, validation, and optimization at every stage of a nanopore analysis workflow. The user may generate data with preset parameters emulating specific ONT protocols or noise-free "ideal" data, or they may deterministically modify a range of experimental variables and/or noise parameters to shape the data to their needs. We present a brief example of Squigulator's use, creating simulated data to model the degree to which different parameters impact the accuracy of ONT basecalling and downstream variant detection. This analysis reveals new insights into the nature of ONT data and basecalling algorithms. We provide Squigulator as an open-source tool for the nanopore community.

3.
Cerebellum ; 2024 May 18.
Artículo en Inglés | MEDLINE | ID: mdl-38760634

RESUMEN

The hereditary cerebellar ataxias (HCAs) are rare, progressive neurologic disorders caused by variants in many different genes. Inheritance may follow autosomal dominant, autosomal recessive, X-linked or mitochondrial patterns. The list of genes associated with adult-onset cerebellar ataxia is continuously growing, with several new genes discovered in the last few years. This includes short-tandem repeat (STR) expansions in RFC1, causing cerebellar ataxia, neuropathy, vestibular areflexia syndrome (CANVAS), FGF14-GAA causing spinocerebellar ataxia type 27B (SCA27B), and THAP11. In addition, the genetic basis for SCA4, has recently been identified as a STR expansion in ZFHX3. Given the large and growing number of genes, and different gene variant types, the approach to diagnostic testing for adult-onset HCA can be complex. Testing methods include targeted evaluation of STR expansions (e.g. SCAs, Friedreich ataxia, fragile X-associated tremor/ataxia syndrome, dentatorubral-pallidoluysian atrophy), next generation sequencing for conventional variants, which may include targeted gene panels, whole exome, or whole genome sequencing, followed by various potential additional tests. This review proposes a diagnostic approach for clinical testing, highlights the challenges with current testing technologies, and discusses future advances which may overcome these limitations. Implementing long-read sequencing has the potential to transform the diagnostic approach in HCA, with the overall aim to improve the diagnostic yield.

4.
Gigascience ; 132024 Jan 02.
Artículo en Inglés | MEDLINE | ID: mdl-38608279

RESUMEN

BACKGROUND: As adoption of nanopore sequencing technology continues to advance, the need to maintain large volumes of raw current signal data for reanalysis with updated algorithms is a growing challenge. Here we introduce slow5curl, a software package designed to streamline nanopore data sharing, accessibility, and reanalysis. RESULTS: Slow5curl allows a user to fetch a specified read or group of reads from a raw nanopore dataset stored on a remote server, such as a public data repository, without downloading the entire file. Slow5curl uses an index to quickly fetch specific reads from a large dataset in SLOW5/BLOW5 format and highly parallelized data access requests to maximize download speeds. Using all public nanopore data from the Human Pangenome Reference Consortium (>22 TB), we demonstrate how slow5curl can be used to quickly fetch and reanalyze raw signal reads corresponding to a set of target genes from each individual in large cohort dataset (n = 91), minimizing the time, egress costs, and local storage requirements for their reanalysis. CONCLUSIONS: We provide slow5curl as a free, open-source package that will reduce frictions in data sharing for the nanopore community: https://github.com/BonsonW/slow5curl.


Asunto(s)
Secuenciación de Nanoporos , Nanoporos , Humanos , Algoritmos , Difusión de la Información , Registros
5.
Nat Commun ; 15(1): 1977, 2024 Mar 04.
Artículo en Inglés | MEDLINE | ID: mdl-38438347

RESUMEN

DNA methylation (5mC) is a repressive gene regulatory mark widespread in vertebrate genomes, yet the developmental dynamics in which 5mC patterns are established vary across species. While mammals undergo two rounds of global 5mC erasure, teleosts, for example, exhibit localized maternal-to-paternal 5mC remodeling. Here, we studied 5mC dynamics during the embryonic development of sea lamprey, a jawless vertebrate which occupies a critical phylogenetic position as the sister group of the jawed vertebrates. We employed 5mC quantification in lamprey embryos and tissues, and discovered large-scale maternal-to-paternal epigenome remodeling that affects ~30% of the embryonic genome and is predominantly associated with partially methylated domains. We further demonstrate that sequences eliminated during programmed genome rearrangement (PGR), are hypermethylated in sperm prior to the onset of PGR. Our study thus unveils important insights into the evolutionary origins of vertebrate 5mC reprogramming, and how this process might participate in diverse developmental strategies.


Asunto(s)
Epigenoma , Petromyzon , Femenino , Animales , Masculino , Filogenia , Semen , Desarrollo Embrionario/genética , Mamíferos
6.
Nat Commun ; 15(1): 2480, 2024 Mar 20.
Artículo en Inglés | MEDLINE | ID: mdl-38509097

RESUMEN

The expression of genes encompasses their transcription into mRNA followed by translation into protein. In recent years, next-generation sequencing and mass spectrometry methods have profiled DNA, RNA and protein abundance in cells. However, there are currently no reference standards that are compatible across these genomic, transcriptomic and proteomic methods, and provide an integrated measure of gene expression. Here, we use synthetic biology principles to engineer a multi-omics control, termed pREF, that can act as a universal molecular standard for next-generation sequencing and mass spectrometry methods. The pREF sequence encodes 21 synthetic genes that can be in vitro transcribed into spike-in mRNA controls, and in vitro translated to generate matched protein controls. The synthetic genes provide qualitative controls that can measure sensitivity and quantitative accuracy of DNA, RNA and peptide detection. We demonstrate the use of pREF in metagenome DNA sequencing and RNA sequencing experiments and evaluate the quantification of proteins using mass spectrometry. Unlike previous spike-in controls, pREF can be independently propagated and the synthetic mRNA and protein controls can be sustainably prepared by recipient laboratories using common molecular biology techniques. Together, this provides a universal synthetic standard able to integrate genomic, transcriptomic and proteomic methods.


Asunto(s)
ADN , Proteómica , ARN Mensajero/genética , ARN Mensajero/metabolismo , ADN/genética , Genómica , ARN
8.
Nat Rev Genet ; 2024 Feb 16.
Artículo en Inglés | MEDLINE | ID: mdl-38366034

RESUMEN

Short tandem repeats (STRs) are highly polymorphic sequences throughout the human genome that are composed of repeated copies of a 1-6-bp motif. Over 1 million variable STR loci are known, some of which regulate gene expression and influence complex traits, such as height. Moreover, variants in at least 60 STR loci cause genetic disorders, including Huntington disease and fragile X syndrome. Accurately identifying and genotyping STR variants is challenging, in particular mapping short reads to repetitive regions and inferring expanded repeat lengths. Recent advances in sequencing technology and computational tools for STR genotyping from sequencing data promise to help overcome this challenge and solve genetically unresolved cases and the 'missing heritability' of polygenic traits. Here, we compare STR genotyping methods, analytical tools and their applications to understand the effect of STR variation on health and disease. We identify emergent opportunities to refine genotyping and quality-control approaches as well as to integrate STRs into variant-calling workflows and large cohort analyses.

9.
Genet Med ; 26(5): 101076, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38258669

RESUMEN

PURPOSE: Genome sequencing (GS)-specific diagnostic rates in prospective tightly ascertained exome sequencing (ES)-negative intellectual disability (ID) cohorts have not been reported extensively. METHODS: ES, GS, epigenetic signatures, and long-read sequencing diagnoses were assessed in 74 trios with at least moderate ID. RESULTS: The ES diagnostic yield was 42 of 74 (57%). GS diagnoses were made in 9 of 32 (28%) ES-unresolved families. Repeated ES with a contemporary pipeline on the GS-diagnosed families identified 8 of 9 single-nucleotide variations/copy-number variations undetected in older ES, confirming a GS-unique diagnostic rate of 1 in 32 (3%). Episignatures contributed diagnostic information in 9% with GS corroboration in 1 of 32 (3%) and diagnostic clues in 2 of 32 (6%). A genetic etiology for ID was detected in 51 of 74 (69%) families. Twelve candidate disease genes were identified. Contemporary ES followed by GS cost US$4976 (95% CI: $3704; $6969) per diagnosis and first-line GS at a cost of $7062 (95% CI: $6210; $8475) per diagnosis. CONCLUSION: Performing GS only in ID trios would be cost equivalent to ES if GS were available at $2435, about a 60% reduction from current prices. This study demonstrates that first-line GS achieves higher diagnostic rate than contemporary ES but at a higher cost.


Asunto(s)
Secuenciación del Exoma , Exoma , Discapacidad Intelectual , Humanos , Discapacidad Intelectual/genética , Discapacidad Intelectual/diagnóstico , Masculino , Femenino , Exoma/genética , Secuenciación del Exoma/economía , Estudios de Cohortes , Pruebas Genéticas/economía , Pruebas Genéticas/métodos , Secuenciación Completa del Genoma/economía , Niño , Genoma Humano/genética , Variaciones en el Número de Copia de ADN/genética , Polimorfismo de Nucleótido Simple/genética , Preescolar
10.
Nature ; 624(7992): 602-610, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-38093003

RESUMEN

Indigenous Australians harbour rich and unique genomic diversity. However, Aboriginal and Torres Strait Islander ancestries are historically under-represented in genomics research and almost completely missing from reference datasets1-3. Addressing this representation gap is critical, both to advance our understanding of global human genomic diversity and as a prerequisite for ensuring equitable outcomes in genomic medicine. Here we apply population-scale whole-genome long-read sequencing4 to profile genomic structural variation across four remote Indigenous communities. We uncover an abundance of large insertion-deletion variants (20-49 bp; n = 136,797), structural variants (50 b-50 kb; n = 159,912) and regions of variable copy number (>50 kb; n = 156). The majority of variants are composed of tandem repeat or interspersed mobile element sequences (up to 90%) and have not been previously annotated (up to 62%). A large fraction of structural variants appear to be exclusive to Indigenous Australians (12% lower-bound estimate) and most of these are found in only a single community, underscoring the need for broad and deep sampling to achieve a comprehensive catalogue of genomic structural variation across the Australian continent. Finally, we explore short tandem repeats throughout the genome to characterize allelic diversity at 50 known disease loci5, uncover hundreds of novel repeat expansion sites within protein-coding genes, and identify unique patterns of diversity and constraint among short tandem repeat sequences. Our study sheds new light on the dimensions and dynamics of genomic structural variation within and beyond Australia.


Asunto(s)
Aborigenas Australianos e Isleños del Estrecho de Torres , Genoma Humano , Variación Estructural del Genoma , Humanos , Alelos , Australia/etnología , Aborigenas Australianos e Isleños del Estrecho de Torres/genética , Conjuntos de Datos como Asunto , Variaciones en el Número de Copia de ADN/genética , Sitios Genéticos/genética , Genética Médica , Variación Estructural del Genoma/genética , Genómica , Mutación INDEL/genética , Secuencias Repetitivas Esparcidas/genética , Repeticiones de Microsatélite/genética , Genoma Humano/genética
11.
BMC Biol ; 21(1): 284, 2023 12 08.
Artículo en Inglés | MEDLINE | ID: mdl-38066641

RESUMEN

BACKGROUND: Sea snakes underwent a complete transition from land to sea within the last ~ 15 million years, yet they remain a conspicuous gap in molecular studies of marine adaptation in vertebrates. RESULTS: Here, we generate four new annotated sea snake genomes, three of these at chromosome-scale (Hydrophis major, H. ornatus and H. curtus), and perform detailed comparative genomic analyses of sea snakes and their closest terrestrial relatives. Phylogenomic analyses highlight the possibility of near-simultaneous speciation at the root of Hydrophis, and synteny maps show intra-chromosomal variations that will be important targets for future adaptation and speciation genomic studies of this system. We then used a strict screen for positive selection in sea snakes (against a background of seven terrestrial snake genomes) to identify genes over-represented in hypoxia adaptation, sensory perception, immune response and morphological development. CONCLUSIONS: We provide the best reference genomes currently available for the prolific and medically important elapid snake radiation. Our analyses highlight the phylogenetic complexity and conserved genome structure within Hydrophis. Positively selected marine-associated genes provide promising candidates for future, functional studies linking genetic signatures to the marine phenotypes of sea snakes and other vertebrates.


Asunto(s)
Elapidae , Hydrophiidae , Animales , Elapidae/genética , Hydrophiidae/genética , Filogenia , Cromosomas/genética
12.
Nat Commun ; 14(1): 7767, 2023 Nov 27.
Artículo en Inglés | MEDLINE | ID: mdl-38012187

RESUMEN

Chimeric antigen receptor (CAR) T cell therapy is effective in treating B cell malignancies, but factors influencing the persistence of functional CAR+ T cells, such as product composition, patients' lymphodepletion, and immune reconstitution, are not well understood. To shed light on this issue, here we conduct a single-cell multi-omics analysis of transcriptional, clonal, and phenotypic profiles from pre- to 1-month post-infusion of CAR+ and CAR- T cells from patients from a CARTELL study (ACTRN12617001579381) who received a donor-derived 4-1BB CAR product targeting CD19. Following infusion, CAR+ T cells and CAR- T cells shows similar differentiation profiles with clonally expanded populations across heterogeneous phenotypes, demonstrating clonal lineages and phenotypic plasticity. We validate these findings in 31 patients with large B cell lymphoma treated with CD19 CAR T therapy. For these patients, we identify using longitudinal mass-cytometry data an association between NK-like subsets and clinical outcomes at 6 months with both CAR+ and CAR- T cells. These results suggest that non-CAR-derived signals can provide information about patients' immune recovery and be used as correlate of clinically relevant parameters.


Asunto(s)
Linfoma de Células B Grandes Difuso , Receptores de Antígenos de Linfocitos T , Humanos , Linfocitos B , Inmunoterapia Adoptiva/métodos , Linfoma de Células B Grandes Difuso/patología , Linfocitos T
13.
Cell Genom ; 3(11): 100379, 2023 Nov 08.
Artículo en Inglés | MEDLINE | ID: mdl-38020977

RESUMEN

Synthetic chromosome engineering is a complex process due to the need to identify and repair growth defects and deal with combinatorial gene essentiality when rearranging chromosomes. To alleviate these issues, we have demonstrated novel approaches for repairing and rearranging synthetic Saccharomyces cerevisiae genomes. We have designed, constructed, and restored wild-type fitness to a synthetic 753,096-bp version of S. cerevisiae chromosome XIV as part of the Synthetic Yeast Genome project. In parallel to the use of rational engineering approaches to restore wild-type fitness, we used adaptive laboratory evolution to generate a general growth-defect-suppressor rearrangement in the form of increased TAR1 copy number. We also extended the utility of the synthetic chromosome recombination and modification by loxPsym-mediated evolution (SCRaMbLE) system by engineering synthetic-wild-type tetraploid hybrid strains that buffer against essential gene loss, highlighting the plasticity of the S. cerevisiae genome in the presence of rational and non-rational modifications.

14.
J Virol ; 97(11): e0070523, 2023 Nov 30.
Artículo en Inglés | MEDLINE | ID: mdl-37843370

RESUMEN

IMPORTANCE: The lack of a reliable method to accurately detect when replication-competent HIV has been cleared is a major challenge in developing a cure. This study introduces a new approach called the HIVepsilon-seq (HIVε-seq) assay, which uses long-read sequencing technology and bioinformatics to scrutinize the HIV genome at the nucleotide level, distinguishing between defective and intact HIV. This study included 30 participants on antiretroviral therapy, including 17 women, and was able to discriminate between defective and genetically intact viruses at the single DNA strand level. The HIVε-seq assay is an improvement over previous methods, as it requires minimal sample, less specialized lab equipment, and offers a shorter turnaround time. The HIVε-seq assay offers a promising new tool for researchers to measure the intact HIV reservoir, advancing efforts towards finding a cure for this devastating disease.


Asunto(s)
Infecciones por VIH , VIH , Provirus , Femenino , Humanos , Linfocitos T CD4-Positivos , ADN Viral/genética , Infecciones por VIH/tratamiento farmacológico , Infecciones por VIH/epidemiología , Infecciones por VIH/virología , Nucleótidos , Provirus/genética , Carga Viral , Análisis de Secuencia de ADN , Masculino , Factores Sexuales , VIH/genética
15.
Brain Commun ; 5(4): fcad208, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37621409

RESUMEN

Cerebellar ataxia, neuropathy and vestibular areflexia syndrome is a progressive, generally late-onset, neurological disorder associated with biallelic pentanucleotide expansions in Intron 2 of the RFC1 gene. The locus exhibits substantial genetic variability, with multiple pathogenic and benign pentanucleotide repeat alleles previously identified. To determine the contribution of pathogenic RFC1 expansions to neurological disease within an Australasian cohort and further investigate the heterogeneity exhibited at the locus, a combination of flanking and repeat-primed PCR was used to screen a cohort of 242 Australasian patients with neurological disease. Patients whose data indicated large gaps within expanded alleles following repeat-primed PCR, underwent targeted long-read sequencing to identify novel repeat motifs at the locus. To increase diagnostic yield, additional probes at the RFC1 repeat region were incorporated into the PathWest diagnostic laboratory targeted neurological disease gene panel to enable first-pass screening of the locus for all samples tested on the panel. Within the Australasian cohort, we detected known pathogenic biallelic expansions in 15.3% (n = 37) of patients. Thirty indicated biallelic AAGGG expansions, two had biallelic 'Maori alleles' [(AAAGG)exp(AAGGG)exp], two samples were compound heterozygous for the Maori allele and an AAGGG expansion, two samples had biallelic ACAGG expansions and one sample was compound heterozygous for the ACAGG and AAGGG expansions. Forty-five samples tested indicated the presence of biallelic expansions not known to be pathogenic. A large proportion (84%) showed complex interrupted patterns following repeat-primed PCR, suggesting that these expansions are likely to be comprised of more than one repeat motif, including previously unknown repeats. Using targeted long-read sequencing, we identified three novel repeat motifs in expanded alleles. Here, we also show that short-read sequencing can be used to reliably screen for the presence or absence of biallelic RFC1 expansions in all samples tested using the PathWest targeted neurological disease gene panel. Our results show that RFC1 pathogenic expansions make a substantial contribution to neurological disease in the Australasian population and further extend the heterogeneity of the locus. To accommodate the increased complexity, we outline a multi-step workflow utilizing both targeted short- and long-read sequencing to achieve a definitive genotype and provide accurate diagnoses for patients.

16.
NPJ Genom Med ; 8(1): 16, 2023 Jul 07.
Artículo en Inglés | MEDLINE | ID: mdl-37419908

RESUMEN

Autosomal dominant polycystic kidney disease (ADPKD) is the most common monogenic cause of kidney failure and is primarily associated with PKD1 or PKD2. Approximately 10% of patients remain undiagnosed after standard genetic testing. We aimed to utilise short and long-read genome sequencing and RNA studies to investigate undiagnosed families. Patients with typical ADPKD phenotype and undiagnosed after genetic diagnostics were recruited. Probands underwent short-read genome sequencing, PKD1 and PKD2 coding and non-coding analyses and then genome-wide analysis. Targeted RNA studies investigated variants suspected to impact splicing. Those undiagnosed then underwent Oxford Nanopore Technologies long-read genome sequencing. From over 172 probands, 9 met inclusion criteria and consented. A genetic diagnosis was made in 8 of 9 (89%) families undiagnosed on prior genetic testing. Six had variants impacting splicing, five in non-coding regions of PKD1. Short-read genome sequencing identified novel branchpoint, AG-exclusion zone and missense variants generating cryptic splice sites and a deletion causing critical intron shortening. Long-read sequencing confirmed the diagnosis in one family. Most undiagnosed families with typical ADPKD have splice-impacting variants in PKD1. We describe a pragmatic method for diagnostic laboratories to assess PKD1 and PKD2 non-coding regions and validate suspected splicing variants through targeted RNA studies.

17.
Brain ; 146(12): 5060-5069, 2023 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-37450567

RESUMEN

Cerebellar ataxia, neuropathy and vestibular areflexia syndrome (CANVAS) is an autosomal recessive neurodegenerative disease, usually caused by biallelic AAGGG repeat expansions in RFC1. In this study, we leveraged whole genome sequencing data from nearly 10 000 individuals recruited within the Genomics England sequencing project to investigate the normal and pathogenic variation of the RFC1 repeat. We identified three novel repeat motifs, AGGGC (n = 6 from five families), AAGGC (n = 2 from one family) and AGAGG (n = 1), associated with CANVAS in the homozygous or compound heterozygous state with the common pathogenic AAGGG expansion. While AAAAG, AAAGGG and AAGAG expansions appear to be benign, we revealed a pathogenic role for large AAAGG repeat configuration expansions (n = 5). Long-read sequencing was used to characterize the entire repeat sequence, and six patients exhibited a pure AGGGC expansion, while the other patients presented complex motifs with AAGGG or AAAGG interruptions. All pathogenic motifs appeared to have arisen from a common haplotype and were predicted to form highly stable G quadruplexes, which have previously been demonstrated to affect gene transcription in other conditions. The assessment of these novel configurations is warranted in CANVAS patients with negative or inconclusive genetic testing. Particular attention should be paid to carriers of compound AAGGG/AAAGG expansions when the AAAGG motif is very large (>500 repeats) or the AAGGG motif is interrupted. Accurate sizing and full sequencing of the satellite repeat with long-read sequencing is recommended in clinically selected cases to enable accurate molecular diagnosis and counsel patients and their families.


Asunto(s)
Ataxia Cerebelosa , Enfermedades del Sistema Nervioso Periférico , Síndrome , Enfermedades Vestibulares , Humanos , Vestibulopatía Bilateral , Ataxia Cerebelosa/genética , Ataxia Cerebelosa/diagnóstico , Enfermedades Neurodegenerativas , Enfermedades del Sistema Nervioso Periférico/diagnóstico , Enfermedades del Sistema Nervioso Periférico/genética , Enfermedades Vestibulares/diagnóstico , Enfermedades Vestibulares/genética
18.
Bioinformatics ; 39(6)2023 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-37252813

RESUMEN

MOTIVATION: Nanopore sequencing is emerging as a key pillar in the genomic technology landscape but computational constraints limiting its scalability remain to be overcome. The translation of raw current signal data into DNA or RNA sequence reads, known as 'basecalling', is a major friction in any nanopore sequencing workflow. Here, we exploit the advantages of the recently developed signal data format 'SLOW5' to streamline and accelerate nanopore basecalling on high-performance computing (HPC) and cloud environments. RESULTS: SLOW5 permits highly efficient sequential data access, eliminating a potential analysis bottleneck. To take advantage of this, we introduce Buttery-eel, an open-source wrapper for Oxford Nanopore's Guppy basecaller that enables SLOW5 data access, resulting in performance improvements that are essential for scalable, affordable basecalling. AVAILABILITY AND IMPLEMENTATION: Buttery-eel is available at https://github.com/Psy-Fer/buttery-eel.


Asunto(s)
Nanoporos , Programas Informáticos , Análisis de Secuencia de ADN/métodos , Genoma , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento
19.
BMC Genomics ; 24(1): 243, 2023 May 05.
Artículo en Inglés | MEDLINE | ID: mdl-37147622

RESUMEN

BACKGROUND: Sex determination is the process whereby the bipotential embryonic gonads become committed to differentiate into testes or ovaries. In genetic sex determination (GSD), the sex determining trigger is encoded by a gene on the sex chromosomes, which activates a network of downstream genes; in mammals these include SOX9, AMH and DMRT1 in the male pathway, and FOXL2 in the female pathway. Although mammalian and avian GSD systems have been well studied, few data are available for reptilian GSD systems. RESULTS: We conducted an unbiased transcriptome-wide analysis of gonad development throughout differentiation in central bearded dragon (Pogona vitticeps) embryos with GSD. We found that sex differentiation of transcriptomic profiles occurs at a very early stage, before the gonad consolidates as a body distinct from the gonad-kidney complex. The male pathway genes dmrt1 and amh and the female pathway gene foxl2 play a key role in early sex differentiation in P. vitticeps, but the central player of the mammalian male trajectory, sox9, is not differentially expressed in P. vitticeps at the bipotential stage. The most striking difference from GSD systems of other amniotes is the high expression of the male pathway genes amh and sox9 in female gonads during development. We propose that a default male trajectory progresses if not repressed by a W-linked dominant gene that tips the balance of gene expression towards the female trajectory. Further, weighted gene expression correlation network analysis revealed novel candidates for male and female sex differentiation. CONCLUSION: Our data reveal that interpretation of putative mechanisms of GSD in reptiles cannot solely depend on lessons drawn from mammals.


Asunto(s)
Reptiles , Procesos de Determinación del Sexo , Diferenciación Sexual , Animales , Femenino , Masculino , Expresión Génica , Regulación del Desarrollo de la Expresión Génica , Gónadas/metabolismo , Reptiles/genética , Procesos de Determinación del Sexo/genética , Diferenciación Sexual/genética , Factor de Transcripción SOX9/genética
20.
Neurol Genet ; 9(2): e200064, 2023 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-37090938

RESUMEN

Objective: Duchenne muscular dystrophy (DMD) is caused by pathogenic variants in the dystrophin gene (DMD). Hypermethylated CGG expansions within DIP2B 5' UTR are associated with an intellectual development disorder. Here, we demonstrate the diagnostic utility of genomic short-read sequencing (SRS) and transcriptome sequencing to identify a novel DMD structural variant (SV) and a DIP2B CGG expansion in a patient with DMD for whom conventional diagnostic testing failed to yield a genetic diagnosis. Methods: We performed genomic SRS, skeletal muscle transcriptome sequencing, and targeted programmable long-read sequencing (LRS). Results: The proband had a typical DMD clinical presentation, autism spectrum disorder (ASD), and dystrophinopathy on muscle biopsy. Transcriptome analysis identified 6 aberrantly expressed genes; DMD and DIP2B were the strongest underexpression and overexpression outliers, respectively. Genomic SRS identified a 216 kb paracentric inversion (NC_000023.11: g.33162217-33378800) overlapping 2 DMD promoters. ExpansionHunter indicated an expansion of 109 CGG repeats within the 5' UTR of DIP2B. Targeted genomic LRS confirmed the SV and genotyped the DIP2B repeat expansion as 270 CGG repeats. Discussion: Here, transcriptome data heavily guided genomic analysis to resolve a complex DMD inversion and a DIP2B repeat expansion. Longitudinal follow-up will be important for clarifying the clinical significance of the DIP2B genotype.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...