Búsqueda | Portal de Búsqueda de la BVS

1.

The functional impact of 1,570 individual amino acid substitutions in human OTC.

Lo, Russell S; Cromie, Gareth A; Tang, Michelle; Teng, Kevin; Owens, Katherine; Sirr, Amy; Kutz, J Nathan; Morizono, Hiroki; Caldovic, Ljubica; Ah Mew, Nicholas; Gropman, Andrea; Dudley, Aimée M.

Am J Hum Genet ; 110(5): 863-879, 2023 05 04.

Artículo en Inglés | MEDLINE | ID: mdl-37146589

RESUMEN

Deleterious mutations in the X-linked gene encoding ornithine transcarbamylase (OTC) cause the most common urea cycle disorder, OTC deficiency. This rare but highly actionable disease can present with severe neonatal onset in males or with later onset in either sex. Individuals with neonatal onset appear normal at birth but rapidly develop hyperammonemia, which can progress to cerebral edema, coma, and death, outcomes ameliorated by rapid diagnosis and treatment. Here, we develop a high-throughput functional assay for human OTC and individually measure the impact of 1,570 variants, 84% of all SNV-accessible missense mutations. Comparison to existing clinical significance calls, demonstrated that our assay distinguishes known benign from pathogenic variants and variants with neonatal onset from late-onset disease presentation. This functional stratification allowed us to identify score ranges corresponding to clinically relevant levels of impairment of OTC activity. Examining the results of our assay in the context of protein structure further allowed us to identify a 13 amino acid domain, the SMG loop, whose function appears to be required in human cells but not in yeast. Finally, inclusion of our data as PS3 evidence under the current ACMG guidelines, in a pilot reclassification of 34 variants with complete loss of activity, would change the classification of 22 from variants of unknown significance to clinically actionable likely pathogenic variants. These results illustrate how large-scale functional assays are especially powerful when applied to rare genetic diseases.

Asunto(s)

Hiperamonemia , Enfermedad por Deficiencia de Ornitina Carbamoiltransferasa , Ornitina Carbamoiltransferasa , Humanos , Sustitución de Aminoácidos , Hiperamonemia/etiología , Hiperamonemia/genética , Mutación Missense/genética , Ornitina Carbamoiltransferasa/genética , Enfermedad por Deficiencia de Ornitina Carbamoiltransferasa/genética , Enfermedad por Deficiencia de Ornitina Carbamoiltransferasa/diagnóstico , Enfermedad por Deficiencia de Ornitina Carbamoiltransferasa/terapia

2.

Oxford Nanopore's 2024 sequencing technology for Listeria monocytogenes outbreak detection and source attribution: progress and clone-specific challenges.

Biggel, Michael; Cernela, Nicole; Horlbog, Jule Anna; Stephan, Roger.

J Clin Microbiol ; : e0108324, 2024 Oct 04.

Artículo en Inglés | MEDLINE | ID: mdl-39365069

RESUMEN

Whole genome sequencing is an essential cornerstone of pathogen surveillance and outbreak detection. Established sequencing technologies are currently being challenged by Oxford Nanopore Technologies (ONT), which offers an accessible and cost-effective alternative enabling gap-free assemblies of chromosomes and plasmids. Limited accuracy has hindered its use for investigating pathogen transmission, but recent technology updates have brought significant improvements. To evaluate its readiness for outbreak detection, we selected 78 Listeria monocytogenes isolates from diverse lineages or known epidemiological clusters for sequencing with ONT's V14 Rapid Barcoding Kit and R10.4.1 flow cells. The most accurate of several tested workflows generated assemblies with a median of one error (SNP or indel) per assembly. For 66 isolates, the cgMLST profiles from ONT-only assemblies were identical to those generated from Illumina data. Eight assemblies were of lower quality, with more than 20 erroneous sites each, primarily caused by methylations at the GAAGAC motif (5'-GAAG6mAC-3'/5'-GT4mCTTC-3'). This led to inaccurate clustering, failing to group isolates from a persistence-associated clone that carried the responsible restriction-modification system. Out of 50 methylation motifs detected among the 78 isolates, only the GAAGAC motif was linked to substantially increased error rates. Our study shows that most L. monocytogenes genomes assembled from ONT-only data are suitable for high-resolution genotyping, but further improvements of chemistries or basecallers are required for reliable routine use in outbreak and food safety investigations.

3.

Forensic DNA phenotyping using Oxford Nanopore Sequencing system.

Sapan, Veysel; Simsek, Sumeyye Zulal; Filoglu, Gonul; Bulbul, Ozlem.

Electrophoresis ; 2024 May 25.

Artículo en Inglés | MEDLINE | ID: mdl-38794987

RESUMEN

In forensic science, the demand for precision, consistency, and cost-effectiveness has driven the exploration of next-generation sequencing technologies. This study investigates the potential of Oxford Nanopore Sequencing (ONT) Technology for analyzing the HIrisPlex-S panel, a set of 41 single nucleotide polymorphism (SNP) markers used to predict eye, hair, and skin color. Using ONT sequencing, we assessed the accuracy and reliability of ONT-generated data by comparing it with conventional capillary electrophoresis (CE) in 18 samples. The Guppy v6.1 was used as a basecaller, and sample profiles were obtained using Burrows-Wheeler Aligner, Samtools, BCFtools, and Python. Comparing accuracy with CE, we found that 62% of SNPs in ONT-unligated samples were correctly genotyped, with 36% showing allele dropout, and 2% being incorrectly genotyped. In the ONT-ligated samples, 85% of SNPs were correctly genotyped, with 10% showing allele dropout, and 5% being incorrectly genotyped. Our findings indicate that ONT, particularly when combined with ligation, enhances genotyping accuracy and coverage, thereby reducing allele dropouts. However, challenges associated with the technology's error rates and the impact on genotyping accuracy are recognized. Phenotype predictions based on ONT data demonstrate varying degrees of success, with the technology showing high accuracy in several cases. Although ONT technology holds promise in forensic genetics, further optimization and quality control measures are essential to harness its full potential. This study contributes to the ongoing efforts to refine sequence read tuning and improve correction tools in the context of ONT technology's application in forensic genetics.

4.

Novel regulatory variant in ABO intronic RUNX1 binding site inducing A₃ phenotype.

Thun, Gian Andri; Gueuning, Morgan; Sigurdardottir, Sonja; Meyer, Eduardo; Gourri, Elise; Schneider, Linda; Merki, Yvonne; Trost, Nadine; Neuenschwander, Kathrin; Engström, Charlotte; Frey, Beat M; Meyer, Stefan; Mattle-Greminger, Maja P.

Vox Sang ; 119(4): 377-382, 2024 Apr.

Artículo en Inglés | MEDLINE | ID: mdl-38226545

RESUMEN

BACKGROUND AND OBJECTIVES: Mixed-field agglutination in ABO phenotyping (A3, B3) has been linked to genetically different blood cell populations such as in chimerism, or to rare variants in either ABO exon 7 or regulatory regions. Clarification of such cases is challenging and would greatly benefit from sequencing technologies that allow resolving full-gene haplotypes at high resolution. MATERIALS AND METHODS: We used long-read sequencing by Oxford Nanopore Technologies to sequence the entire ABO gene, amplified in two overlapping long-range PCR fragments, in a blood donor presented with A3B phenotype. Confirmation analyses were carried out by Sanger sequencing and included samples from other family members. RESULTS: Our data revealed a novel heterozygous g.10924C>A variant on the ABO*A allele located in the transcription factor binding site for RUNX1 in intron 1 (+5.8 kb site). Inheritance was shown by the results of the donor's mother, who shared the novel variant and the anti-A specific mixed-field agglutination. CONCLUSION: We discovered a regulatory variant in the 8-bp RUNX1 motif of ABO, which extends current knowledge of three other variants affecting the same motif and also leading to A3 or B3 phenotypes. Overall, long-range PCR combined with nanopore sequencing proved powerful and showed great potential as an emerging strategy for resolving cases with cryptic ABO phenotypes.

Asunto(s)

Sistema del Grupo Sanguíneo ABO , Subunidad alfa 2 del Factor de Unión al Sitio Principal , Humanos , Intrones/genética , Subunidad alfa 2 del Factor de Unión al Sitio Principal/genética , Fenotipo , Alelos , Sitios de Unión , Sistema del Grupo Sanguíneo ABO/genética , Genotipo

5.

Parallel in-depth analysis of repeat expansions in ataxia patients by long-read sequencing.

Erdmann, Hannes; Schöberl, Florian; Giurgiu, Madalina; Leal Silva, Rafaela Magalhaes; Scholz, Veronika; Scharf, Florentine; Wendlandt, Martin; Kleinle, Stephanie; Deschauer, Marcus; Nübling, Georg; Heide, Wolfgang; Babacan, Sait Seymen; Schneider, Christine; Neuhann, Teresa; Hahn, Katrin; Schoser, Benedikt; Holinski-Feder, Elke; Wolf, Dieter A; Abicht, Angela.

Brain ; 146(5): 1831-1843, 2023 05 02.

Artículo en Inglés | MEDLINE | ID: mdl-36227727

RESUMEN

Instability of simple DNA repeats has been known as a common cause of hereditary ataxias for over 20 years. Routine genetic diagnostics of these phenotypically similar diseases still rely on an iterative workflow for quantification of repeat units by PCR-based methods of limited precision. We established and validated clinical nanopore Cas9-targeted sequencing, an amplification-free method for simultaneous analysis of 10 repeat loci associated with clinically overlapping hereditary ataxias. The method combines target enrichment by CRISPR-Cas9, Oxford Nanopore long-read sequencing and a bioinformatics pipeline using the tools STRique and Megalodon for parallel detection of length, sequence, methylation and composition of the repeat loci. Clinical nanopore Cas9-targeted sequencing allowed for the precise and parallel analysis of 10 repeat loci associated with adult-onset ataxia and revealed additional parameter such as FMR1 promotor methylation and repeat sequence required for diagnosis at the same time. Using clinical nanopore Cas9-targeted sequencing we analysed 100 clinical samples of undiagnosed ataxia patients and identified causative repeat expansions in 28 patients. Parallel repeat analysis enabled a molecular diagnosis of ataxias independent of preconceptions on the basis of clinical presentation. Biallelic expansions within RFC1 were identified as the most frequent cause of ataxia. We characterized the RFC1 repeat composition of all patients and identified a novel repeat motif, AGGGG. Our results highlight the power of clinical nanopore Cas9-targeted sequencing as a readily expandable workflow for the in-depth analysis and diagnosis of phenotypically overlapping repeat expansion disorders.

Asunto(s)

Ataxia Cerebelosa , Degeneraciones Espinocerebelosas , Adulto , Humanos , Ataxia/genética , Ataxia Cerebelosa/genética , Biología Computacional , Secuenciación de Nucleótidos de Alto Rendimiento , Proteína de la Discapacidad Intelectual del Síndrome del Cromosoma X Frágil

6.

Detection and Quantification of 5moU RNA Modification from Direct RNA Sequencing Data.

Li, Jiayi; Sun, Feiyang; He, Kunyang; Zhang, Lin; Meng, Jia; Huang, Daiyun; Zhang, Yuxin.

Curr Genomics ; 25(3): 212-225, 2024 May 31.

Artículo en Inglés | MEDLINE | ID: mdl-39086998

RESUMEN

Background: Chemically modified therapeutic mRNAs have gained momentum recently. In addition to commonly used modifications (e.g., pseudouridine), 5moU is considered a promising substitution for uridine in therapeutic mRNAs. Accurate identification of 5-methoxyuridine (5moU) would be crucial for the study and quality control of relevant in vitro-transcribed (IVT) mRNAs. However, current methods exhibit deficiencies in providing quantitative methodologies for detecting such modification. Utilizing the capabilities of Oxford nanopore direct RNA sequencing, in this study, we present NanoML-5moU, a machine-learning framework designed specifically for the read-level detection and quantification of 5moU modification for IVT data. Materials and Methods: Nanopore direct RNA sequencing data from both 5moU-modified and unmodified control samples were collected. Subsequently, a comprehensive analysis and modeling of signal event characteristics (mean, median current intensities, standard deviations, and dwell times) were performed. Furthermore, classical machine learning algorithms, notably the Support Vector Machine (SVM), Random Forest (RF), and XGBoost were employed to discern 5moU modifications within NNUNN (where N represents A, C, U, or G) 5-mers. Results: Notably, the signal event attributes pertaining to each constituent base of the NNUNN 5-mers, in conjunction with the utilization of the XGBoost algorithm, exhibited remarkable performance levels (with a maximum AUROC of 0.9567 in the "AGTTC" reference 5-mer dataset and a minimum AUROC of 0.8113 in the "TGTGC" reference 5-mer dataset). This accomplishment markedly exceeded the efficacy of the prevailing background error comparison model (ELIGOs AUC 0.751 for site-level prediction). The model's performance was further validated through a series of curated datasets, which featured customized modification ratios designed to emulate broader data patterns, demonstrating its general applicability in quality control of IVT mRNA vaccines. The NanoML-5moU framework is publicly available on GitHub (https://github.com/JiayiLi21/NanoML-5moU). Conclusion: NanoML-5moU enables accurate read-level profiling of 5moU modification with nanopore direct RNA-sequencing, which is a powerful tool specialized in unveiling signal patterns in in vitro-transcribed (IVT) mRNAs.

7.

Telomere sequence variability in genotypes from natural plant populations: unusual block-organized double-monomer terminal telomeric arrays.

Belyayev, Alexander; Kalendar, Ruslan; Josefiová, Jirina; Pastová, Ladislava; Habibi, Farzaneh; Mahelka, Václav; Mandák, Bohumil; Krak, Karol.

BMC Genomics ; 24(1): 572, 2023 Sep 26.

Artículo en Inglés | MEDLINE | ID: mdl-37752451

RESUMEN

BACKGROUND: Telomeres are the nucleoprotein complexes that physically cap the ends of eukaryotic chromosomes. Most plants possess Arabidopsis-type telomere sequences (TSs). In addition to terminal TSs, more diverse interstitial TSs exists in plants. Although telomeres have been sufficiently studied, the actual diversity of TSs in land plants is underestimated. RESULTS: We investigate genotypes from seven natural populations with contrasting environments of four Chenopodium species to reveal the variability in TSs by analyzing Oxford Nanopore reads. Fluorescent in situ hybridization was used to localize telomeric repeats on chromosomes. We identified a number of derivative monomers that arise in part of both terminal and interstitial telomeric arrays of a single genotype. The former presents a case of block-organized double-monomer telomers, where blocks of Arabidopsis-type TTTAGGG motifs were interspersed with blocks of derivative TTTAAAA motifs. The latter is an integral part of the satellitome with transformations specific to the inactive genome fraction. CONCLUSIONS: We suggested two alternative models for the possible formation of derivative monomers from telomeric heptamer motifs of Arabidopsis-type. It was assumed that derivatization of TSs is a ubiquitous process in the plant genome but occurrence and frequencies of derivatives may be genotype-specific. We also propose that the formation of non-canonical arrays of TSs, especially at chromosomal termini, may be a source for genomic variability in nature.

Asunto(s)

Arabidopsis , Humanos , Arabidopsis/genética , Hibridación Fluorescente in Situ , Telómero/genética , Genotipo , Eucariontes

8.

Quantitative trait locus for calving traits on Bos taurus autosome 18 in Holstein cattle is embedded in a complex genomic region.

Dachs, Nina; Upadhyay, Maulik; Hannemann, Elisabeth; Hauser, Andreas; Krebs, Stefan; Seichter, Doris; Russ, Ingolf; Gehrke, Lilian Johanna; Thaller, Georg; Medugorac, Ivica.

J Dairy Sci ; 106(3): 1925-1941, 2023 Mar.

Artículo en Inglés | MEDLINE | ID: mdl-36710189

RESUMEN

Although the quantitative trait locus (QTL) on chromosome 18 (BTA18) associated with paternal calving ease and stillbirth in Holstein Friesian cattle and its cross has been known for over 20 years, to our knowledge, the exact causal genetic sequence has yet escaped identification. The aim of this study was to re-examine the region of the published QTL on BTA18 and to investigate the possible reasons behind this elusiveness. For this purpose, we carried out a combined linkage disequilibrium and linkage analysis using genotyping data of 2,697 German Holstein Friesian (HF) animals and subsequent whole-genome sequencing (WGS) data analyses and genome assembly of HF samples. We confirmed the known QTL in the 95% confidence interval of 1.089 Mbp between 58.34 and 59.43 Mbp on BTA18. Additionally, these 4 SNPs in the near-perfect linkage disequilibrium with the QTL haplotype were identified: rs381577268 (on 57,816,137 bp, C/T), rs381878735 (on 59,574,329 bp, A/T), rs464221818 (on 59,329,176 bp, C/T), and rs472502785 (on 59,345,689 bp, T/C). Search for the causal mutation using short and long-read sequences, and methylation data of the BTA18 QTL region did not reveal any candidates though. The assembly showed problems in the region, as well as an abundance of segmental duplications within and around the region. Taking the QTL of BTA18 in Holstein cattle as an example, the data presented in this study comprehensively characterize the genomic features that could also be relevant for other such elusive QTL in various other cattle breeds and livestock species as well.

Asunto(s)

Cromosomas , Sitios de Carácter Cuantitativo , Bovinos , Animales , Fenotipo , Desequilibrio de Ligamiento , Genómica , Polimorfismo de Nucleótido Simple

9.

Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology.

Lemay, Marc-André; Sibbesen, Jonas A; Torkamaneh, Davoud; Hamel, Jérémie; Levesque, Roger C; Belzile, François.

BMC Biol ; 20(1): 53, 2022 02 23.

Artículo en Inglés | MEDLINE | ID: mdl-35197050

RESUMEN

BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS: We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS: We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species.

Asunto(s)

Nanoporos , Canadá , Elementos Transponibles de ADN/genética , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Nucleótidos , Análisis de Secuencia de ADN , Glycine max/genética

10.

Long-read Sequencing and de novo Genome Assembly of Three Aspergillus fumigatus Genomes.

Hemmings, Samuel J; Rhodes, Johanna L; Fisher, Matthew C.

Mycopathologia ; 188(4): 409-412, 2023 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-37227556

RESUMEN

Aspergillus fumigatus is a genetically diverse fungal species, which is near ubiquitous in its global distribution and is the major cause of the life-threatening disease invasive aspergillosis. We present 3 de novo genome assemblies that were selected to be representative of the genetic diversity of clinical and environmental A. fumigatus. Sequencing using long-read Oxford Nanopore and subsequent assembly of the genomes yielded 10-23 contigs with an N50 of 4.05 Mbp to 4.93 Mbp.

Asunto(s)

Aspergilosis , Aspergillus fumigatus , Aspergillus fumigatus/genética , Genoma , Aspergilosis/microbiología , Análisis de Secuencia de ADN

11.

Duet: SNP-assisted structural variant calling and phasing using Oxford nanopore sequencing.

Zhou, Yekai; Leung, Amy Wing-Sze; Ahmed, Syed Shakeel; Lam, Tak-Wah; Luo, Ruibang.

BMC Bioinformatics ; 23(1): 465, 2022 Nov 07.

Artículo en Inglés | MEDLINE | ID: mdl-36344913

RESUMEN

BACKGROUND: Whole genome sequencing using the long-read Oxford Nanopore Technologies (ONT) MinION sequencer provides a cost-effective option for structural variant (SV) detection in clinical applications. Despite the advantage of using long reads, however, accurate SV calling and phasing are still challenging. RESULTS: We introduce Duet, an SV detection tool optimized for SV calling and phasing using ONT data. The tool uses novel features integrated from both SV signatures and single-nucleotide polymorphism signatures, which can accurately distinguish SV haplotype from a false signal. Duet was benchmarked against state-of-the-art tools on multiple ONT sequencing datasets of sequencing coverage ranging from 8× to 40×. At low sequencing coverage of 8×, Duet performs better than all other tools in SV calling, SV genotyping and SV phasing. When the sequencing coverage is higher (20× to 40×), the F1-score for SV phasing is further improved in comparison to the performance of other tools, while its performance of SV genotyping and SV calling remains higher than other tools. CONCLUSION: Duet can perform accurate SV calling, SV genotyping and SV phasing using low-coverage ONT data, making it very useful for low-coverage genomes. It has great performance when scaled to high-coverage genomes, which is adaptable to various clinical applications. Duet is open source and is available at https://github.com/yekaizhou/duet .

Asunto(s)

Secuenciación de Nanoporos , Polimorfismo de Nucleótido Simple , Análisis de Secuencia de ADN , Secuenciación de Nucleótidos de Alto Rendimiento , Secuenciación Completa del Genoma

12.

NAD tagSeq reveals that NAD⁺-capped RNAs are mostly produced from a large number of protein-coding genes in Arabidopsis.

Zhang, Hailei; Zhong, Huan; Zhang, Shoudong; Shao, Xiaojian; Ni, Min; Cai, Zongwei; Chen, Xuemei; Xia, Yiji.

Proc Natl Acad Sci U S A ; 116(24): 12072-12077, 2019 06 11.

Artículo en Inglés | MEDLINE | ID: mdl-31142650

RESUMEN

The 5' end of a eukaryotic mRNA transcript generally has a 7-methylguanosine (m7G) cap that protects mRNA from degradation and mediates almost all other aspects of gene expression. Some RNAs in Escherichia coli, yeast, and mammals were recently found to contain an NAD+ cap. Here, we report the development of the method NAD tagSeq for transcriptome-wide identification and quantification of NAD+-capped RNAs (NAD-RNAs). The method uses an enzymatic reaction and then a click chemistry reaction to label NAD-RNAs with a synthetic RNA tag. The tagged RNA molecules can be enriched and directly sequenced using the Oxford Nanopore sequencing technology. NAD tagSeq can allow more accurate identification and quantification of NAD-RNAs, as well as reveal the sequences of whole NAD-RNA transcripts using single-molecule RNA sequencing. Using NAD tagSeq, we found that NAD-RNAs in Arabidopsis were produced by at least several thousand genes, most of which are protein-coding genes, with the majority of these transcripts coming from <200 genes. For some Arabidopsis genes, over 5% of their transcripts were NAD capped. Gene ontology terms overrepresented in the 2,000 genes that produced the highest numbers of NAD-RNAs are related to photosynthesis, protein synthesis, and responses to cytokinin and stresses. The NAD-RNAs in Arabidopsis generally have the same overall sequence structures as the canonical m7G-capped mRNAs, although most of them appear to have a shorter 5' untranslated region (5' UTR). The identification and quantification of NAD-RNAs and revelation of their sequence features can provide essential steps toward understanding the functions of NAD-RNAs.

Asunto(s)

Arabidopsis/genética , NAD/genética , Caperuzas de ARN/genética , ARN Mensajero/genética , Regiones no Traducidas 5'/genética , Expresión Génica/genética , Análisis de Secuencia de ARN

13.

Polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads to improve genomic analyses.

Chen, Zhao; Erickson, David L; Meng, Jianghong.

Genomics ; 113(3): 1366-1377, 2021 05.

Artículo en Inglés | MEDLINE | ID: mdl-33716184

RESUMEN

Oxford Nanopore sequencing has been widely used to achieve complete genomes of bacterial pathogens. However, the error rates of Oxford Nanopore long reads are high. Various polishing algorithms using Illumina short reads to correct the errors in Oxford Nanopore long-read assemblies have been developed. The impact of polishing the Oxford Nanopore long-read assemblies of bacterial pathogens with Illumina short reads on improving genomic analyses was evaluated using both simulated and real reads. Ten species (10 strains) were selected for simulated reads, while real reads were tested on 11 species (11 strains). Oxford Nanopore long reads were assembled with Unicycler to produce a draft assembly, followed by three rounds of polishing with Illumina short reads using two polishing tools, Pilon and NextPolish. One round of NextPolish polishing generated genome completeness and accuracy parameters similar to the reference genomes, whereas two or three rounds of Pilon polishing were needed, though contiguity remained unchanged after polishing. The polished assemblies of Escherichia coli O157:H7, Salmonella Typhimurium, and Cronobacter sakazakii with simulated reads did not provide accurate plasmid identifications. One round of NextPolish polishing was needed for accurately identifying plasmids in Staphylococcus aureus and E. coli O26:H11 with real reads, whereas one and two rounds of Pilon polishing were necessary for these two strains, respectively. Polishing failed to provide an accurate antimicrobial resistance (AMR) genotype for S. aureus with real reads. One round of polishing recovered an accurate AMR genotype for Klebsiella pneumoniae with real reads. The reference genome and draft assembly of Citrobacter braakii with real reads differed, which carried blaCMY-83 and fosA6, respectively, while both genes were present after one round of polishing. However, polishing did not improve the assembly of E. coli O26:H11 with real reads to achieve numbers of virulence genes similar to the reference genome. The draft and polished assemblies showed a phylogenetic tree topology comparable with the reference genomes. For multilocus sequence typing and pan-genome analyses, one round of NextPolish polishing was sufficient to obtain accurate results, while two or three rounds of Pilon polishing were needed. Overall, NextPolish outperformed Pilon for polishing the Oxford Nanopore long-read assemblies of bacterial pathogens, though both polishing strategies improved genomic analyses compared to the draft assemblies.

Asunto(s)

Nanoporos , Escherichia coli , Genoma Bacteriano , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Filogenia , Análisis de Secuencia de ADN/métodos , Staphylococcus aureus

14.

Telomere-to-telomere assembly of the genome of an individual Oikopleura dioica from Okinawa using Nanopore-based sequencing.

Bliznina, Aleksandra; Masunaga, Aki; Mansfield, Michael J; Tan, Yongkai; Liu, Andrew W; West, Charlotte; Rustagi, Tanmay; Chien, Hsiao-Chiao; Kumar, Saurabh; Pichon, Julien; Plessy, Charles; Luscombe, Nicholas M.

BMC Genomics ; 22(1): 222, 2021 Mar 29.

Artículo en Inglés | MEDLINE | ID: mdl-33781200

RESUMEN

BACKGROUND: The larvacean Oikopleura dioica is an abundant tunicate plankton with the smallest (65-70 Mbp) non-parasitic, non-extremophile animal genome identified to date. Currently, there are two genomes available for the Bergen (OdB3) and Osaka (OSKA2016) O. dioica laboratory strains. Both assemblies have full genome coverage and high sequence accuracy. However, a chromosome-scale assembly has not yet been achieved. RESULTS: Here, we present a chromosome-scale genome assembly (OKI2018_I69) of the Okinawan O. dioica produced using long-read Nanopore and short-read Illumina sequencing data from a single male, combined with Hi-C chromosomal conformation capture data for scaffolding. The OKI2018_I69 assembly has a total length of 64.3 Mbp distributed among 19 scaffolds. 99% of the assembly is contained within five megabase-scale scaffolds. We found telomeres on both ends of the two largest scaffolds, which represent assemblies of two fully contiguous autosomal chromosomes. Each of the other three large scaffolds have telomeres at one end only and we propose that they correspond to sex chromosomes split into a pseudo-autosomal region and X-specific or Y-specific regions. Indeed, these five scaffolds mostly correspond to equivalent linkage groups in OdB3, suggesting overall agreement in chromosomal organization between the two populations. At a more detailed level, the OKI2018_I69 assembly possesses similar genomic features in gene content and repetitive elements reported for OdB3. The Hi-C map suggests few reciprocal interactions between chromosome arms. At the sequence level, multiple genomic features such as GC content and repetitive elements are distributed differently along the short and long arms of the same chromosome. CONCLUSIONS: We show that a hybrid approach of integrating multiple sequencing technologies with chromosome conformation information results in an accurate de novo chromosome-scale assembly of O. dioica's highly polymorphic genome. This genome assembly opens up the possibility of cross-genome comparison between O. dioica populations, as well as of studies of chromosomal evolution in this lineage.

Asunto(s)

Secuenciación de Nanoporos , Nanoporos , Urocordados , Animales , Genoma , Masculino , Telómero/genética , Urocordados/genética

15.

Perspectives and Benefits of High-Throughput Long-Read Sequencing in Microbial Ecology.

Tedersoo, Leho; Albertsen, Mads; Anslan, Sten; Callahan, Benjamin.

Appl Environ Microbiol ; 87(17): e0062621, 2021 08 11.

Artículo en Inglés | MEDLINE | ID: mdl-34132589

RESUMEN

Short-read, high-throughput sequencing (HTS) methods have yielded numerous important insights into microbial ecology and function. Yet, in many instances short-read HTS techniques are suboptimal, for example, by providing insufficient phylogenetic resolution or low integrity of assembled genomes. Single-molecule and synthetic long-read (SLR) HTS methods have successfully ameliorated these limitations. In addition, nanopore sequencing has generated a number of unique analysis opportunities, such as rapid molecular diagnostics and direct RNA sequencing, and both Pacific Biosciences (PacBio) and nanopore sequencing support detection of epigenetic modifications. Although initially suffering from relatively low sequence quality, recent advances have greatly improved the accuracy of long-read sequencing technologies. In spite of great technological progress in recent years, the long-read HTS methods (PacBio and nanopore sequencing) are still relatively costly, require large amounts of high-quality starting material, and commonly need specific solutions in various analysis steps. Despite these challenges, long-read sequencing technologies offer high-quality, cutting-edge alternatives for testing hypotheses about microbiome structure and functioning as well as assembly of eukaryote genomes from complex environmental DNA samples.

Asunto(s)

Bacterias/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Bacterias/clasificación , Bacterias/aislamiento & purificación , ADN Bacteriano/genética , Microbiota , Filogenia , Análisis de Secuencia de ADN

16.

Sat-BSA: an NGS-based method using local de novo assembly of long reads for rapid identification of genomic structural variations associated with agronomic traits.

Segawa, Tenta; Nishiyama, Chisato; Tamiru-Oli, Muluneh; Sugihara, Yu; Abe, Akira; Sone, Hinako; Itoh, Noriaki; Asukai, Mayu; Uemura, Aiko; Oikawa, Kaori; Utsushi, Hiroe; Ikegami-Katayama, Ayako; Imamura, Tomohiro; Mori, Masashi; Terauchi, Ryohei; Takagi, Hiroki.

Breed Sci ; 71(3): 299-312, 2021 Jun.

Artículo en Inglés | MEDLINE | ID: mdl-34776737

RESUMEN

Advances in next generation sequencing (NGS)-based methodologies have accelerated the identifications of simple genetic variants such as point mutations and small insertions/deletions (InDels). Structural variants (SVs) including large InDels and rearrangements provide vital sources of genetic diversity for plant breeding. However, their analysis remains a challenge due to their complex nature. Consequently, novel NGS-based approaches are needed to rapidly and accurately identify SVs. Here, we present an NGS-based bulked-segregant analysis (BSA) technique called Sat-BSA (SVs associated with traits) for identifying SVs controlling traits of interest in crops. Sat-BSA targets allele frequencies at all SNP positions to first identify candidate genomic regions associated with a trait, which is then reconstructed by long reads-based local de novo assembly. Finally, the association between SVs, RNA-seq-based gene expression patterns and trait is evaluated for multiple cultivars to narrow down the candidate genes. We applied Sat-BSA to segregating F2 progeny obtained from crosses between turnip cultivars with different tuber colors and successfully isolated two genes harboring SVs that are responsible for tuber phenotypes. The current study demonstrates the utility of Sat-BSA for the identification of SVs associated with traits of interest in species with large and heterozygous genomes.

17.

High-Throughput Sequencing of Phage Display Libraries Reveals Parasitic Enrichment of Indel Mutants Caused by Amplification Bias.

Plessers, Sander; Van Deuren, Vincent; Lavigne, Rob; Robben, Johan.

Int J Mol Sci ; 22(11)2021 May 24.

Artículo en Inglés | MEDLINE | ID: mdl-34073702

RESUMEN

The combination of phage display technology with high-throughput sequencing enables in-depth analysis of library diversity and selection-driven dynamics. We applied short-read sequencing of the mutagenized region on focused display libraries of two homologous nucleic acid modification eraser proteins-AlkB and FTO-biopanned against methylated DNA. This revealed enriched genotypes with small indels and concomitant doubtful amino acid motifs within the FTO library. Nanopore sequencing of the entire display vector showed additional enrichment of large deletions overlooked by region-specific sequencing, and further impacted the interpretation of the obtained amino acid motifs. We could attribute enrichment of these corrupted clones to amplification bias due to arduous FTO display slowing down host cell growth as well as phage production. This amplification bias appeared to be stronger than affinity-based target selection. Recommendations are provided for proper sequence analysis of phage display data, which can improve motive discovery in libraries of proteins that are difficult to display.

Asunto(s)

Bacteriófagos/genética , Variación Genética , Secuenciación de Nucleótidos de Alto Rendimiento , Biblioteca de Péptidos , Mutación INDEL , Análisis de Secuencia de ADN

18.

A chromosome-scale assembly of the smallest Dothideomycete genome reveals a unique genome compaction mechanism in filamentous fungi.

Wang, Bo; Liang, Xiaofei; Gleason, Mark L; Hsiang, Tom; Zhang, Rong; Sun, Guangyu.

BMC Genomics ; 21(1): 321, 2020 Apr 23.

Artículo en Inglés | MEDLINE | ID: mdl-32326892

RESUMEN

BACKGROUND: The wide variation in the size of fungal genomes is well known, but the reasons for this size variation are less certain. Here, we present a chromosome-scale assembly of ectophytic Peltaster fructicola, a surface-dwelling extremophile, based on long-read DNA sequencing technology, to assess possible mechanisms associated with genome compaction. RESULTS: At 18.99 million bases (Mb), P. fructicola possesses one of the smallest known genomes sequence among filamentous fungi. The genome is highly compact relative to other fungi, with substantial reductions in repeat content, ribosomal DNA copies, tRNA gene quantity, and intron sizes, as well as intergenic lengths and the size of gene families. Transposons take up just 0.05% of the entire genome, and no full-length transposon was found. We concluded that reduced genome sizes in filamentous fungi such as P. fructicola, Taphrina deformans and Pneumocystis jirovecii occurred through reduction in ribosomal DNA copy number and reduced intron sizes. These dual mechanisms contrast with genome reduction in the yeast fungus Saccharomyces cerevisiae, whose small and compact genome is associated solely with intron loss. CONCLUSIONS: Our results reveal a unique genomic compaction architecture of filamentous fungi inhabiting plant surfaces, and broaden the understanding of the mechanisms associated with compaction of fungal genomes.

Asunto(s)

Ascomicetos/genética , Cromosomas Fúngicos/genética , Hongos/genética , Genoma Fúngico/genética , Ascomicetos/clasificación , Ascomicetos/fisiología , Genómica/métodos , Filogenia , Enfermedades de las Plantas/microbiología , Análisis de Secuencia de ADN/métodos , Especificidad de la Especie

19.

Benchmarking hybrid assembly approaches for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.

Chen, Zhao; Erickson, David L; Meng, Jianghong.

BMC Genomics ; 21(1): 631, 2020 Sep 14.

Artículo en Inglés | MEDLINE | ID: mdl-32928108

RESUMEN

BACKGROUND: We benchmarked the hybrid assembly approaches of MaSuRCA, SPAdes, and Unicycler for bacterial pathogens using Illumina and Oxford Nanopore sequencing by determining genome completeness and accuracy, antimicrobial resistance (AMR), virulence potential, multilocus sequence typing (MLST), phylogeny, and pan genome. Ten bacterial species (10 strains) were tested for simulated reads of both mediocre- and low-quality, whereas 11 bacterial species (12 strains) were tested for real reads. RESULTS: Unicycler performed the best for achieving contiguous genomes, closely followed by MaSuRCA, while all SPAdes assemblies were incomplete. MaSuRCA was less tolerant of low-quality long reads than SPAdes and Unicycler. The hybrid assemblies of five antimicrobial-resistant strains with simulated reads provided consistent AMR genotypes with the reference genomes. The MaSuRCA assembly of Staphylococcus aureus with real reads contained msr(A) and tet(K), while the reference genome and SPAdes and Unicycler assemblies harbored blaZ. The AMR genotypes of the reference genomes and hybrid assemblies were consistent for the other five antimicrobial-resistant strains with real reads. The numbers of virulence genes in all hybrid assemblies were similar to those of the reference genomes, irrespective of simulated or real reads. Only one exception existed that the reference genome and hybrid assemblies of Pseudomonas aeruginosa with mediocre-quality long reads carried 241 virulence genes, whereas 184 virulence genes were identified in the hybrid assemblies of low-quality long reads. The MaSuRCA assemblies of Escherichia coli O157:H7 and Salmonella Typhimurium with mediocre-quality long reads contained 126 and 118 virulence genes, respectively, while 110 and 107 virulence genes were detected in their MaSuRCA assemblies of low-quality long reads, respectively. All approaches performed well in our MLST and phylogenetic analyses. The pan genomes of the hybrid assemblies of S. Typhimurium with mediocre-quality long reads were similar to that of the reference genome, while SPAdes and Unicycler were more tolerant of low-quality long reads than MaSuRCA for the pan-genome analysis. All approaches functioned well in the pan-genome analysis of Campylobacter jejuni with real reads. CONCLUSIONS: Our research demonstrates the hybrid assembly pipeline of Unicycler as a superior approach for genomic analyses of bacterial pathogens using Illumina and Oxford Nanopore sequencing.

Asunto(s)

Genoma Bacteriano , Genómica/métodos , Secuenciación de Nanoporos/métodos , Benchmarking , Campylobacter jejuni , Mapeo Contig/métodos , Mapeo Contig/normas , Cronobacter sakazakii , Farmacorresistencia Bacteriana , Genómica/normas , Listeria monocytogenes , Secuenciación de Nanoporos/normas , Pseudomonas aeruginosa , Salmonella typhimurium , Virulencia

20.

Comparison of the performance of an amplicon sequencing assay based on Oxford Nanopore technology to real-time PCR assays for detecting bacterial biodefense pathogens.

Player, Robert; Verratti, Kathleen; Staab, Andrea; Bradburne, Christopher; Grady, Sarah; Goodwin, Bruce; Sozhamannan, Shanmuga.

BMC Genomics ; 21(1): 166, 2020 Feb 17.

Artículo en Inglés | MEDLINE | ID: mdl-32066372

RESUMEN

BACKGROUND: The state-of-the-art in nucleic acid based biodetection continues to be polymerase chain reaction (PCR), and many real-time PCR assays targeting biodefense pathogens for biosurveillance are in widespread use. These assays are predominantly singleplex; i.e. one assay tests for the presence of one target, found in a single organism, one sample at a time. Due to the intrinsic limitations of such tests, there exists a critical need for high-throughput multiplex assays to reduce the time and cost incurred when screening multiple targets, in multiple pathogens, and in multiple samples. Such assays allow users to make an actionable call while maximizing the utility of the small volumes of test samples. Unfortunately, current multiplex real-time PCR assays are limited in the number of targets that can be probed simultaneously due to the availability of fluorescence channels in real-time PCR instruments. RESULTS: To address this gap, we developed a pipeline in which the amplicons produced by a 14-plex end-point PCR assay using spiked samples were subsequently sequenced using Nanopore technology. We used bar codes to sequence multiple samples simultaneously, leading to the generation and subsequent analysis of sequence data resulting from a short sequencing run time (< 10 min). We compared the limits of detection (LoD) of real-time PCR assays to Oxford Nanopore Technologies (ONT)-based amplicon sequencing and estimated the sample-to-answer time needed for this approach. Overall, LoDs determined from the first 10 min of sequencing data were at least one to two orders of magnitude lower than real-time PCR. Given enough time, the amplicon sequencing approach is approximately 100 times more sensitive than real-time PCR, with detection of amplicon specific reads even at the lowest tested spiking concentration (around 2.5-50 Colony Forming Units (CFU)/ml). CONCLUSIONS: Based on these results, we propose amplicon sequencing assay as a viable alternative to replace the current real-time PCR based singleplex assays for higher throughput biodefense applications. We note, however, that targeted amplicon specific reads were not detectable even at the highest tested spike concentrations (2.5 X 104-5.0 X105 CFU/ml) without an initial amplification step, indicating that PCR is still necessary when utilizing this protocol.

Asunto(s)

Bacterias/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Reacción en Cadena de la Polimerasa Multiplex , Nanoporos , Nanotecnología , Reacción en Cadena en Tiempo Real de la Polimerasa , Humanos , Reacción en Cadena de la Polimerasa Multiplex/métodos , Reacción en Cadena en Tiempo Real de la Polimerasa/métodos , Reproducibilidad de los Resultados , Sensibilidad y Especificidad

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA