Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 35
Filtrar
1.
J Hered ; 2024 Aug 22.
Artículo en Inglés | MEDLINE | ID: mdl-39171826

RESUMEN

Pteronarcys californica (Newport 1848) is commonly referred to as the giant salmonfly and is the largest species of stonefly (Insecta: Plecoptera) in the western United States. Historically, it was widespread and abundant in western rivers, but populations have experienced a substantial decline in the past few decades, becoming locally extirpated in numerous rivers in Utah, Colorado, and Montana. Although previous research has explored the ecological variables conducive to the survivability of populations of the giant salmonfly, a lack of genomic resources hampers exploration of how genetic variation is spread across extant populations. To accelerate research on this imperiled species, we present a de novo chromosomal-length genome assembly of P. californica generated from PacBio HiFi sequencing and Hi-C chromosome conformation capture. Our assembly includes 14 predicted pseudo chromosomes and 98.8% of Insecta universal core orthologs. At 2.40 gigabases, the P. californica assembly is the largest of available stonefly assemblies, highlighting at least 9.5-fold variation in assembly size across the order. Repetitive elements (REs) account for much of the genome size increase in P. californica relative to other stonefly species, with the content of Class I retroelements alone exceeding the entire assembly size of all but two other species studied. We also observed preliminary suborder-specific trends in genome size that merit testing with more robust taxon sampling.

2.
J Hered ; 115(1): 139-148, 2024 Feb 03.
Artículo en Inglés | MEDLINE | ID: mdl-37712349

RESUMEN

The Yuma myotis bat (Myotis yumanensis) is a small vespertilionid bat and one of 52 species of new world Myotis bats in the subgenus Pizonyx. While M. yumanensis populations currently appear relatively stable, it is one of 12 bat species known or suspected to be susceptible to white-nose syndrome, the fungal disease causing declines in bat populations across North America. Only two of these 12 species have genome resources available, which limits the ability of resource managers to use genomic techniques to track the responses of bat populations to white-nose syndrome generally. Here we present the first de novo genome assembly for Yuma myotis, generated as a part of the California Conservation Genomics Project. The M. yumanensis genome was generated using a combination of PacBio HiFi long reads and Omni-C chromatin-proximity sequencing technology. This high-quality genome is one of the most complete bat assemblies available, with a contig N50 of 28.03 Mb, scaffold N50 of 99.14 Mb, and BUSCO completeness score of 93.7%. The Yuma myotis genome provides a high-quality resource that will aid in comparative genomic and evolutionary studies, as well as inform conservation management related to white-nose syndrome.


Asunto(s)
Quirópteros , Animales , Quirópteros/genética , América del Norte , Genoma , Genómica , Evolución Biológica
3.
J Hered ; 115(2): 203-211, 2024 Mar 13.
Artículo en Inglés | MEDLINE | ID: mdl-38092381

RESUMEN

Townsend's big-eared bat, Corynorhinus townsendii, is a cave- and mine-roosting species found largely in western North America. Considered a species of conservation concern throughout much of its range, protection efforts would greatly benefit from understanding patterns of population structure, genetic diversity, and local adaptation. To facilitate such research, we present the first de novo genome assembly of C. townsendii as part of the California Conservation Genomics Project (CCGP). Pacific Biosciences HiFi long reads and Omni-C chromatin-proximity sequencing technologies were used to produce a de novo genome assembly, consistent with the standard CCGP reference genome protocol. This assembly comprises 391 scaffolds spanning 2.1 Gb, represented by a scaffold N50 of 174.6 Mb, a contig N50 of 23.4 Mb, and a benchmarking universal single-copy ortholog (BUSCO) completeness score of 96.6%. This high-quality genome will be a key tool for informed conservation and management of this vulnerable species in California and across its range.


Asunto(s)
Quirópteros , Animales , Quirópteros/genética , Genoma , Genómica/métodos , América del Norte
4.
J Hered ; 115(2): 212-220, 2024 Mar 13.
Artículo en Inglés | MEDLINE | ID: mdl-38245832

RESUMEN

The dugong (Dugong dugon) is a marine mammal widely distributed throughout the Indo-Pacific and the Red Sea, with a Vulnerable conservation status, and little is known about many of the more peripheral populations, some of which are thought to be close to extinction. We present a de novo high-quality genome assembly for the dugong from an individual belonging to the well-monitored Moreton Bay population in Queensland, Australia. Our assembly uses long-read PacBio HiFi sequencing and Omni-C data following the Vertebrate Genome Project pipeline to reach chromosome-level contiguity (24 chromosome-level scaffolds; 3.16 Gbp) and high completeness (97.9% complete BUSCOs). We observed relatively high genome-wide heterozygosity, which likely reflects historical population abundance before the last interglacial period, approximately 125,000 yr ago. Demographic inference suggests that dugong populations began declining as sea levels fell after the last interglacial period, likely a result of population fragmentation and habitat loss due to the exposure of seagrass meadows. We find no evidence for ongoing recent inbreeding in this individual. However, runs of homozygosity indicate some past inbreeding. Our draft genome assembly will enable range-wide assessments of genetic diversity and adaptation, facilitate effective management of dugong populations, and allow comparative genomics analyses including with other sirenians, the oldest marine mammal lineage.


Asunto(s)
Caniformia , Dugong , Animales , Australia , Ecosistema , Océano Índico , Cetáceos , Cromosomas
5.
Proc Natl Acad Sci U S A ; 118(11)2021 03 16.
Artículo en Inglés | MEDLINE | ID: mdl-33836575

RESUMEN

Technological advances have allowed improvements in genome reference sequence assemblies. Here, we combined long- and short-read sequence resources to assemble the genome of a female Great Dane dog. This assembly has improved continuity compared to the existing Boxer-derived (CanFam3.1) reference genome. Annotation of the Great Dane assembly identified 22,182 protein-coding gene models and 7,049 long noncoding RNAs, including 49 protein-coding genes not present in the CanFam3.1 reference. The Great Dane assembly spans the majority of sequence gaps in the CanFam3.1 reference and illustrates that 2,151 gaps overlap the transcription start site of a predicted protein-coding gene. Moreover, a subset of the resolved gaps, which have an 80.95% median GC content, localize to transcription start sites and recombination hotspots more often than expected by chance, suggesting the stable canine recombinational landscape has shaped genome architecture. Alignment of the Great Dane and CanFam3.1 assemblies identified 16,834 deletions and 15,621 insertions, as well as 2,665 deletions and 3,493 insertions located on secondary contigs. These structural variants are dominated by retrotransposon insertion/deletion polymorphisms and include 16,221 dimorphic canine short interspersed elements (SINECs) and 1,121 dimorphic long interspersed element-1 sequences (LINE-1_Cfs). Analysis of sequences flanking the 3' end of LINE-1_Cfs (i.e., LINE-1_Cf 3'-transductions) suggests multiple retrotransposition-competent LINE-1_Cfs segregate among dog populations. Consistent with this conclusion, we demonstrate that a canine LINE-1_Cf element with intact open reading frames can retrotranspose its own RNA and that of a SINEC_Cf consensus sequence in cultured human cells, implicating ongoing retrotransposon activity as a driver of canine genetic variation.


Asunto(s)
Perros/genética , Secuencia Rica en GC , Genoma , Secuencias Repetitivas Esparcidas , Animales , Perros/clasificación , Elementos de Nucleótido Esparcido Largo , Elementos de Nucleótido Esparcido Corto , Especificidad de la Especie
6.
J Hered ; 114(1): 44-51, 2023 03 16.
Artículo en Inglés | MEDLINE | ID: mdl-36099176

RESUMEN

The tricolored blackbird, Agelaius tricolor, is a gregarious species that forms enormous breeding and foraging colonies in wetland and agricultural habitats, primarily in California, USA. Once extremely abundant, species numbers have declined dramatically in the past century, largely due to losses of breeding and foraging habitats. Tricolored blackbirds are currently listed as Endangered by the IUCN, and Threatened under the California Endangered Species Act. Increased genetic information is needed to detail the evolutionary consequences of a species-wide bottleneck and inform conservation management. Here, we present a contiguous tricolored blackbird reference genome, assembled with PacBio HiFi long reads and Dovetail Omni-C data to generate a scaffold-level assembly containing multiple chromosome-length scaffolds. This genome adds a valuable resource for important evolutionary and conservation research on tricolored blackbirds and related species.


Asunto(s)
Genoma , Pájaros Cantores , Pájaros Cantores/genética , Animales , Conservación de los Recursos Naturales
7.
Mol Plant Microbe Interact ; 35(12): 1127-1130, 2022 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-36384030

RESUMEN

Most of the Pythium species are pathogenic to a wide range of economically important crops and, sometimes, can even cause diseases in animals and humans. An exception is that the soil-inhabiting P. oligandrum is an effective biocontrol agent against a diverse suite of pathogens and promotes plant growth. In this work, we sequenced the whole genome of P. oligandrum PO-1, isolated from rhizosphere soils of Chinese Angelica sinensis, using a combination of long-read single-molecule real-time sequencing technology (Pacific Biosciences [PacBio]) and Illumina sequencing. The 2.5-Gb and 5.2-Gb bases were generated respectively. The sequencing depths were 93× with PacBio and 145× with Illumina sequencing. With the PacBio sequencing results further corrected by Illumina sequencing, the genome was assembled into 71 scaffolds with a total size of 39.10 Mb (N50 = 1.45 Mb; L50 = 9)and the longest scaffold is 3.49 Mb. Genome annotation identifies 15,632 protein-coding genes and 0.47 Mb of transposable elements. Our genomic assembly and annotation have been greatly improved compared with the already released three genomes of P. oligandrum. This genomic data will provide valuable information to understand the mechanism underlying its biocontrol potentials and will also facilitate the dissection of genome evolution and environmental adaptation within the genus Pythium. [Formula: see text] The author(s) have dedicated the work to the public domain under the Creative Commons CC0 "No Rights Reserved" license by waiving all of his or her rights to the work worldwide under copyright law, including all related and neighboring rights, to the extent allowed by law, 2022.


Asunto(s)
Angelica sinensis , Pythium , Animales , Angelica sinensis/genética , Genoma , Pythium/genética , Rizosfera
8.
J Hered ; 113(6): 665-672, 2022 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-35567593

RESUMEN

The once abundant black abalone, Haliotis cracherodii, is a large, long-lived grazing marine mollusk that inhabits the rocky intertidal along the coast of California. The species has experienced dramatic declines since the mid-1980s largely due to the fatal bacterial disease called withering syndrome, leading to the collapse of an economically important fishery and to its inclusion into the IUCN listing as a critically endangered species. In some places impacted by the disease, populations of black abalone have declined by more than 90%, prompting population crashes associated with very little recruitment of new individuals and changes to intertidal communities. Habitats that were dominated by crustose coralline algae and bare rock have become dominated instead by fleshy algae and sessile invertebrates. Here, we present the first high-quality black abalone reference genome, assembled with PacBio HiFi long-reads and assembled with Dovetail Omni-C data to generate a scaffold-level assembly. The black abalone reference genome will be an essential resource in understanding the evolutionary history of this species as well as for exploring its current levels of genetic diversity and establishing future management and restoration plans.


Asunto(s)
Gastrópodos , Humanos , Animales , Gastrópodos/genética , Especies en Peligro de Extinción , Explotaciones Pesqueras , Ecosistema
9.
J Hered ; 113(6): 615-623, 2022 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-35696092

RESUMEN

The bobcat (Lynx rufus) is a medium-sized carnivore well adapted to various environments and an indicator species for landscape connectivity. It is one of the 4 species within the extant Lynx genus in the family Felidae. Because of its broad geographic distribution and central role in food webs, the bobcat is important for conservation. Here we present a high-quality de novo genome assembly of a male bobcat located in Mendocino County, CA, as part of the California Conservation Genomics Project (CCGP). The assembly was generated using the standard CCGP pipeline from a combination of Omni-C and HiFi technologies. The primary assembly comprises 76 scaffolds spanning 2.4 Gb, represented by a scaffold N50 of 142 Mb, a contig N50 of 66.2 Mb, and a BUSCO completeness score of 95.90%. The bobcat genome will be an important resource for the effective management and conservation of this species and comparative genomics exploration.


Asunto(s)
Felidae , Lynx , Animales , Masculino , Lynx/genética
10.
Mol Plant Microbe Interact ; 34(6): 703-706, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33461337

RESUMEN

Genus Fusarium (Ascomycota, Hypocreales, Nectriaceae) includes many economically important plant pathogens that cause devastating diseases of a wide range of crops and trees. Interestingly, there is increasing evidence that some Fusarium species also live as endophytes and benefit plant growth and stress tolerance. In this work, we sequence the whole genomes of endophytic F. culmorum and F. pseudograminearum, isolated from a coastal dunegrass (Leymus mollis), using long-read single-molecule real-time sequencing technology. Their genomes are assembled into four chromosomes and a mitochondrial genome with a total assembly size of 40.05 and 42.90 M, respectively. This resource should not only facilitate functional studies designed to better understand what makes the two Fusarium species such successful plant-beneficial fungi but should also reveal their genome evolution and adaptation.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.


Asunto(s)
Ascomicetos , Fusarium , Genoma Mitocondrial , Cromosomas , Endófitos/genética , Fusarium/genética , Enfermedades de las Plantas
11.
Mol Plant Microbe Interact ; 34(6): 711-714, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-33522840

RESUMEN

Clitopilus hobsonii (Entolomataceae, Agaricales, Basidiomycetes) is a common soil saprotroph. There is also evidence that C. hobsonii can act as a root endophyte benefitting tree growth. Here, we report the genome assembly of C. hobsonii QYL-10, isolated from ectomycorrhizal root tips of Quercus lyrata. The genome size is 36.93 Mb, consisting of 13 contigs (N50 = 3.3 Mb) with 49.2% GC content. Of them, 10 contigs approached the length of intact chromosomes, and three had telomeres at one end only. BUSCO analysis reported a completeness score of 98.4%, using Basidiomycota_odb10 lineage data. Combining ab-initio, RNA-seq data, and homology-based predictions, we identified 12,710 protein-coding genes. Approximately, 1.43 Mb of transposable elements (3.88% of the assembly), 36 secondary metabolite biosynthetic gene clusters, and 361 genes encoding putative carbohydrate-active enzymes were identified. This genomic resource will allow functional studies aimed to characterize the symbiotic interactions between C. hobsonii and its host trees and will also provide a valuable foundation for further research on comparative genomics of the Entolomataceae.[Formula: see text] Copyright © 2021 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.


Asunto(s)
Agaricales , Basidiomycota , Agaricales/genética , Basidiomycota/genética , Elementos Transponibles de ADN , Endófitos/genética
12.
Exp Cell Res ; 394(2): 112127, 2020 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-32504677

RESUMEN

We are entering into an exciting era of genomics where truly complete, high-quality assemblies of human chromosomes are available end-to-end, or from 'telomere-to-telomere' (T2T). This technological advance offers a new opportunity to include endogenous human centromeric regions in high-resolution, sequence-based studies. These emerging reference maps are expected to reveal a new functional landscape in the human genome, where centromere proteins, transcriptional regulation, and spatial organization can be examined with base-level resolution across different stages of development and disease. Such studies will depend on innovative assembly methods of extremely long tandem repeats (ETRs), or satellite DNAs, paired with the development of new, orthogonal validation methods to ensure accuracy and completeness. This review reflects the progress in centromere genomics, credited by recent advancements in long-read sequencing and assembly methods. In doing so, I will discuss the challenges that remain and the promise for a new period of scientific discovery for satellite DNA biology and centromere function.


Asunto(s)
Centrómero/metabolismo , Genómica , Telómero/metabolismo , ADN Satélite/genética , Humanos , Reproducibilidad de los Resultados , Secuencias Repetidas en Tándem/genética
13.
Mol Plant Microbe Interact ; 32(11): 1472-1474, 2019 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-31306082

RESUMEN

The NA1 clonal lineage of Phytophthora ramorum is responsible for sudden oak death, an epidemic that has devastated California coastal forest ecosystems. An NA1 isolate, Pr102, derived from coast live oak in California, was previously sequenced and reported with a 65-Mb assembly containing 12 Mb of gaps in 2,576 scaffolds. Here, we report an improved 70-Mb genome in 1,512 scaffolds with 6,752 bp of gaps after incorporating PacBio P5-C3 long reads. This assembly contains 19,494 gene models (average gene length of 2,515 bp) compared with 16,134 genes (average gene length of 1,673 bp) in the previous version. We predicted 29 new RXLR genes and 76 new paralogs of a total 392 RXLR genes from this assembly. We predicted 35 CRN genes compared with 19 in an earlier version with six paralogs. Our long non-coding RNA prediction identified 255 candidates. This new resource will be invaluable for future evolution studies on the invasive plant pathogen.


Asunto(s)
Genoma de Protozoos , Phytophthora , California , Genoma de Protozoos/genética , Phytophthora/genética , Enfermedades de las Plantas/parasitología , Quercus/parasitología , Análisis de Secuencia de ADN
14.
Parasitology ; 144(10): 1302-1315, 2017 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-28583210

RESUMEN

Genome assemblies can form the basis of comparative analyses fostering insight into the evolutionary genetics of a parasite's pathogenicity, host-pathogen interactions, environmental constraints and invasion biology; however, the length and complexity of many parasite genomes has hampered the development of well-resolved assemblies. In order to improve Trichinella genome assemblies, the genome of the sylvatic encapsulated species Trichinella murrelli was sequenced using third-generation, long-read technology and, using syntenic comparisons, scaffolded to a reference genome assembly of Trichinella spiralis, markedly improving both. A high-quality draft assembly for T. murrelli was achieved that totalled 63·2 Mbp, half of which was condensed into 26 contigs each longer than 571 000 bp. When compared with previous assemblies for parasites in the genus, ours required 10-fold fewer contigs, which were five times longer, on average. Better assembly across repetitive regions also enabled resolution of 8 Mbp of previously indeterminate sequence. Furthermore, syntenic comparisons identified widespread scaffold misassemblies in the T. spiralis reference genome. The two new assemblies, organized for the first time into three chromosomal scaffolds, will be valuable resources for future studies linking phenotypic traits within each species to their underlying genetic bases.


Asunto(s)
Evolución Molecular , Genoma de los Helmintos/genética , Sintenía , Trichinella/genética , Animales , Análisis de Secuencia de ADN
15.
Microb Genom ; 10(5)2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38713194

RESUMEN

Whole-genome reconstruction of bacterial pathogens has become an important tool for tracking transmission and antimicrobial resistance gene spread, but highly accurate and complete assemblies have largely only historically been achievable using hybrid long- and short-read sequencing. We previously found the Oxford Nanopore Technologies (ONT) R10.4/kit12 flowcell/chemistry produced improved assemblies over the R9.4.1/kit10 combination, however long-read only assemblies contained more errors compared to Illumina-ONT hybrid assemblies. ONT have since released an R10.4.1/kit14 flowcell/chemistry upgrade and recommended the use of Bovine Serum Albumin (BSA) during library preparation, both of which reportedly increase accuracy and yield. They have also released updated basecallers trained using native bacterial DNA containing methylation sites intended to fix systematic basecalling errors, including common adenosine (A) to guanine (G) and cytosine (C) to thymine (T) substitutions. To evaluate these improvements, we successfully sequenced four bacterial reference strains, namely Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa and Staphylococcus aureus, and nine genetically diverse E. coli bloodstream infection-associated isolates from different phylogroups and sequence types, both with and without BSA. These sequences were de novo assembled and compared against Illumina-corrected reference genomes. In this small evaluation of 13 isolates we found that nanopore long-read-only R10.4.1/kit 14 assemblies with updated basecallers trained using bacterial methylated DNA produce accurate assemblies with ≥40×depth, sufficient to be cost-effective compared with hybrid ONT/Illumina sequencing in our setting.


Asunto(s)
Genoma Bacteriano , Nanoporos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Escherichia coli/genética , Staphylococcus aureus/genética , Análisis de Secuencia de ADN/métodos , Pseudomonas aeruginosa/genética , Secuenciación de Nanoporos/métodos , ADN Bacteriano/genética , Klebsiella pneumoniae/genética , Secuenciación Completa del Genoma/métodos , Bacterias/genética , Bacterias/clasificación , Humanos
16.
Mol Ecol Resour ; : e13982, 2024 May 27.
Artículo en Inglés | MEDLINE | ID: mdl-38800997

RESUMEN

The improvement and decreasing costs of third-generation sequencing technologies has widened the scope of biological questions researchers can address with de novo genome assemblies. With the increasing number of reference genomes, validating their integrity with minimal overhead is vital for establishing confident results in their applications. Here, we present Klumpy, a tool for detecting and visualizing both misassembled regions in a genome assembly and genetic elements (e.g. genes) of interest in a set of sequences. By leveraging the initial raw reads in combination with their respective genome assembly, we illustrate Klumpy's utility by investigating antifreeze glycoprotein (afgp) loci across two icefishes, by searching for a reported absent gene in the northern snakehead fish, and by scanning the reference genomes of a mudskipper and bumblebee for misassembled regions. In the two former cases, we were able to provide support for the noncanonical placement of an afgp locus in the icefishes and locate the missing snakehead gene. Furthermore, our genome scans were able identify an unmappable locus in the mudskipper reference genome and identify a putative repetitive element shared among several species of bees.

17.
G3 (Bethesda) ; 14(10)2024 Oct 07.
Artículo en Inglés | MEDLINE | ID: mdl-39115373

RESUMEN

The northern pike Esox lucius is a freshwater fish with low genetic diversity but ecological success throughout the Northern Hemisphere. Here, we generate an annotated chromosome-level genome assembly of 941 Mbp in length with 25 chromosome-length scaffolds. We then genotype 47 northern pike from Alaska through New Jersey at a genome-wide scale and characterize a striking decrease in genetic diversity along the sampling range. Individuals west of the North American Continental Divide have substantially higher diversity than those to the east (e.g. Interior Alaska and St. Lawrence River have on average 181 and 64K heterozygous SNPs per individual, or a heterozygous SNP every 5.2 and 14.6 kbp, respectively). Individuals clustered within each population with strong support, with numerous private alleles observed within each population. Evidence for recent population expansion was observed for a Manitoba hatchery and the St. Lawrence population (Tajima's D = -1.07 and -1.30, respectively). Several chromosomes have large regions with elevated diversity, including LG24, which holds amhby, the ancestral sex determining gene. As expected amhby was largely male-specific in Alaska and the Yukon and absent southeast to these populations, but we document some amhby(-) males in Alaska and amhby(+) males in the Columbia River, providing evidence for a patchwork of presence of this system in the western region. These results support the theory that northern pike recolonized North America from refugia in Alaska and expanded following deglaciation from west to east, with probable founder effects resulting in loss of both neutral and functional diversity (e.g. amhby).


Asunto(s)
Esocidae , Variación Genética , Polimorfismo de Nucleótido Simple , Secuenciación Completa del Genoma , Animales , Esocidae/genética , Masculino , Procesos de Determinación del Sexo/genética , Femenino , Genoma , América del Norte , Genética de Población , Genotipo
18.
Genes (Basel) ; 15(2)2024 02 09.
Artículo en Inglés | MEDLINE | ID: mdl-38397211

RESUMEN

The SpTransformer (SpTrf) gene family in the purple sea urchin, Strongylocentrotus purpuratus, encodes immune response proteins. The genes are clustered, surrounded by short tandem repeats, and some are present in genomic segmental duplications. The genes share regions of sequence and include repeats in the coding exon. This complex structure is consistent with putative local genomic instability. Instability of the SpTrf gene cluster was tested by 10 days of growth of Escherichia coli harboring bacterial artificial chromosome (BAC) clones of sea urchin genomic DNA with inserts containing SpTrf genes. After the growth period, the BAC DNA inserts were analyzed for size and SpTrf gene content. Clones with multiple SpTrf genes showed a variety of deletions, including loss of one, most, or all genes from the cluster. Alternatively, a BAC insert with a single SpTrf gene was stable. BAC insert instability is consistent with variations in the gene family composition among sea urchins, the types of SpTrf genes in the family, and a reduction in the gene copy number in single coelomocytes. Based on the sequence variability among SpTrf genes within and among sea urchins, local genomic instability of the family may be important for driving sequence diversity in this gene family that would be of benefit to sea urchins in their arms race with marine microbes.


Asunto(s)
Strongylocentrotus purpuratus , Animales , Strongylocentrotus purpuratus/genética , Cromosomas Artificiales Bacterianos/genética , Familia de Multigenes , ADN , Erizos de Mar/genética , Inestabilidad Genómica
19.
Microb Genom ; 9(1)2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36748454

RESUMEN

Complete, accurate, cost-effective, and high-throughput reconstruction of bacterial genomes for large-scale genomic epidemiological studies is currently only possible with hybrid assembly, combining long- (typically using nanopore sequencing) and short-read (Illumina) datasets. Being able to use nanopore-only data would be a significant advance. Oxford Nanopore Technologies (ONT) have recently released a new flowcell (R10.4) and chemistry (Kit12), which reportedly generate per-read accuracies rivalling those of Illumina data. To evaluate this, we sequenced DNA extracts from four commonly studied bacterial pathogens, namely Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa and Staphylococcus aureus, using Illumina and ONT's R9.4.1/Kit10, R10.3/Kit12, R10.4/Kit12 flowcells/chemistries. We compared raw read accuracy and assembly accuracy for each modality, considering the impact of different nanopore basecalling models, commonly used assemblers, sequencing depth, and the use of duplex versus simplex reads. 'Super accuracy' (sup) basecalled R10.4 reads - in particular duplex reads - have high per-read accuracies and could be used to robustly reconstruct bacterial genomes without the use of Illumina data. However, the per-run yield of duplex reads generated in our hands with standard sequencing protocols was low (typically <10 %), with substantial implications for cost and throughput if relying on nanopore data only to enable bacterial genome reconstruction. In addition, recovery of small plasmids with the best-performing long-read assembler (Flye) was inconsistent. R10.4/Kit12 combined with sup basecalling holds promise as a singular sequencing technology in the reconstruction of commonly studied bacterial genomes, but hybrid assembly (Illumina+R9.4.1 hac) currently remains the highest throughput, most robust, and cost-effective approach to fully reconstruct these bacterial genomes.


Asunto(s)
Nanoporos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Genómica/métodos , Análisis de Secuencia de ADN/métodos , Genoma Bacteriano/genética
20.
Mol Ecol Resour ; 23(6): 1442-1457, 2023 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-36939021

RESUMEN

Although plastid genome (plastome) structure is highly conserved across most seed plants, investigations during the past two decades have revealed several disparately related lineages that experienced substantial rearrangements. Most plastomes contain a large inverted repeat and two single-copy regions, and a few dispersed repeats; however, the plastomes of some taxa harbour long repeat sequences (>300 bp). These long repeats make it challenging to assemble complete plastomes using short-read data, leading to misassemblies and consensus sequences with spurious rearrangements. Single-molecule, long-read sequencing has the potential to overcome these challenges, yet there is no consensus on the most effective method for accurately assembling plastomes using long-read data. We generated a pipeline, plastid Genome Assembly Using Long-read data (ptGAUL), to address the problem of plastome assembly using long-read data from Oxford Nanopore Technologies (ONT) or Pacific Biosciences platforms. We demonstrated the efficacy of the ptGAUL pipeline using 16 published long-read data sets. We showed that ptGAUL quickly produces accurate and unbiased assemblies using only ~50× coverage of plastome data. Additionally, we deployed ptGAUL to assemble four new Juncus (Juncaceae) plastomes using ONT long reads. Our results revealed many long repeats and rearrangements in Juncus plastomes compared with basal lineages of Poales. The ptGAUL pipeline is available on GitHub: https://github.com/Bean061/ptgaul.


Asunto(s)
Genoma de Plastidios , Secuencias Repetitivas de Ácidos Nucleicos , Reordenamiento Génico , Plastidios/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA