Results 1 - 20 of 230
1.
Elife ; 12, 2024 Oct 04.
Article in English | MEDLINE | ID: mdl-39365295

ABSTRACT

The Xerces Blue (Glaucopsyche xerces) is considered to be the first butterfly to become extinct in historical times. It was notable for its chalky lavender wings with conspicuous white spots on the ventral wings. The last individuals were collected in their restricted habitat, in the dunes near the Presidio military base in San Francisco, in 1941. We sequenced the genomes of four 80- to 100-year-old Xerces Blue specimens, and of seven historical and one modern specimen of its closest relative, the Silvery Blue (Glaucopsyche lygdamus). We compared these to a novel annotated genome of the Green-Underside Blue (Glaucopsyche alexis). Phylogenetic relationships inferred from complete mitochondrial genomes indicate that Xerces Blue was a distinct species that diverged from the Silvery Blue lineage at least 850,000 years ago. Nuclear genomes indicate that both species experienced population growth during the Eemian interglacial period, but that the Xerces Blue subsequently decreased to a very low effective population size, a trend opposite to that observed in the Silvery Blue. Runs of homozygosity and deleterious load in the former were significantly greater than in the latter, suggesting a higher incidence of inbreeding. These signals of population decline observed in Xerces Blue could be used to identify and monitor other insects threatened by human activities, whose extinction patterns are still not well known.
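
The abstract above uses runs of homozygosity (ROH) as a signal of inbreeding but does not say how they were called. As a rough illustration only, the sketch below scans a sorted list of genotyped sites and reports stretches with no heterozygous site that span at least a minimum length. The function names, the input layout, and the zero-tolerance rule are assumptions made for this example. Real ROH callers typically allow for genotyping errors and variable marker density.

```python
from typing import List, Tuple


def runs_of_homozygosity(
    sites: List[Tuple[int, bool]], min_length_bp: int
) -> List[Tuple[int, int]]:
    """Return (start, end) spans of consecutive homozygous sites.

    `sites` is a list of (position, is_heterozygous) pairs sorted by
    position on a single chromosome (hypothetical input layout).
    A run is kept only if it spans at least `min_length_bp` bases.
    """
    runs = []
    run_start = None   # position of the first homozygous site in the current run
    last_hom = None    # position of the most recent homozygous site
    for pos, is_het in sites:
        if is_het:
            # A heterozygous site closes the current run, if there is one.
            if run_start is not None and last_hom - run_start >= min_length_bp:
                runs.append((run_start, last_hom))
            run_start = None
        else:
            if run_start is None:
                run_start = pos
            last_hom = pos
    # Close a run that extends to the end of the data.
    if run_start is not None and last_hom - run_start >= min_length_bp:
        runs.append((run_start, last_hom))
    return runs


def fraction_in_roh(runs: List[Tuple[int, int]], genome_length_bp: int) -> float:
    """Fraction of the genome covered by the reported runs."""
    return sum(end - start for start, end in runs) / genome_length_bp
```

A higher fraction of the genome in long runs is the kind of pattern the abstract interprets as evidence of inbreeding. The length threshold is a free parameter here and would need to be chosen for the species and data at hand.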


Subjects
Butterflies, Biological Extinction, Insect Genome, Phylogeny, Butterflies/genetics, Butterflies/classification, Animals
2.
NPJ Biodivers ; 3(1): 28, 2024 Sep 17.
Article in English | MEDLINE | ID: mdl-39289538

ABSTRACT

A genomic database of all Earth's eukaryotic species could contribute to many scientific discoveries; however, only a tiny fraction of species have genomic information available. In 2018, scientists across the world united under the Earth BioGenome Project (EBP), aiming to produce a database of high-quality reference genomes containing all ~1.5 million recognized eukaryotic species. As the European node of the EBP, the European Reference Genome Atlas (ERGA) sought to implement a new decentralised, equitable and inclusive model for producing reference genomes. For this, ERGA launched a Pilot Project establishing the first distributed reference genome production infrastructure and testing it on 98 eukaryotic species from 33 European countries. Here we outline the infrastructure and explore its effectiveness for scaling high-quality reference genome production, whilst considering equity and inclusion. The outcomes and lessons learned provide a solid foundation for ERGA while offering key learnings to other transnational, national genomic resource projects and the EBP.

3.
Wellcome Open Res ; 9: 292, 2024.
Article in English | MEDLINE | ID: mdl-39114493

ABSTRACT

We present a genome assembly of the free-living nematode Caenorhabditis drosophilae (Nematoda; Chromadorea; Rhabditida; Rhabditidae). The genome sequence is 51.3 megabases in span. Most of the assembly is scaffolded into six chromosomal pseudomolecules, including the X sex chromosome. The mitochondrial genome has also been assembled and is 15.15 kilobases in length.

4.
Sci Rep ; 14(1): 18658, 2024 Aug 12.
Article in English | MEDLINE | ID: mdl-39134591

ABSTRACT

Cyanobacteria are globally occurring photosynthetic bacteria notable for their contribution to primary production and for their production of toxins, which have detrimental ecosystem impacts. Furthermore, cyanobacteria can form mutualistic symbiotic relationships with a diverse set of eukaryotes, including land plants, aquatic plankton and fungi. Nevertheless, not all cyanobacteria are found in symbiotic associations, suggesting that symbiotic cyanobacteria have evolved specializations that facilitate host interactions. Photosynthetic capabilities, nitrogen fixation, and the production of complex biochemicals are key functions provided by host-associated cyanobacterial symbionts. To explore whether additional specializations are associated with such lifestyles in cyanobacteria, we conducted comparative phylogenomics of molecular functions and of biosynthetic gene clusters (BGCs) in 984 cyanobacterial genomes. Cyanobacteria with host-associated and symbiotic lifestyles were concentrated in the family Nostocaceae, where eight monophyletic clades correspond to specific host taxa. In agreement with previous studies, symbionts are likely to provide fixed nitrogen to their eukaryotic partners through multiple different nitrogen fixation pathways. Additionally, our analyses identified chitin-metabolising pathways in cyanobacteria associated with specific host groups, while obligate symbionts had fewer BGCs. The conservation of molecular functions and BGCs between closely related symbiotic and free-living cyanobacteria suggests that more cyanobacteria have the potential to form symbiotic relationships than is currently known.


Subjects
Cyanobacteria, Nitrogen Fixation, Phylogeny, Symbiosis, Cyanobacteria/genetics, Cyanobacteria/metabolism, Bacterial Genome, Multigene Family, Photosynthesis
5.
BMC Genomics ; 25(1): 615, 2024 Jun 18.
Article in English | MEDLINE | ID: mdl-38890582

ABSTRACT

BACKGROUND: Nematodes are the most abundant and diverse metazoans on Earth, and are known to significantly affect ecosystem functioning. A better understanding of their biology and ecology, including potential adaptations to diverse habitats and lifestyles, is key to understanding their response to global change scenarios. Mitochondrial genomes offer high species level characterization, low cost of sequencing, and an ease of data handling that can provide insights into nematode evolutionary pressures. RESULTS: Generally, nematode mitochondrial genomes exhibited similar structural characteristics (e.g., gene size and GC content), but displayed remarkable variability around these general patterns. Compositional strand biases showed strong codon position specific G skews and relationships with nematode life traits (especially parasitic feeding habits) equal to or greater than with predicted phylogeny. On average, nematode mitochondrial genomes showed low non-synonymous substitution rates, but also high clade specific deviations from these means. Despite the presence of significant mutational saturation, non-synonymous (dN) and synonymous (dS) substitution rates could still be significantly explained by feeding habit and/or habitat. Low ratios of dN:dS rates, particularly associated with the parasitic lifestyles, suggested the presence of strong purifying selection. CONCLUSIONS: Nematode mitochondrial genomes demonstrated a capacity to accumulate diversity in composition, structure, and content while still maintaining functional genes. Moreover, they demonstrated a capacity for rapid evolutionary change pointing to a potential interaction between multi-level selection pressures and rapid evolution. In conclusion, this study helps establish a background for our understanding of the potential evolutionary pressures shaping nematode mitochondrial genomes, while outlining likely routes of future inquiry.
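
The "codon position specific G skews" mentioned above can be made concrete with the usual GC skew statistic, (G - C) / (G + C), computed separately for the first, second, and third codon positions of a coding sequence. The sketch below is a minimal illustration under that standard definition. The function names are invented for this example, and it is not the pipeline used in the study.

```python
from typing import Tuple


def gc_skew(seq: str) -> float:
    """GC skew, (G - C) / (G + C); returns 0.0 when the sequence has no G or C."""
    g = seq.count("G")
    c = seq.count("C")
    return (g - c) / (g + c) if (g + c) else 0.0


def gc_skew_by_codon_position(cds: str) -> Tuple[float, float, float]:
    """GC skew at codon positions 1, 2 and 3 of an in-frame coding sequence.

    Trailing bases that do not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    # Slicing with a step of 3 collects every base at one codon position.
    return tuple(gc_skew(cds[offset:usable:3]) for offset in range(3))
```

For example, `gc_skew_by_codon_position("GCAGCAGCC")` separates the codons GCA, GCA and GCC into the per-position strings "GGG", "CCC" and "AAC" before computing each skew.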


Subjects
Mitochondrial Genome, Genomics, Nematoda, Phylogeny, Genetic Selection, Animals, Nematoda/genetics, Genomics/methods, Base Composition, Molecular Evolution, Codon/genetics
6.
BMC Genomics ; 25(1): 596, 2024 Jun 13.
Article in English | MEDLINE | ID: mdl-38872121

ABSTRACT

Molluscan mitochondrial genomes are unusual because they show wide variation in size, radical genome rearrangements and frequently show high variation (> 10%) within species. As progress in understanding this variation has been limited, we used whole genome sequencing of a six-generation matriline of the terrestrial snail Cepaea nemoralis, as well as whole genome sequences from wild-collected C. nemoralis, the sister species C. hortensis, and multiple other snail species to explore the origins of mitochondrial DNA (mtDNA) variation. The main finding is that a high rate of SNP heteroplasmy in somatic tissue was negatively correlated with mtDNA copy number in both Cepaea species. In individuals with under ten mtDNA copies per nuclear genome, more than 10% of all positions were heteroplasmic, with evidence for transmission of this heteroplasmy through the germline. Further analyses showed evidence for purifying selection acting on non-synonymous mutations, even at low frequency of the rare allele, especially in cytochrome oxidase subunit 1 and cytochrome b. The mtDNA of some individuals of Cepaea nemoralis contained a length heteroplasmy, including up to 12 direct repeat copies of tRNA-Val, with 24 copies in another snail, Candidula rugosiuscula, and repeats of tRNA-Thr in C. hortensis. These repeats likely arise due to error prone replication but are not correlated with mitochondrial copy number in C. nemoralis. Overall, the findings provide key insights into mechanisms of replication, mutation and evolution in molluscan mtDNA, and so will inform wider studies on the biology and evolution of mtDNA across animal phyla.


Subjects
DNA Copy Number Variations, Mitochondrial DNA, Mitochondrial Genome, Heteroplasmy, Mutation, Genetic Selection, Snails, Animals, Snails/genetics, Mitochondrial DNA/genetics, Heteroplasmy/genetics, Single Nucleotide Polymorphism
7.
Nat Commun ; 15(1): 5001, 2024 Jun 12.
Article in English | MEDLINE | ID: mdl-38866741

ABSTRACT

Theory predicts that compensatory genetic changes reduce negative indirect effects of selected variants during adaptive evolution, but evidence is scarce. Here, we test this in a wild population of Hawaiian crickets using temporal genomics and a high-quality chromosome-level cricket genome. In this population, a mutation, flatwing, silences males and has spread rapidly due to an acoustically orienting parasitoid. Our sampling spanned a social transition during which flatwing fixed and the population went silent. We find that long-range linkage disequilibrium around the putative flatwing locus was maintained over time, and that hitchhiking genes had functions related to negative flatwing-associated effects. We develop a combinatorial enrichment approach using transcriptome data to test for compensatory, intragenomic coevolution. Temporal changes in genomic selection were distributed genome-wide and functionally associated with the population's transition to silence, particularly behavioural responses to silent environments. Our results demonstrate how 'adaptation begets adaptation'; changes to the sociogenetic environment accompanying rapid trait evolution can generate selection provoking further, compensatory adaptation.


Subjects
Genomics, Gryllidae, Animals, Gryllidae/genetics, Gryllidae/physiology, Male, Genomics/methods, Hawaii, Physiological Adaptation/genetics, Linkage Disequilibrium, Insect Genome, Biological Evolution, Female, Mutation, Genetic Selection, Molecular Evolution, Transcriptome/genetics
8.
J Eukaryot Microbiol ; : e13033, 2024 May 24.
Article in English | MEDLINE | ID: mdl-38785208

ABSTRACT

Microsporidia and Apicomplexa are eukaryotic, single-celled, intracellular parasites with huge public health and economic importance. Typically, these parasites are studied separately, emphasizing their uniqueness and diversity. In this review, we explore the huge amount of genomic data that has recently become available for the two groups. We compare and contrast their genome evolution and discuss how their transitions to intracellular life may have shaped it. In particular, we explore genome reduction and compaction, genome expansion and ploidy, gene shuffling and rearrangements, and the evolution of centromeres and telomeres.

9.
Wellcome Open Res ; 9: 33, 2024.
Article in English | MEDLINE | ID: mdl-38617467

ABSTRACT

Contamination of public databases by mislabelled sequences has been highlighted for many years and the avalanche of novel sequencing data now being deposited has the potential to make databases difficult to use effectively. It is therefore crucial that sequencing projects and database curators perform pre-submission checks to remove obvious contamination and avoid propagating erroneous taxonomic relationships. However, it is important also to recognise that biological contamination of a target sample with unexpected species' DNA can also lead to the discovery of fascinating biological phenomena through the identification of environmental organisms or endosymbionts. Here, we present a novel, integrated method for detection and generation of high-quality genomes of all non-target genomes co-sequenced in eukaryotic genome sequencing projects. After performing taxonomic profiling of an assembly from the raw data, and leveraging the identity of small rRNA sequences discovered therein as markers, a targeted classification approach retrieves and assembles high-quality genomes. The genomes of these cobionts are then not only removed from the target species' genome but also available for further interrogation. Source code is available from https://github.com/CobiontID/MarkerScan. MarkerScan is written in Python and is deployed as a Docker container.


This article addresses a common issue in genetic research: the accidental mixing of genetic information from different species in public databases, often due to mislabelling or contamination. Interestingly, this 'contamination' can sometimes lead to exciting discoveries, like identifying DNA from unexpected species in a sample, revealing insights about organisms that live in the environment of the target organism. In our study, we developed a tool called MarkerScan for identifying these additional species found alongside the target species in eukaryotic genome sequencing projects. The method includes a way to sequence the whole genomes of the additional species. Our method involves sorting through the genetic data to identify certain small RNA sequences, which we then use as markers. These markers help to classify and assemble high-quality genomes from these additional species. This not only cleans up the main target species' genome data but also provides new, valuable genomes for further exploration.

10.
Wellcome Open Res ; 9: 3, 2024.
Article in English | MEDLINE | ID: mdl-38686023

ABSTRACT

We present a genome assembly from an individual female Coleophora flavipennella (the Tipped Oak Case-bearer; Arthropoda; Insecta; Lepidoptera; Coleophoridae). The genome sequence is 989.3 megabases in span. Most of the assembly is scaffolded into 57 chromosomal pseudomolecules, including the W and Z sex chromosomes. The mitochondrial genome has also been assembled and is 15.77 kilobases in length.

11.
Nat Ecol Evol ; 8(4): 777-790, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38383850

ABSTRACT

Chromosomes are a central unit of genome organization. One-tenth of all described species on Earth are butterflies and moths, the Lepidoptera, which generally possess 31 chromosomes. However, some species display dramatic variation in chromosome number. Here we analyse 210 chromosomally complete lepidopteran genomes and show that the chromosomes of extant lepidopterans are derived from 32 ancestral linkage groups, which we term Merian elements. Merian elements have remained largely intact through 250 million years of evolution and diversification. Against this stable background, eight lineages have undergone extensive reorganization either through numerous fissions or a combination of fusion and fission events. Outside these lineages, fusions are rare and fissions are rarer still. Fusions often involve small, repeat-rich Merian elements and the sex-linked element. Our results reveal the constraints on genome architecture in Lepidoptera and provide a deeper understanding of chromosomal rearrangements in eukaryotic genome evolution.


Subjects
Butterflies, Moths, Animals, Butterflies/genetics, Chromosomes, Genomics/methods, Genome, Moths/genetics
12.
PLoS Genet ; 20(1): e1011116, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38227589

ABSTRACT

Heteromorphic sex chromosomes are usually thought to have originated from a pair of autosomes that acquired a sex-determining locus and subsequently stopped recombining, leading to degeneration of the sex-limited chromosome. The majority of nematode species lack heteromorphic sex chromosomes and determine sex using an X-chromosome counting mechanism, with males being hemizygous for one or more X chromosomes (XX/X0). Some filarial nematode species, including important parasites of humans, have heteromorphic XX/XY karyotypes. It has been assumed that sex is determined by a Y-linked locus in these species. However, karyotypic analyses suggested that filarial Y chromosomes are derived from the unfused homologue of an autosome involved in an X-autosome fusion event. Here, we generated a chromosome-level reference genome for Litomosoides sigmodontis, a filarial nematode with the ancestral filarial karyotype and sex determination mechanism (XX/X0). By mapping the assembled chromosomes to the rhabditid nematode ancestral linkage (or Nigon) elements, we infer that the ancestral filarial X chromosome was the product of a fusion between NigonX (the ancestrally X-linked element) and NigonD (ancestrally autosomal). In the two filarial lineages with XY systems, there have been two independent X-autosome chromosome fusion events involving different autosomal Nigon elements. In both lineages, the region shared by the neo-X and neo-Y chromosomes is within the ancestrally autosomal portion of the X, confirming that the filarial Y chromosomes are derived from the unfused homologue of the autosome. Sex determination in XY filarial nematodes therefore likely continues to operate via the ancestral X-chromosome counting mechanism, rather than via a Y-linked sex-determining locus.


Subjects
Filarioidea, Nematoda, Animals, Male, Humans, Y Chromosome/genetics, Sex Chromosomes, X Chromosome/genetics, Human X Chromosomes, Filarioidea/genetics
13.
Microbiol Spectr ; 12(2): e0366923, 2024 Feb 06.
Article in English | MEDLINE | ID: mdl-38214524

ABSTRACT

Microsporidia are obligate intracellular eukaryotic parasites with an extremely broad host range. They have both economic and public health importance. Ploidy in microsporidia is variable, with a few species formally identified as diploid and one as polyploid. Given the increase in the number of studies sequencing microsporidian genomes, it is now possible to assess ploidy levels across all currently explored microsporidian diversity. We estimate ploidy for all microsporidian data sets available on the Sequence Read Archive using k-mer-based analyses, indicating that polyploidy is widespread in Microsporidia and that ploidy change is dynamic in the group. Using genome-wide heterozygosity estimates, we also show that polyploid microsporidian genomes are relatively homozygous, and we discuss the implications of these findings on the timing of polyploidization events and their origin. IMPORTANCE: Microsporidia are single-celled intracellular parasites, distantly related to fungi, that can infect a broad range of hosts, from humans all the way to protozoans. Exploiting the wealth of microsporidian genomic data available, we use k-mer-based analyses to assess ploidy status across the group. Understanding a genome's ploidy is crucial in order to assemble it effectively and may also be relevant for better understanding a parasite's behavior and life cycle. We show that tetraploidy is present in at least six species in Microsporidia and that these polyploidization events are likely to have occurred independently. We discuss why these findings may be paradoxical, given that Microsporidia, like other intracellular parasites, have extremely small, reduced genomes.
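
The abstract does not name the tools behind its "k-mer-based analyses", so the following is only a sketch of the data structure such analyses generally start from: a k-mer spectrum, meaning a histogram of how many distinct k-mers occur once, twice, three times, and so on in the reads. Ploidy inference then looks at the positions and relative heights of coverage peaks in that histogram. All names below are hypothetical, and the in-memory counting shown here would not scale to real sequencing data sets.

```python
from collections import Counter
from typing import Iterable

# Translation table used to build the reverse complement of a k-mer.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def canonical(kmer: str) -> str:
    """Return the lexicographically smaller of a k-mer and its reverse complement."""
    reverse_complement = kmer.translate(_COMPLEMENT)[::-1]
    return min(kmer, reverse_complement)


def kmer_spectrum(reads: Iterable[str], k: int) -> Counter:
    """Map each multiplicity to the number of distinct canonical k-mers seen that often.

    K-mers containing characters other than A, C, G, T (such as N) are skipped.
    """
    counts = Counter()
    valid = set("ACGT")
    for read in reads:
        read = read.upper()
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            if set(kmer) <= valid:
                counts[canonical(kmer)] += 1
    # Histogram over the per-k-mer counts: multiplicity -> number of k-mers.
    return Counter(counts.values())
```

Counting canonical k-mers means that a sequence and its reverse complement contribute to the same entry, which matters because reads come from both strands.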


Subjects
Microsporidia, Humans, Phylogeny, Molecular Evolution, Fungal Genome, Polyploidy
14.
Nat Commun ; 14(1): 7776, 2023 Nov 27.
Article in English | MEDLINE | ID: mdl-38012132

ABSTRACT

Host-parasite interactions exert strong selection pressures on the genomes of both host and parasite. These interactions can lead to negative frequency-dependent selection, a form of balancing selection that is hypothesised to explain the high levels of polymorphism seen in many host immune and parasite antigen loci. Here, we sequence the genomes of several individuals of Heligmosomoides bakeri, a model parasite of house mice, and Heligmosomoides polygyrus, a closely related parasite of wood mice. Although H. bakeri is commonly referred to as H. polygyrus in the literature, their genomes show levels of divergence that are consistent with at least a million years of independent evolution. The genomes of both species contain hyper-divergent haplotypes that are enriched for proteins that interact with the host immune response. Many of these haplotypes originated prior to the divergence between H. bakeri and H. polygyrus, suggesting that they have been maintained by long-term balancing selection. Together, our results suggest that the selection pressures exerted by the host immune response have played a key role in shaping patterns of genetic diversity in the genomes of parasitic nematodes.


Subjects
Nematospiroides dubius, Trichostrongyloidea, Mice, Animals, Host-Parasite Interactions/physiology, Nematospiroides dubius/genetics
15.
Mol Biol Evol ; 40(11), 2023 Nov 03.
Article in English | MEDLINE | ID: mdl-37713621

ABSTRACT

Conservation genetic analyses of many endangered species have been based on genotyping of microsatellite loci and sequencing of short fragments of mtDNA. The increase in power and resolution afforded by whole genome approaches may challenge conclusions made on limited numbers of loci and maternally inherited haploid markers. Here, we provide a matched comparison of whole genome sequencing versus microsatellite and control region (CR) genotyping for Eurasian otters (Lutra lutra). Previous work identified four genetically differentiated "stronghold" populations of otter in Britain, derived from regional populations that survived the population crash of the 1950s-1980s. Using whole genome resequencing data from 45 samples from across the British stronghold populations, we confirmed some aspects of population structure derived from previous marker-driven studies. Importantly, we showed that genomic signals of the population crash bottlenecks matched evidence from otter population surveys. Unexpectedly, two strongly divergent mitochondrial lineages were identified that were undetectable using CR fragments, and otters in the east of England were genetically distinct and surprisingly variable. We hypothesize that this previously unsuspected variability may derive from past releases of Eurasian otters from other, non-British source populations in England around the time of the population bottleneck. Our work highlights that even reasonably well-studied species may harbor genetic surprises, if studied using modern high-throughput sequencing methods.


Subjects
Otters, Animals, Otters/genetics, United Kingdom, Mitochondrial DNA/genetics, Endangered Species, Genomics
16.
Heredity (Edinb) ; 131(5-6): 327-337, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37758900

ABSTRACT

Molluscs are a highly speciose phylum that exhibits an astonishing array of colours and patterns, yet relatively little progress has been made in identifying the underlying genes that determine phenotypic variation. One prominent example is the land snail Cepaea nemoralis for which classical genetic studies have shown that around nine loci, several physically linked and inherited together as a 'supergene', control the shell colour and banding polymorphism. As a first step towards identifying the genes involved, we used whole-genome resequencing of individuals from a laboratory cross to construct a high-density linkage map, and then trait mapping to identify 95% confidence intervals for the chromosomal region that contains the supergene, specifically the colour locus (C), and the unlinked mid-banded locus (U). The linkage map is made up of 215,593 markers, ordered into 22 linkage groups, with one large group making up ~27% of the genome. The C locus was mapped to a ~1.3 cM region on linkage group 11, and the U locus was mapped to a ~0.7 cM region on linkage group 15. The linkage map will serve as an important resource for further evolutionary and population genomic studies of C. nemoralis and related species, as well as the identification of candidate genes within the supergene and for the mid-banding phenotype.


Subjects
Genome, Genetic Polymorphism, Humans, Color, Chromosome Mapping, Phenotype, Genetic Linkage
17.
BMC Bioinformatics ; 24(1): 288, 2023 Jul 18.
Article in English | MEDLINE | ID: mdl-37464285

ABSTRACT

BACKGROUND: PacBio high fidelity (HiFi) sequencing reads are both long (15-20 kb) and highly accurate (> Q20). Because of these properties, they have revolutionised genome assembly, leading to more accurate and contiguous genomes. In eukaryotes, the mitochondrial genome is sequenced alongside the nuclear genome, often at very high coverage. A dedicated tool for mitochondrial genome assembly using HiFi reads is still missing. RESULTS: MitoHiFi was developed within the Darwin Tree of Life Project to assemble mitochondrial genomes from the HiFi reads generated for target species. The input for MitoHiFi is either the raw reads or the assembled contigs, and the tool outputs a mitochondrial genome sequence FASTA file along with annotation of protein and RNA genes. Variants arising from heteroplasmy are assembled independently, and nuclear insertions of mitochondrial sequences are identified and not used in organellar genome assembly. MitoHiFi has been used to assemble 374 mitochondrial genomes (368 Metazoa and 6 Fungi species) for the Darwin Tree of Life Project, the Vertebrate Genomes Project and the Aquatic Symbiosis Genome Project. Inspection of 60 mitochondrial genomes assembled with MitoHiFi for species that already have reference sequences in public databases showed the widespread presence of previously unreported repeats. CONCLUSIONS: MitoHiFi is able to assemble mitochondrial genomes from a wide phylogenetic range of taxa from PacBio HiFi data. MitoHiFi is written in Python and is freely available on GitHub (https://github.com/marcelauliano/MitoHiFi). MitoHiFi is available with its dependencies as a Docker container on GitHub (ghcr.io/marcelauliano/mitohifi:master).


Subjects
Mitochondrial Genome, Phylogeny, RNA, Eukaryota, DNA Sequence Analysis, High-Throughput Nucleotide Sequencing
18.
Wellcome Open Res ; 8: 6, 2023.
Article in English | MEDLINE | ID: mdl-36875804

ABSTRACT

We present a genome assembly from an individual Haliclystus octoradiatus (the spotted kaleidoscope jellyfish; Cnidaria; Staurozoa; Stauromedusae; Haliclystidae). The genome sequence is 262 megabases in span. Most of the assembly (98.3%) is scaffolded into nine (9) chromosomal pseudomolecules. The mitochondrial genome was also assembled and is 18.3 kilobases in length.

19.
Wellcome Open Res ; 8: 24, 2023.
Article in English | MEDLINE | ID: mdl-36864925

ABSTRACT

As genomic data transform our understanding of biodiversity, the Earth BioGenome Project (EBP) has set a goal of generating reference quality genome assemblies for all ~1.9 million described eukaryotic taxa. Meeting this goal requires coordination among many individual regional and taxon-focussed projects working under the EBP umbrella. Large-scale sequencing projects require ready access to validated genome-relevant metadata, such as genome sizes and karyotypes, but these data are dispersed across the literature, and directly measured values are lacking for most taxa. To meet these needs, we have developed Genomes on a Tree (GoaT), an Elasticsearch-powered datastore and search index for genome-relevant metadata and sequencing project plans and statuses. GoaT indexes publicly available metadata for all eukaryotic species and interpolates missing values through phylogenetic comparison. GoaT also holds target priority and sequencing status information for many projects affiliated to the EBP to aid project coordination. Metadata and status attributes in GoaT can be queried through a mature API, a web front end, and a command line interface. The web front end additionally provides summary visualisations for data exploration and reporting (see https://goat.genomehubs.org). GoaT currently holds direct or estimated values for over 70 taxon attributes and over 30 assembly attributes across 1.5 million eukaryotic species. The depth and breadth of curated data, frequent updates, and a versatile query interface make GoaT a powerful data aggregator and portal to explore and report underlying data for the eukaryotic tree of life. We illustrate this utility through a series of use cases from planning through to completion of a genome-sequencing project.

20.
PLoS Biol ; 21(1): e3001972, 2023 Jan.
Article in English | MEDLINE | ID: mdl-36689552

ABSTRACT

The Darwin Tree of Life (DToL) project aims to sequence all described terrestrial and aquatic eukaryotic species found in Britain and Ireland. Reference genome sequences are generated from single individuals for each target species. In addition to the target genome, sequenced samples often contain genetic material from microbiomes, endosymbionts, parasites, and other cobionts. Wolbachia endosymbiotic bacteria are found in a diversity of terrestrial arthropods and nematodes, with supergroups A and B the most common in insects. We identified and assembled 110 complete Wolbachia genomes from 93 host species spanning 92 families by filtering data from 368 insect species generated by the DToL project. From 15 infected species, we assembled more than one Wolbachia genome, including cases where individuals carried simultaneous supergroup A and B infections. Different insect orders had distinct patterns of infection, with Lepidopteran hosts mostly infected with supergroup B, while infections in Diptera and Hymenoptera were dominated by A-type Wolbachia. Other than these large-scale order-level associations, host and Wolbachia phylogenies revealed no (or very limited) cophylogeny. This points to the occurrence of frequent host switching events, including between insect orders, in the evolutionary history of the Wolbachia pandemic. While supergroup A and B genomes had distinct GC% and GC skew, and B genomes had a larger core gene set and tended to be longer, it was the abundance of copies of bacteriophage WO that was a strong determinant of Wolbachia genome size. Mining raw genome data generated for reference genome assemblies is a robust way of identifying and analysing cobiont genomes and giving greater ecological context for their hosts.


Subjects
Diptera, Nematoda, Wolbachia, Humans, Animals, Phylogeny, Wolbachia/genetics, Genomics, Symbiosis/genetics