Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 40(5)2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38632086

RESUMO

MOTIVATION: Major improvements in sequencing technologies and genome sequence assembly have led to a huge increase in the number of available genome sequences. In turn, these genome sequences form an invaluable source for evolutionary, ecological, and comparative studies. One kind of analysis that has become routine is the search for traces of ancient polyploidy, particularly for plant genomes, where whole-genome duplication (WGD) is rampant. RESULTS: Here, we present a major update of a previously developed tool wgd, namely wgd v2, to look for remnants of ancient polyploidy, or WGD. We implemented novel and improved previously developed tools to (a) construct KS age distributions for the whole-paranome (collection of all duplicated genes in a genome), (b) unravel intragenomic and intergenomic collinearity resulting from WGDs, (c) fit mixture models to age distributions of gene duplicates, (d) correct substitution rate variation for phylogenetic placement of WGDs, and (e) date ancient WGDs via phylogenetic dating of WGD-retained gene duplicates. The applicability and feasibility of wgd v2 for the identification and the relative and absolute dating of ancient WGDs is demonstrated using different plant genomes. AVAILABILITY AND IMPLEMENTATION: wgd v2 is open source and available at https://github.com/heche-psb/wgd.


Assuntos
Duplicação Gênica , Genoma de Planta , Filogenia , Poliploidia , Evolução Molecular , Software , Genômica/métodos
2.
New Phytol ; 237(4): 1405-1417, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36349406

RESUMO

Ferns, and particularly homosporous ferns, have long been assumed to have experienced recurrent whole-genome duplication (WGD) events because of their substantially large genome sizes, surprisingly high chromosome numbers, and high degrees of polyploidy among many extant members. As the number of sequenced fern genomes is limited, recent studies have employed transcriptome data to find evidence for WGDs in ferns. However, they have reached conflicting results concerning the occurrence of ancient polyploidy, for instance, in the lineage of leptosporangiate ferns. Because identifying WGDs in a phylogenetic context is the foremost step in studying the contribution of ancient polyploidy to evolution, we here revisited earlier identified WGDs in leptosporangiate ferns, mainly the core leptosporangiate ferns, by building KS -age distributions and applying substitution rate corrections and by conducting statistical gene tree-species tree reconciliation analyses. Our integrative analyses not only identified four ancient WGDs in the sampled core leptosporangiate ferns but also identified false positives and false negatives for WGDs that recent studies have reported earlier. In conclusion, we underscore the significance of substitution rate corrections and uncertainties in gene tree-species tree reconciliations in calling WGD events and advance an exemplar workflow to overcome such often-overlooked issues.


Assuntos
Gleiquênias , Gleiquênias/genética , Filogenia , Duplicação Gênica , Tamanho do Genoma , Poliploidia , Evolução Molecular , Genoma de Planta
3.
Mol Biol Evol ; 37(9): 2734-2746, 2020 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-32359154

RESUMO

Ancient whole-genome duplications (WGDs) leave signatures in comparative genomic data sets that can be harnessed to detect these events of presumed evolutionary importance. Current statistical approaches for the detection of ancient WGDs in a phylogenetic context have two main drawbacks. The first is that unwarranted restrictive assumptions on the "background" gene duplication and loss rates make inferences unreliable in the face of model violations. The second is that most methods can only be used to examine a limited set of a priori selected WGD hypotheses and cannot be used to discover WGDs in a phylogeny. In this study, we develop an approach for WGD inference using gene count data that seeks to overcome both issues. We employ a phylogenetic birth-death model that includes WGD in a flexible hierarchical Bayesian approach and use reversible-jump Markov chain Monte Carlo to perform Bayesian inference of branch-specific duplication, loss, and WGD retention rates across the space of WGD configurations. We evaluate the proposed method using simulations, apply it to data sets from flowering plants, and discuss the statistical intricacies of model-based WGD inference.


Assuntos
Duplicação Gênica , Técnicas Genéticas , Genoma , Modelos Genéticos , Filogenia , Teorema de Bayes , Duplicação Cromossômica , Simulação por Computador , Magnoliopsida/genética , Cadeias de Markov , Método de Monte Carlo , Poliploidia
4.
BMC Biol ; 18(1): 57, 2020 05 27.
Artigo em Inglês | MEDLINE | ID: mdl-32460826

RESUMO

BACKGROUND: Gene duplication events play an important role in the evolution and adaptation of organisms. Duplicated genes can arise through different mechanisms, including whole-genome duplications (WGDs). Recently, WGD was suggested to be an important driver of evolution, also in hexapod animals. RESULTS: Here, we analyzed 20 high-quality hexapod genomes using whole-paranome distributions of estimated synonymous distances (KS), patterns of within-genome co-linearity, and phylogenomic gene tree-species tree reconciliation methods. We observe an abundance of gene duplicates in the majority of these hexapod genomes, yet we find little evidence for WGD. The majority of gene duplicates seem to have originated through small-scale gene duplication processes. We did detect segmental duplications in six genomes, but these lacked the within-genome co-linearity signature typically associated with WGD, and the age of these duplications did not coincide with particular peaks in KS distributions. Furthermore, statistical gene tree-species tree reconciliation failed to support all but one of the previously hypothesized WGDs. CONCLUSIONS: Our analyses therefore provide very limited evidence for WGD having played a significant role in the evolution of hexapods and suggest that alternative mechanisms drive gene duplication events in this group of animals. For instance, we propose that, along with small-scale gene duplication events, episodes of increased transposable element activity could have been an important source for gene duplicates in hexapods.


Assuntos
Evolução Molecular , Duplicação Gênica , Genoma , Insetos/genética , Animais , Artrópodes/genética , Filogenia
5.
Mol Biol Evol ; 36(7): 1384-1404, 2019 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-31004147

RESUMO

Gene tree-species tree reconciliation methods have been employed for studying ancient whole-genome duplication (WGD) events across the eukaryotic tree of life. Most approaches have relied on using maximum likelihood trees and the maximum parsimony reconciliation thereof to count duplication events on specific branches of interest in a reference species tree. Such approaches do not account for uncertainty in the gene tree and reconciliation, or do so only heuristically. The effects of these simplifications on the inference of ancient WGDs are unclear. In particular, the effects of variation in gene duplication and loss rates across the species tree have not been considered. Here, we developed a full probabilistic approach for phylogenomic reconciliation-based WGD inference, accounting for both gene tree and reconciliation uncertainty using a method based on the principle of amalgamated likelihood estimation. The model and methods are implemented in a maximum likelihood and Bayesian setting and account for variation of duplication and loss rates across the species tree, using methods inspired by phylogenetic divergence time estimation. We applied our newly developed framework to ancient WGDs in land plants and investigated the effects of duplication and loss rate variation on reconciliation and gene count based assessment of these earlier proposed WGDs.


Assuntos
Evolução Biológica , Duplicação Gênica , Técnicas Genéticas , Plantas/genética , Algoritmos , Filogenia
6.
Bioinformatics ; 35(12): 2153-2155, 2019 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-30398564

RESUMO

SUMMARY: Ancient whole-genome duplications (WGDs) have been uncovered in almost all major lineages of life on Earth and the search for traces or remnants of such events has become standard practice in most genome analyses. This is especially true for plants, where ancient WGDs are abundant. Common approaches to find evidence for ancient WGDs include the construction of KS distributions and the analysis of intragenomic colinearity. Despite the increased interest in WGDs and the acknowledgment of their evolutionary importance, user-friendly and comprehensive tools for their analysis are lacking. Here, we present an easy to use command-line tool for KS distribution construction named wgd. The wgd suite provides commonly used KS and colinearity analysis workflows together with tools for modeling and visualization, rendering these analyses accessible to genomics researchers in a convenient manner. AVAILABILITY AND IMPLEMENTATION: wgd is free and open source software implemented in Python and is available at https://github.com/arzwa/wgd. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Duplicação Gênica , Genoma de Planta , Evolução Biológica , Genômica , Plantas , Software
7.
Methods Mol Biol ; 2545: 3-18, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36720805

RESUMO

Whole-genome sequence data have revealed that numerous eukaryotic organisms derive from distant polyploid ancestors, even when these same organisms are genetically and karyotypically diploid. Such ancient whole-genome duplications (WGDs) have been important for long-term genome evolution and are often speculatively associated with important evolutionary events such as key innovations, adaptive radiations, or survival after mass extinctions. Clearly, reliable methods for unveiling ancient WGDs are key toward furthering understanding of the long-term evolutionary significance of polyploidy. In this chapter, we describe a set of basic established comparative genomics approaches for the inference of ancient WGDs from genomic data based on empirical age distributions and collinearity analyses, explain the principles on which they are based, and illustrate a basic workflow using the software "wgd," geared toward a typical exploratory analysis of a newly obtained genome sequence.


Assuntos
Diploide , Genômica , Humanos , Eucariotos , Extinção Biológica , Poliploidia
8.
Nat Commun ; 12(1): 3498, 2021 06 09.
Artigo em Inglês | MEDLINE | ID: mdl-34108452

RESUMO

Plant genomes vary greatly in size, organization, and architecture. Such structural differences may be highly relevant for inference of genome evolution dynamics and phylogeny. Indeed, microsynteny-the conservation of local gene content and order-is recognized as a valuable source of phylogenetic information, but its use for the inference of large phylogenies has been limited. Here, by combining synteny network analysis, matrix representation, and maximum likelihood phylogenetic inference, we provide a way to reconstruct phylogenies based on microsynteny information. Both simulations and use of empirical data sets show our method to be accurate, consistent, and widely applicable. As an example, we focus on the analysis of a large-scale whole-genome data set for angiosperms, including more than 120 available high-quality genomes, representing more than 50 different plant families and 30 orders. Our 'microsynteny-based' tree is largely congruent with phylogenies proposed based on more traditional sequence alignment-based methods and current phylogenetic classifications but differs for some long-contested and controversial relationships. For instance, our synteny-based tree finds Vitales as early diverging eudicots, Saxifragales within superasterids, and magnoliids as sister to monocots. We discuss how synteny-based phylogenetic inference can complement traditional methods and could provide additional insights into some long-standing controversial phylogenetic relationships.


Assuntos
Genoma de Planta/genética , Magnoliopsida/genética , Filogenia , Sintenia , Evolução Molecular , Genes de Plantas/genética , Magnoliopsida/classificação , Modelos Genéticos , Família Multigênica/genética
9.
Gigascience ; 10(3)2021 03 12.
Artigo em Inglês | MEDLINE | ID: mdl-33710327

RESUMO

BACKGROUND: Digitaria exilis, white fonio, is a minor but vital crop of West Africa that is valued for its resilience in hot, dry, and low-fertility environments and for the exceptional quality of its grain for human nutrition. Its success is hindered, however, by a low degree of plant breeding and improvement. FINDINGS: We sequenced the fonio genome with long-read SMRT-cell technology, yielding a ∼761 Mb assembly in 3,329 contigs (N50, 1.73 Mb; L50, 126). The assembly approaches a high level of completion, with a BUSCO score of >99%. The fonio genome was found to be a tetraploid, with most of the genome retained as homoeologous duplications that differ overall by ∼4.3%, neglecting indels. The 2 genomes within fonio were found to have begun their independent divergence ∼3.1 million years ago. The repeat content (>49%) is fairly standard for a grass genome of this size, but the ratio of Gypsy to Copia long terminal repeat retrotransposons (∼6.7) was found to be exceptionally high. Several genes related to future improvement of the crop were identified including shattering, plant height, and grain size. Analysis of fonio population genetics, primarily in Mali, indicated that the crop has extensive genetic diversity that is largely partitioned across a north-south gradient coinciding with the Sahel and Sudan grassland domains. CONCLUSIONS: We provide a high-quality assembly, annotation, and diversity analysis for a vital African crop. The availability of this information should empower future research into further domestication and improvement of fonio.


Assuntos
Digitaria , Melhoramento Vegetal , Digitaria/genética , Variação Genética , Genoma de Planta , Humanos , Preparações de Plantas
10.
Nat Plants ; 6(2): 107-118, 2020 02.
Artigo em Inglês | MEDLINE | ID: mdl-32042158

RESUMO

Hornworts, liverworts and mosses are three early diverging clades of land plants, and together comprise the bryophytes. Here, we report the draft genome sequence of the hornwort Anthoceros angustus. Phylogenomic inferences confirm the monophyly of bryophytes, with hornworts sister to liverworts and mosses. The simple morphology of hornworts correlates with low genetic redundancy in plant body plan, while the basic transcriptional regulation toolkit for plant development has already been established in this early land plant lineage. Although the Anthoceros genome is small and characterized by minimal redundancy, expansions are observed in gene families related to RNA editing, UV protection and desiccation tolerance. The genome of A. angustus bears the signatures of horizontally transferred genes from bacteria and fungi, in particular of genes operating in stress-response and metabolic pathways. Our study provides insight into the unique features of hornworts and their molecular adaptations to live on land.


Assuntos
Anthocerotophyta/genética , Evolução Biológica , Genoma de Planta , Família Multigênica , Filogenia
11.
Front Plant Sci ; 9: 352, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29616063

RESUMO

Recent times have seen an enormous growth of "omics" data, of which high-throughput gene expression data are arguably the most important from a functional perspective. Despite huge improvements in computational techniques for the functional classification of gene sequences, common similarity-based methods often fall short of providing full and reliable functional information. Recently, the combination of comparative genomics with approaches in functional genomics has received considerable interest for gene function analysis, leveraging both gene expression based guilt-by-association methods and annotation efforts in closely related model organisms. Besides the identification of missing genes in pathways, these methods also typically enable the discovery of biological regulators (i.e., transcription factors or signaling genes). A previously built guilt-by-association method is MORPH, which was proven to be an efficient algorithm that performs particularly well in identifying and prioritizing missing genes in plant metabolic pathways. Here, we present MorphDB, a resource where MORPH-based candidate genes for large-scale functional annotations (Gene Ontology, MapMan bins) are integrated across multiple plant species. Besides a gene centric query utility, we present a comparative network approach that enables researchers to efficiently browse MORPH predictions across functional gene sets and species, facilitating efficient gene discovery and candidate gene prioritization. MorphDB is available at http://bioinformatics.psb.ugent.be/webtools/morphdb/morphDB/index/. We also provide a toolkit, named "MORPH bulk" (https://github.com/arzwa/morph-bulk), for running MORPH in bulk mode on novel data sets, enabling researchers to apply MORPH to their own species of interest.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA