Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 61
Filtrar
1.
Cell ; 185(16): 2975-2987.e10, 2022 08 04.
Artículo en Inglés | MEDLINE | ID: mdl-35853453

RESUMEN

Horizontal gene transfer (HGT) is an important evolutionary force shaping prokaryotic and eukaryotic genomes. HGT-acquired genes have been sporadically reported in insects, a lineage containing >50% of animals. We systematically examined HGT in 218 high-quality genomes of diverse insects and found that they acquired 1,410 genes exhibiting diverse functions, including many not previously reported, via 741 distinct transfers from non-metazoan donors. Lepidopterans had the highest average number of HGT-acquired genes. HGT-acquired genes containing introns exhibited substantially higher expression levels than genes lacking introns, suggesting that intron gains were likely involved in HGT adaptation. Lastly, we used the CRISPR-Cas9 system to edit the prevalent unreported gene LOC105383139, which was transferred into the last common ancestor of moths and butterflies. In diamondback moths, males lacking LOC105383139 courted females significantly less. We conclude that HGT has been a major contributor to insect adaptation.


Asunto(s)
Mariposas Diurnas , Transferencia de Gen Horizontal , Animales , Mariposas Diurnas/genética , Cortejo , Evolución Molecular , Masculino , Filogenia
2.
Cell ; 176(6): 1356-1366.e10, 2019 03 07.
Artículo en Inglés | MEDLINE | ID: mdl-30799038

RESUMEN

Operons are a hallmark of bacterial genomes, where they allow concerted expression of functionally related genes as single polycistronic transcripts. They are rare in eukaryotes, where each gene usually drives expression of its own independent messenger RNAs. Here, we report the horizontal operon transfer of a siderophore biosynthesis pathway from relatives of Escherichia coli into a group of budding yeast taxa. We further show that the co-linearly arranged secondary metabolism genes are expressed, exhibit eukaryotic transcriptional features, and enable the sequestration and uptake of iron. After transfer, several genetic changes occurred during subsequent evolution, including the gain of new transcription start sites that were sometimes within protein-coding sequences, acquisition of polyadenylation sites, structural rearrangements, and integration of eukaryotic genes into the cluster. We conclude that the genes were likely acquired as a unit, modified for eukaryotic gene expression, and maintained by selection to adapt to the highly competitive, iron-limited environment.


Asunto(s)
Eucariontes/genética , Transferencia de Gen Horizontal/genética , Operón/genética , Bacterias/genética , Escherichia coli/genética , Células Eucariotas , Evolución Molecular , Regulación Bacteriana de la Expresión Génica/genética , Genes Bacterianos/genética , Genoma Bacteriano/genética , Genoma Fúngico/genética , Saccharomycetales/genética , Sideróforos/genética
3.
Cell ; 175(6): 1533-1545.e20, 2018 11 29.
Artículo en Inglés | MEDLINE | ID: mdl-30415838

RESUMEN

Budding yeasts (subphylum Saccharomycotina) are found in every biome and are as genetically diverse as plants or animals. To understand budding yeast evolution, we analyzed the genomes of 332 yeast species, including 220 newly sequenced ones, which represent nearly one-third of all known budding yeast diversity. Here, we establish a robust genus-level phylogeny comprising 12 major clades, infer the timescale of diversification from the Devonian period to the present, quantify horizontal gene transfer (HGT), and reconstruct the evolution of 45 metabolic traits and the metabolic toolkit of the budding yeast common ancestor (BYCA). We infer that BYCA was metabolically complex and chronicle the tempo and mode of genomic and phenotypic evolution across the subphylum, which is characterized by very low HGT levels and widespread losses of traits and the genes that control them. More generally, our results argue that reductive evolution is a major mode of evolutionary diversification.


Asunto(s)
Evolución Molecular , Transferencia de Gen Horizontal , Genoma Fúngico , Filogenia , Saccharomycetales/clasificación , Saccharomycetales/genética
4.
Nat Rev Genet ; 24(12): 834-850, 2023 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-37369847

RESUMEN

Genome-scale data and the development of novel statistical phylogenetic approaches have greatly aided the reconstruction of a broad sketch of the tree of life and resolved many of its branches. However, incongruence - the inference of conflicting evolutionary histories - remains pervasive in phylogenomic data, hampering our ability to reconstruct and interpret the tree of life. Biological factors, such as incomplete lineage sorting, horizontal gene transfer, hybridization, introgression, recombination and convergent molecular evolution, can lead to gene phylogenies that differ from the species tree. In addition, analytical factors, including stochastic, systematic and treatment errors, can drive incongruence. Here, we review these factors, discuss methodological advances to identify and handle incongruence, and highlight avenues for future research.


Asunto(s)
Evolución Biológica , Genoma , Filogenia , Evolución Molecular , Hibridación Genética
5.
Plant Cell ; 36(5): 1637-1654, 2024 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-38114096

RESUMEN

MicroRNAs (miRNAs) are a class of nonprotein-coding short transcripts that provide a layer of post-transcriptional regulation essential to many plant biological processes. MiR858, which targets the transcripts of MYB transcription factors, can affect a range of secondary metabolic processes. Although miR858 and its 187-nt precursor have been well studied in Arabidopsis (Arabidopsis thaliana), a systematic investigation of miR858 precursors and their functions across plant species is lacking due to a problem in identifying the transcripts that generate this subclass. By re-evaluating the transcript of miR858 and relaxing the length cut-off for identifying hairpins, we found in kiwifruit (Actinidia chinensis) that miR858 has long-loop hairpins (1,100 to 2,100 nt), whose intervening sequences between miRNA generating complementary sites were longer than all previously reported miRNA hairpins. Importantly, these precursors of miR858 containing long-loop hairpins (termed MIR858L) are widespread in seed plants including Arabidopsis, varying between 350 and 5,500 nt. Moreover, we showed that MIR858L has a greater impact on proanthocyanidin and flavonol levels in both Arabidopsis and kiwifruit. We suggest that an active MIR858L-MYB regulatory module appeared in the transition of early land plants to large upright flowering plants, making a key contribution to plant secondary metabolism.


Asunto(s)
Actinidia , Arabidopsis , Regulación de la Expresión Génica de las Plantas , MicroARNs , ARN de Planta , MicroARNs/genética , MicroARNs/metabolismo , Actinidia/genética , Actinidia/metabolismo , Arabidopsis/genética , ARN de Planta/genética , ARN de Planta/metabolismo , Semillas/genética , Semillas/metabolismo , Secuencia de Bases
6.
PLoS Biol ; 22(9): e3002794, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39283949

RESUMEN

Ancient divergences within Opisthokonta-a major lineage that includes organisms in the kingdoms Animalia, Fungi, and their unicellular relatives-remain contentious. To assess progress toward a genome-scale Opisthokonta phylogeny, we conducted the most taxon rich phylogenomic analysis using sets of genes inferred with different orthology inference methods and established the geological timeline of Opisthokonta diversification. We also conducted sensitivity analysis by subsampling genes or taxa from the full data matrix based on filtering criteria previously shown to improve phylogenomic inference. We found that approximately 85% of internal branches were congruent across data matrices and the approaches used. Notably, the use of different orthology inference methods was a substantial contributor to the observed incongruence: analyses using the same set of orthologs showed high congruence of 97% to 98%, whereas different sets of orthologs resulted in somewhat lower congruence (87% to 91%). Examination of unicellular Holozoa relationships suggests that the instability observed across varying gene sets may stem from weak phylogenetic signals. Our results provide a comprehensive Opisthokonta phylogenomic framework that will be useful for illuminating ancient evolutionary episodes concerning the origin and diversification of the 2 major eukaryotic kingdoms and emphasize the importance of investigating effects of orthology inference on phylogenetic analyses to resolve ancient divergences.


Asunto(s)
Genoma , Filogenia , Genoma/genética , Animales , Evolución Molecular , Genómica/métodos , Hongos/genética , Hongos/clasificación
7.
PLoS Biol ; 22(9): e3002832, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-39312572

RESUMEN

Many distantly related organisms have convergently evolved traits and lifestyles that enable them to live in similar ecological environments. However, the extent of phenotypic convergence evolving through the same or distinct genetic trajectories remains an open question. Here, we leverage a comprehensive dataset of genomic and phenotypic data from 1,049 yeast species in the subphylum Saccharomycotina (Kingdom Fungi, Phylum Ascomycota) to explore signatures of convergent evolution in cactophilic yeasts, ecological specialists associated with cacti. We inferred that the ecological association of yeasts with cacti arose independently approximately 17 times. Using a machine learning-based approach, we further found that cactophily can be predicted with 76% accuracy from both functional genomic and phenotypic data. The most informative feature for predicting cactophily was thermotolerance, which we found to be likely associated with altered evolutionary rates of genes impacting the cell envelope in several cactophilic lineages. We also identified horizontal gene transfer and duplication events of plant cell wall-degrading enzymes in distantly related cactophilic clades, suggesting that putatively adaptive traits evolved independently through disparate molecular mechanisms. Notably, we found that multiple cactophilic species and their close relatives have been reported as emerging human opportunistic pathogens, suggesting that the cactophilic lifestyle-and perhaps more generally lifestyles favoring thermotolerance-might preadapt yeasts to cause human disease. This work underscores the potential of a multifaceted approach involving high-throughput genomic and phenotypic data to shed light onto ecological adaptation and highlights how convergent evolution to wild environments could facilitate the transition to human pathogenicity.

8.
Proc Natl Acad Sci U S A ; 121(18): e2315314121, 2024 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-38669185

RESUMEN

How genomic differences contribute to phenotypic differences is a major question in biology. The recently characterized genomes, isolation environments, and qualitative patterns of growth on 122 sources and conditions of 1,154 strains from 1,049 fungal species (nearly all known) in the yeast subphylum Saccharomycotina provide a powerful, yet complex, dataset for addressing this question. We used a random forest algorithm trained on these genomic, metabolic, and environmental data to predict growth on several carbon sources with high accuracy. Known structural genes involved in assimilation of these sources and presence/absence patterns of growth in other sources were important features contributing to prediction accuracy. By further examining growth on galactose, we found that it can be predicted with high accuracy from either genomic (92.2%) or growth data (82.6%) but not from isolation environment data (65.6%). Prediction accuracy was even higher (93.3%) when we combined genomic and growth data. After the GALactose utilization genes, the most important feature for predicting growth on galactose was growth on galactitol, raising the hypothesis that several species in two orders, Serinales and Pichiales (containing the emerging pathogen Candida auris and the genus Ogataea, respectively), have an alternative galactose utilization pathway because they lack the GAL genes. Growth and biochemical assays confirmed that several of these species utilize galactose through an alternative oxidoreductive D-galactose pathway, rather than the canonical GAL pathway. Machine learning approaches are powerful for investigating the evolution of the yeast genotype-phenotype map, and their application will uncover novel biology, even in well-studied traits.


Asunto(s)
Galactosa , Aprendizaje Automático , Galactosa/metabolismo , Genoma Fúngico , Redes y Vías Metabólicas/genética , Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética
9.
Proc Natl Acad Sci U S A ; 121(10): e2316031121, 2024 Mar 05.
Artículo en Inglés | MEDLINE | ID: mdl-38412132

RESUMEN

The Saccharomycotina yeasts ("yeasts" hereafter) are a fungal clade of scientific, economic, and medical significance. Yeasts are highly ecologically diverse, found across a broad range of environments in every biome and continent on earth; however, little is known about what rules govern the macroecology of yeast species and their range limits in the wild. Here, we trained machine learning models on 12,816 terrestrial occurrence records and 96 environmental variables to infer global distribution maps at ~1 km2 resolution for 186 yeast species (~15% of described species from 75% of orders) and to test environmental drivers of yeast biogeography and macroecology. We found that predicted yeast diversity hotspots occur in mixed montane forests in temperate climates. Diversity in vegetation type and topography were some of the greatest predictors of yeast species richness, suggesting that microhabitats and environmental clines are key to yeast diversity. We further found that range limits in yeasts are significantly influenced by carbon niche breadth and range overlap with other yeast species, with carbon specialists and species in high-diversity environments exhibiting reduced geographic ranges. Finally, yeasts contravene many long-standing macroecological principles, including the latitudinal diversity gradient, temperature-dependent species richness, and a positive relationship between latitude and range size (Rapoport's rule). These results unveil how the environment governs the global diversity and distribution of species in the yeast subphylum. These high-resolution models of yeast species distributions will facilitate the prediction of economically relevant and emerging pathogenic species under current and future climate scenarios.


Asunto(s)
Biodiversidad , Ecosistema , Clima , Bosques , Carbono , Levaduras
10.
Mol Biol Evol ; 41(4)2024 Apr 02.
Artículo en Inglés | MEDLINE | ID: mdl-38415839

RESUMEN

Siderophores are crucial for iron-scavenging in microorganisms. While many yeasts can uptake siderophores produced by other organisms, they are typically unable to synthesize siderophores themselves. In contrast, Wickerhamiella/Starmerella (W/S) clade yeasts gained the capacity to make the siderophore enterobactin following the remarkable horizontal acquisition of a bacterial operon enabling enterobactin synthesis. Yet, how these yeasts absorb the iron bound by enterobactin remains unresolved. Here, we demonstrate that Enb1 is the key enterobactin importer in the W/S-clade species Starmerella bombicola. Through phylogenomic analyses, we show that ENB1 is present in all W/S clade yeast species that retained the enterobactin biosynthetic genes. Conversely, it is absent in species that lost the ent genes, except for Starmerella stellata, making this species the only cheater in the W/S clade that can utilize enterobactin without producing it. Through phylogenetic analyses, we infer that ENB1 is a fungal gene that likely existed in the W/S clade prior to the acquisition of the ent genes and subsequently experienced multiple gene losses and duplications. Through phylogenetic topology tests, we show that ENB1 likely underwent horizontal gene transfer from an ancient W/S clade yeast to the order Saccharomycetales, which includes the model yeast Saccharomyces cerevisiae, followed by extensive secondary losses. Taken together, these results suggest that the fungal ENB1 and bacterial ent genes were cooperatively integrated into a functional unit within the W/S clade that enabled adaptation to iron-limited environments. This integrated fungal-bacterial circuit and its dynamic evolution determine the extant distribution of yeast enterobactin producers and cheaters.


Asunto(s)
Enterobactina , Evolución Molecular , Operón , Filogenia , Enterobactina/metabolismo , Enterobactina/genética , Sideróforos/metabolismo , Sideróforos/genética , Genes Fúngicos , Saccharomycetales/genética , Saccharomycetales/metabolismo , Transferencia de Gen Horizontal
11.
Syst Biol ; 2024 Jun 28.
Artículo en Inglés | MEDLINE | ID: mdl-38940001

RESUMEN

Maximum likelihood (ML) phylogenetic inference is widely used in phylogenomics. As heuristic searches most likely find suboptimal trees, it is recommended to conduct multiple (e.g., ten) tree searches in phylogenetic analyses. However, beyond its positive role, how and to what extent multiple tree searches aid ML phylogenetic inference remains poorly explored. Here, we found that a random starting tree was not as effective as the BioNJ and parsimony starting trees in inferring ML gene tree and that RAxML-NG and PhyML were less sensitive to different starting trees than IQ-TREE. We then examined the effect of the number of tree searches on ML tree inference with IQ-TREE and RAxML-NG, by running 100 tree searches on 19,414 gene alignments from 15 animal, plant, and fungal phylogenomic datasets. We found that the number of tree searches substantially impacted the recovery of the best-of-100 ML gene tree topology among 100 searches for a given ML program. In addition, all of the concatenation-based trees were topologically identical if the number of tree searches was ≥ 10. Quartet-based ASTRAL trees inferred from 1 to 80 tree searches differed topologically from those inferred from 100 tree searches for 6 /15 phylogenomic datasets. Lastly, our simulations showed that gene alignments with lower difficulty scores had a higher chance of finding the best-of-100 gene tree topology and were more likely to yield the correct trees.

12.
PLoS Biol ; 20(10): e3001827, 2022 10.
Artículo en Inglés | MEDLINE | ID: mdl-36228036

RESUMEN

Molecular evolution studies, such as phylogenomic studies and genome-wide surveys of selection, often rely on gene families of single-copy orthologs (SC-OGs). Large gene families with multiple homologs in 1 or more species-a phenomenon observed among several important families of genes such as transporters and transcription factors-are often ignored because identifying and retrieving SC-OGs nested within them is challenging. To address this issue and increase the number of markers used in molecular evolution studies, we developed OrthoSNAP, a software that uses a phylogenetic framework to simultaneously split gene families into SC-OGs and prune species-specific inparalogs. We term SC-OGs identified by OrthoSNAP as SNAP-OGs because they are identified using a splitting and pruning procedure analogous to snapping branches on a tree. From 415,129 orthologous groups of genes inferred across 7 eukaryotic phylogenomic datasets, we identified 9,821 SC-OGs; using OrthoSNAP on the remaining 405,308 orthologous groups of genes, we identified an additional 10,704 SNAP-OGs. Comparison of SNAP-OGs and SC-OGs revealed that their phylogenetic information content was similar, even in complex datasets that contain a whole-genome duplication, complex patterns of duplication and loss, transcriptome data where each gene typically has multiple transcripts, and contentious branches in the tree of life. OrthoSNAP is useful for increasing the number of markers used in molecular evolution data matrices, a critical step for robustly inferring and exploring the tree of life.


Asunto(s)
Algoritmos , Evolución Molecular , Filogenia , Linaje , Factores de Transcripción
13.
Yeast ; 2024 Sep 18.
Artículo en Inglés | MEDLINE | ID: mdl-39295298

RESUMEN

Yeasts in the subphylum Saccharomycotina are found across the globe in disparate ecosystems. A major aim of yeast research is to understand the diversity and evolution of ecological traits, such as carbon metabolic breadth, insect association, and cactophily. This includes studying aspects of ecological traits like genetic architecture or association with other phenotypic traits. Genomic resources in the Saccharomycotina have grown rapidly. Ecological data, however, are still limited for many species, especially those only known from species descriptions where usually only a limited number of strains are studied. Moreover, ecological information is recorded in natural language format limiting high throughput computational analysis. To address these limitations, we developed an ontological framework for the analysis of yeast ecology. A total of 1,088 yeast strains were added to the Ontology of Yeast Environments (OYE) and analyzed in a machine-learning framework to connect genotype to ecology. This framework is flexible and can be extended to additional isolates, species, or environmental sequencing data. Widespread adoption of OYE would greatly aid the study of macroecology in the Saccharomycotina subphylum.

14.
New Phytol ; 2024 Aug 21.
Artículo en Inglés | MEDLINE | ID: mdl-39166427

RESUMEN

Horizontal gene transfer (HGT) is a major driving force in the evolution of prokaryotic and eukaryotic genomes. Despite recent advances in distribution and ecological importance, the extensive pattern, especially in seed plants, and post-transfer adaptation of HGT-acquired genes in land plants remain elusive. We systematically identified 1150 foreign genes in 522 land plant genomes that were likely acquired via at least 322 distinct transfers from nonplant donors and confirmed that recent HGT events were unevenly distributed between seedless and seed plants. HGT-acquired genes evolved to be more similar to native genes in terms of average intron length due to intron gains, and HGT-acquired genes containing introns exhibited higher expression levels than those lacking introns, suggesting that intron gains may be involved in the post-transfer adaptation of HGT in land plants. Functional validation of bacteria-derived gene GuaD in mosses and gymnosperms revealed that the invasion of foreign genes introduced a novel bypass of guanine degradation and resulted in the loss of native pathway genes in some gymnosperms, eventually shaping three major types of guanine metabolism in land plants. We conclude that HGT has played a critical role in land plant evolution.

15.
PLoS Biol ; 19(8): e3001365, 2021 08.
Artículo en Inglés | MEDLINE | ID: mdl-34358228

RESUMEN

Phylogenomic analyses of hundreds of protein-coding genes aimed at resolving phylogenetic relationships is now a common practice. However, no software currently exists that includes tools for dataset construction and subsequent analysis with diverse validation strategies to assess robustness. Furthermore, there are no publicly available high-quality curated databases designed to assess deep (>100 million years) relationships in the tree of eukaryotes. To address these issues, we developed an easy-to-use software package, PhyloFisher (https://github.com/TheBrownLab/PhyloFisher), written in Python 3. PhyloFisher includes a manually curated database of 240 protein-coding genes from 304 eukaryotic taxa covering known eukaryotic diversity, a novel tool for ortholog selection, and utilities that will perform diverse analyses required by state-of-the-art phylogenomic investigations. Through phylogenetic reconstructions of the tree of eukaryotes and of the Saccharomycetaceae clade of budding yeasts, we demonstrate the utility of the PhyloFisher workflow and the provided starting database to address phylogenetic questions across a large range of evolutionary time points for diverse groups of organisms. We also demonstrate that undetected paralogy can remain in phylogenomic "single-copy orthogroup" datasets constructed using widely accepted methods such as all vs. all BLAST searches followed by Markov Cluster Algorithm (MCL) clustering and application of automated tree pruning algorithms. Finally, we show how the PhyloFisher workflow helps detect inadvertent paralog inclusions, allowing the user to make more informed decisions regarding orthology assignments, leading to a more accurate final dataset.


Asunto(s)
Eucariontes/genética , Filogenia , Programas Informáticos
16.
Environ Microbiol ; 25(3): 642-645, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36511824

RESUMEN

As the most diverse group of animals on Earth, insects are key organisms in ecosystems. Horizontal gene transfer (HGT) refers to the transfer of genetic material between species by non-reproductive means. HGT is a major evolutionary force in prokaryotic genome evolution, but its importance in different eukaryotic groups, such as insects, has only recently begun to be understood. Genomic data from hundreds of insect species have enabled the detection of large numbers of HGT events and the elucidation of the functions of some of these foreign genes. Although quantification of the extent of HGT in insects broadens our understanding of its role in insect evolution, the scope of its influence and underlying mechanism(s) of its occurrence remain open questions for the field.


Asunto(s)
Evolución Molecular , Transferencia de Gen Horizontal , Animales , Ecosistema , Células Procariotas , Insectos , Genoma de los Insectos , Filogenia
17.
PLoS Biol ; 18(12): e3001007, 2020 12.
Artículo en Inglés | MEDLINE | ID: mdl-33264284

RESUMEN

Highly divergent sites in multiple sequence alignments (MSAs), which can stem from erroneous inference of homology and saturation of substitutions, are thought to negatively impact phylogenetic inference. Thus, several different trimming strategies have been developed for identifying and removing these sites prior to phylogenetic inference. However, a recent study reported that doing so can worsen inference, underscoring the need for alternative alignment trimming strategies. Here, we introduce ClipKIT, an alignment trimming software that, rather than identifying and removing putatively phylogenetically uninformative sites, instead aims to identify and retain parsimony-informative sites, which are known to be phylogenetically informative. To test the efficacy of ClipKIT, we examined the accuracy and support of phylogenies inferred from 14 different alignment trimming strategies, including those implemented in ClipKIT, across nearly 140,000 alignments from a broad sampling of evolutionary histories. Phylogenies inferred from ClipKIT-trimmed alignments are accurate, robust, and time saving. Furthermore, ClipKIT consistently outperformed other trimming methods across diverse datasets, suggesting that strategies based on identifying and retaining parsimony-informative sites provide a robust framework for alignment trimming.


Asunto(s)
Alineación de Secuencia/métodos , Análisis de Secuencia de ADN/métodos , Algoritmos , Simulación por Computador , Evolución Molecular , Modelos Genéticos , Filogenia , Programas Informáticos
18.
Mol Biol Evol ; 38(10): 4322-4333, 2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34097041

RESUMEN

Identifying our most distant animal relatives has emerged as one of the most challenging problems in phylogenetics. This debate has major implications for our understanding of the origin of multicellular animals and of the earliest events in animal evolution, including the origin of the nervous system. Some analyses identify sponges as our most distant animal relatives (Porifera-sister hypothesis), and others identify comb jellies (Ctenophora-sister hypothesis). These analyses vary in many respects, making it difficult to interpret previous tests of these hypotheses. To gain insight into why different studies yield different results, an important next step in the ongoing debate, we systematically test these hypotheses by synthesizing 15 previous phylogenomic studies and performing new standardized analyses under consistent conditions with additional models. We find that Ctenophora-sister is recovered across the full range of examined conditions, and Porifera-sister is recovered in some analyses under narrow conditions when most outgroups are excluded and site-heterogeneous CAT models are used. We additionally find that the number of categories in site-heterogeneous models is sufficient to explain the Porifera-sister results. Furthermore, our cross-validation analyses show CAT models that recover Porifera-sister have hundreds of additional categories and fail to fit significantly better than site-heterogenuous models with far fewer categories. Systematic and standardized testing of diverse phylogenetic models suggests that we should be skeptical of Porifera-sister results both because they are recovered under such narrow conditions and because the models in these conditions fit the data no better than other models that recover Ctenophora-sister.


Asunto(s)
Ctenóforos , Animales , Filogenia
19.
EMBO J ; 37(1): 63-74, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29054852

RESUMEN

DNA glycosylases preserve genome integrity and define the specificity of the base excision repair pathway for discreet, detrimental modifications, and thus, the mechanisms by which glycosylases locate DNA damage are of particular interest. Bacterial AlkC and AlkD are specific for cationic alkylated nucleobases and have a distinctive HEAT-like repeat (HLR) fold. AlkD uses a unique non-base-flipping mechanism that enables excision of bulky lesions more commonly associated with nucleotide excision repair. In contrast, AlkC has a much narrower specificity for small lesions, principally N3-methyladenine (3mA). Here, we describe how AlkC selects for and excises 3mA using a non-base-flipping strategy distinct from that of AlkD. A crystal structure resembling a catalytic intermediate complex shows how AlkC uses unique HLR and immunoglobulin-like domains to induce a sharp kink in the DNA, exposing the damaged nucleobase to active site residues that project into the DNA This active site can accommodate and excise N3-methylcytosine (3mC) and N1-methyladenine (1mA), which are also repaired by AlkB-catalyzed oxidative demethylation, providing a potential alternative mechanism for repair of these lesions in bacteria.


Asunto(s)
Bacillus cereus/enzimología , Aductos de ADN/química , Aductos de ADN/metabolismo , Daño del ADN , ADN Glicosilasas/química , ADN Glicosilasas/metabolismo , Reparación del ADN , Adenina/análogos & derivados , Adenina/química , Alquilación , Secuencia de Aminoácidos , Dominio Catalítico , Cristalografía por Rayos X , Modelos Moleculares , Conformación Proteica , Homología de Secuencia
20.
Bioinformatics ; 37(16): 2325-2331, 2021 Aug 25.
Artículo en Inglés | MEDLINE | ID: mdl-33560364

RESUMEN

MOTIVATION: Diverse disciplines in biology process and analyze multiple sequence alignments (MSAs) and phylogenetic trees to evaluate their information content, infer evolutionary events and processes and predict gene function. However, automated processing of MSAs and trees remains a challenge due to the lack of a unified toolkit. To fill this gap, we introduce PhyKIT, a toolkit for the UNIX shell environment with 30 functions that process MSAs and trees, including but not limited to estimation of mutation rate, evaluation of sequence composition biases, calculation of the degree of violation of a molecular clock and collapsing bipartitions (internal branches) with low support. RESULTS: To demonstrate the utility of PhyKIT, we detail three use cases: (1) summarizing information content in MSAs and phylogenetic trees for diagnosing potential biases in sequence or tree data; (2) evaluating gene-gene covariation of evolutionary rates to identify functional relationships, including novel ones, among genes and (3) identify lack of resolution events or polytomies in phylogenetic trees, which are suggestive of rapid radiation events or lack of data. We anticipate PhyKIT will be useful for processing, examining and deriving biological meaning from increasingly large phylogenomic datasets. AVAILABILITY AND IMPLEMENTATION: PhyKIT is freely available on GitHub (https://github.com/JLSteenwyk/PhyKIT), PyPi (https://pypi.org/project/phykit/) and the Anaconda Cloud (https://anaconda.org/JLSteenwyk/phykit) under the MIT license with extensive documentation and user tutorials (https://jlsteenwyk.com/PhyKIT). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA