RESUMEN
The Andes mountains of western South America are a globally important biodiversity hotspot, yet there is a paucity of resolved phylogenies for plant clades from this region. Filling an important gap in our understanding of the World's richest flora, we present the first phylogeny of Freziera (Pentaphylacaceae), an Andean-centered, cloud forest radiation. Our dataset was obtained via hybrid-enriched target sequence capture of Angiosperms353 universal loci for 50 of the ca. 75 spp., obtained almost entirely from herbarium specimens. We identify high phylogenomic complexity in Freziera, including the presence of data artifacts. Via by-eye observation of gene trees, detailed examination of warnings from recently improved assembly pipelines, and gene tree filtering, we identified that artifactual orthologs (i.e., the presence of only one copy of a multicopy gene due to differential assembly) were an important source of gene tree heterogeneity that had a negative impact on phylogenetic inference and support. These artifactual orthologs may be common in plant phylogenomic datasets, where multiple instances of genome duplication are common. After accounting for artifactual orthologs as source of gene tree error, we identified a significant, but nonspecific signal of introgression using Patterson's D and f4 statistics. Despite phylogenomic complexity, we were able to resolve Freziera into 9 well-supported subclades whose evolution has been shaped by multiple evolutionary processes, including incomplete lineage sorting, historical gene flow, and gene duplication. Our results highlight the complexities of plant phylogenomics, which are heightened in Andean radiations, and show the impact of filtering data processing artifacts and standard filtering approaches on phylogenetic inference.
Asunto(s)
Filogenia , Clasificación/métodos , América del Sur , Genoma de PlantaRESUMEN
PREMISE: To date, phylogenetic relationships within the monogeneric Brunelliaceae have been based on morphological evidence, which does not provide sufficient phylogenetic resolution. Here we use target-enriched nuclear data to improve our understanding of phylogenetic relationships in the family. METHODS: We used the Angiosperms353 toolkit for targeted recovery of exonic regions and supercontigs (exons + introns) from low copy nuclear genes from 53 of 70 species in Brunellia, and several outgroup taxa. We removed loci that indicated biased inference of relationships and applied concatenated and coalescent methods to infer Brunellia phylogeny. We identified conflicts among gene trees that may reflect hybridization or incomplete lineage sorting events and assessed their impact on phylogenetic inference. Finally, we performed ancestral-state reconstructions of morphological traits and assessed the homology of character states used to define sections and subsections in Brunellia. RESULTS: Brunellia comprises two major clades and several subclades. Most of these clades/subclades do not correspond to previous infrageneric taxa. There is high topological incongruence among the subclades across analyses. CONCLUSIONS: Phylogenetic reconstructions point to rapid species diversification in Brunelliaceae, reflected in very short branches between successive species splits. The removal of putatively biased loci slightly improves phylogenetic support for individual clades. Reticulate evolution due to hybridization and/or incomplete lineage sorting likely both contribute to gene-tree discordance. Morphological characters used to define taxa in current classification schemes are homoplastic in the ancestral character-state reconstructions. While target enrichment data allows us to broaden our understanding of diversification in Brunellia, the relationships among subclades remain incompletely understood.