RESUMO
Accurately reconstructing the reticulate histories of polyploids remains a central challenge for understanding plant evolution. Although phylogenetic networks can provide insights into relationships among polyploid lineages, inferring networks may be hindered by the complexities of homology determination in polyploid taxa. We use simulations to show that phasing alleles from allopolyploid individuals can improve phylogenetic network inference under the multispecies coalescent by obtaining the true network with fewer loci compared with haplotype consensus sequences or sequences with heterozygous bases represented as ambiguity codes. Phased allelic data can also improve divergence time estimates for networks, which is helpful for evaluating allopolyploid speciation hypotheses and proposing mechanisms of speciation. To achieve these outcomes in empirical data, we present a novel pipeline that leverages a recently developed phasing algorithm to reliably phase alleles from polyploids. This pipeline is especially appropriate for target enrichment data, where the depth of coverage is typically high enough to phase entire loci. We provide an empirical example in the North American Dryopteris fern complex that demonstrates insights from phased data as well as the challenges of network inference. We establish that our pipeline (PATÉ: Phased Alleles from Target Enrichment data) is capable of recovering a high proportion of phased loci from both diploids and polyploids. These data may improve network estimates compared with using haplotype consensus assemblies by accurately inferring the direction of gene flow, but statistical nonidentifiability of phylogenetic networks poses a barrier to inferring the evolutionary history of reticulate complexes.
Assuntos
Alelos , Filogenia , Poliploidia , Classificação/métodos , Gleiquênias/genética , Gleiquênias/classificação , Simulação por Computador , Algoritmos , Modelos GenéticosRESUMO
Vaccinieae is a morphologically diverse and species-rich (â¼1430 species) tribe in Ericaceae. Although the majority of diversity is tropical, Vaccinieae are best known for temperate crops (i.e., blueberries, cranberries, and lingonberries) in Vaccinium. Vaccinium itself (â¼500 species) has been previously suggested as highly polyphyletic and taxonomic boundaries among many of the other genera in the tribe remain uncertain. We assessed the evolutionary history of Vaccinieae with phylogenomic analyses based on a target-enrichment dataset containing 256 low-copy nuclear loci and 210 species representing 30 of the 35 genera in the tribe and 25 of the 29 sections of Vaccinium. We conducted time-calibrated biogeographic analyses and diversification analyses to explore the area of origin and global dispersal history of the tribe. The analysis recovered a temperate North American origin for Vaccinieae approximately 30 million years ago. Tropical diversity of Vaccinieae was inferred to result from multiple, independent movements into the tropics from north-temperate ancestors. Diversification rate increases corresponded to radiation into the Andes and SE Asia. The pseudo-10-locular ovary evolved once in the tribe from the five-locular state, coinciding with the diversification of a major clade that includes most Asian Vaccinium and the group from which commercial blueberries are derived (V. sect. Cyanococcus). A reconstruction from available chromosome counts suggests that a major polyploid event predated the evolution of nearly half the diversity of Vaccinieae. The extent of polyphyly in Vaccinium documented here supports the need for a generic reclassification of the tribe.
RESUMO
The Ledebouriinae (Scilloideae, Asparagaceae) are a widespread group of bulbous geophytes found predominantly throughout seasonal climates in sub-Saharan Africa, with a handful of taxa in Madagascar, the Middle East, India, and Sri Lanka. Phylogenetic relationships within the group have been historically difficult to elucidate. Here, we provide the first phylogenomic perspective into the Ledebouriinae. Using the Angiosperms353 targeted enrichment probe set, we consistently recovered four major clades (i.e., two Ledebouria clades, Drimiopsis, and Resnova). The two Ledebouria clades closely align with geography, either consisting almost entirely of sub-Saharan African taxa (Ledebouria Clade A), or East African and non-African taxa (Ledebouria Clade B). Our results suggest that the Ledebouriinae likely underwent a rapid radiation leading to rampant incomplete lineage sorting. We additionally find evidence for potential historical hybridization between Drimiopsis and a subclade within Ledebouria Clade A.
Assuntos
Asparagaceae , Geografia , Hibridização Genética , Índia , FilogeniaRESUMO
PREMISE: The true blueberries (Vaccinium sect. Cyanococcus; Ericaceae), endemic to North America, have been intensively studied for over a century. However, with species estimates ranging from nine to 24 and much confusion regarding species boundaries, this ecologically and economically valuable group remains inadequately understood at a basic evolutionary and taxonomic level. As a first step toward understanding the evolutionary history and taxonomy of this species complex, we present the first phylogenomic hypothesis of the known diploid blueberries. METHODS: We used flow cytometry to verify the ploidy of putative diploid taxa and a target-enrichment approach to obtain a genomic data set for phylogenetic analyses. RESULTS: Despite evidence of gene flow, we found that a primary phylogenetic signal is present. Monophyly for all morphospecies was recovered, with two notable exceptions: one sample of V. boreale was consistently nested in the V. myrtilloides clade and V. caesariense was nested in the V. fuscatum clade. One diploid taxon, Vaccinium pallidum, is implicated as having a homoploid hybrid origin. CONCLUSIONS: This foundational study represents the first attempt to elucidate evolutionary relationships of the true blueberries of North America with a phylogenomic approach and sets the stage for multiple avenues of future study such as a taxonomic revision of the group, the verification of a homoploid hybrid taxon, and the study of polyploid lineages within the context of a diploid phylogeny.
Assuntos
Mirtilos Azuis (Planta) , Vaccinium , Filogenia , Diploide , PoliploidiaRESUMO
Botanists have long recognised interspecific gene flow as a common occurrence within white oaks (Quercus section Quercus). Historical allele exchange, however, has not been fully characterised and the complex genomic signals resulting from the combination of vertical and horizontal gene transmission may confound phylogenetic inference and obscure our ability to accurately infer the deep evolutionary history of oaks. Using anchored enrichment, we obtained a phylogenomic dataset consisting of hundreds of single-copy nuclear loci. Concatenation, species-tree and network analyses were carried out in an attempt to uncover the genomic signal of ancient introgression and infer the divergent phylogenetic topology for the white oak clade. Locus and site-level likelihood comparisons were then conducted to further explore the introgressed signal within our dataset. Historical, intersectional gene flow is suggested to have occurred between an ancestor of the Eurasian Roburoid lineage and Quercus pontica and North American Dumosae and Prinoideae lineages. Despite extensive time past, our approach proved successful in detecting the genomic signature of ancient introgression. Our results, however, highlight the importance of sampling and the use of a plurality of analytical tools and methods to sufficiently explore genomic datasets, uncover this signal, and accurately infer evolutionary history.
Assuntos
Quercus , Evolução Biológica , Fluxo Gênico , Genômica , Filogenia , Quercus/genéticaRESUMO
The tree of life is highly reticulate, with the history of population divergence emerging from populations of gene phylogenies that reflect histories of introgression, lineage sorting and divergence. In this study, we investigate global patterns of oak diversity and test the hypothesis that there are regions of the oak genome that are broadly informative about phylogeny. We utilize fossil data and restriction-site associated DNA sequencing (RAD-seq) for 632 individuals representing nearly 250 Quercus species to infer a time-calibrated phylogeny of the world's oaks. We use a reversible-jump Markov chain Monte Carlo method to reconstruct shifts in lineage diversification rates, accounting for among-clade sampling biases. We then map the > 20 000 RAD-seq loci back to an annotated oak genome and investigate genomic distribution of introgression and phylogenetic support across the phylogeny. Oak lineages have diversified among geographic regions, followed by ecological divergence within regions, in the Americas and Eurasia. Roughly 60% of oak diversity traces back to four clades that experienced increases in net diversification, probably in response to climatic transitions or ecological opportunity. The strong support for the phylogeny contrasts with high genomic heterogeneity in phylogenetic signal and introgression. Oaks are phylogenomic mosaics, and their diversity may in fact depend on the gene flow that shapes the oak genome.
Assuntos
Quercus , Fluxo Gênico , Genômica , Filogenia , Quercus/genética , Análise de Sequência de DNARESUMO
Peat mosses (Sphagnum) hold exceptional importance in the control of global carbon fluxes and climate because of the vast stores of carbon bound up in partially decomposed biomass (peat). This study tests the hypothesis that the early diversification of Sphagnum was in the Northern Hemisphere, with subsequent range expansions to tropical latitudes and the Southern Hemisphere. A phylogenetic analysis of 192 accessions representing the moss class Sphagnopsida based on four plastid loci was conducted in conjunction with biogeographic analyses using BioGeoBEARS to investigate the tempo and mode of geographic range evolution. Analyses support the hypothesis that the major intrageneric clades of peat-forming species accounting for >90% of peat moss diversity originated and diversified at northern latitudes. The genus underwent multiple range expansions into tropical and Southern Hemisphere regions. Range evolution in peat mosses was most common within latitudinal zones, attesting to the relative difficulty of successfully invading new climate zones. Allopolyploidy in Sphagnum (inferred from microsatellite heterozygosity) does not appear to be biased with regard to geographic region nor intrageneric clade. The inference that Sphagnum diversified in cool-or cold-climate regions and repeatedly expanded its range into tropical regions makes the genus an excellent model for studying morphological, physiological, and genomic traits associated with adaptation to warming climates.
Assuntos
Adaptação Fisiológica/fisiologia , Clima , Sphagnopsida/fisiologia , Biodiversidade , Evolução Biológica , Carbono/metabolismo , Mudança Climática , Filogenia , Sphagnopsida/classificação , Sphagnopsida/genética , Sphagnopsida/metabolismoRESUMO
The Campanulaceae comprises approximately 2300 species that are distributed among five major lineages: Campanuloideae, Cyphioideae, Cyphocarpoideae, Lobelioideae, and Nemacladoideae. Of these, the Campanuloideae, a primarily Old World clade, has the largest diversity in East Asia. In this study, we reconstruct the phylogeny of East Asian Campanuloideae based on one nuclear gene (i.e., PPR70) and five plastid markers (i.e., atpB, matK, petD, rbcL, and trnL-trnF). We then use this phylogenetic framework to reconstruct the biogeographical history of the genus. Our molecular dataset includes 376 of the 1045 currently recognized species in the Campanuloideae. Of the 376 sampled species, 116 are from East Asia, representing ca. 60% of the East Asian Campanuloideae. Our PPR dataset included sequences for 156 accessions, representing 54 species, while our plastid dataset included sequences for 305 accessions, representing 354 species. Phylogenetic analyses recovered three large clades containing East Asian taxa: Campanulinae, Platycodinae, and Wahlenberginae. The historical assembly of Campanuloideae diversity in East Asia appears to have resulted from numerous, independent movements from Africa, Europe/W. Asia, and North America. Africa was inferred as the ancestral range for the Campanuloideae. Movement of the largest East Asian clade (Platycodinae) occurred at approximately 53.1â¯Ma (46.6-58.73 95% HPD) from Africa, with much of the current diversity found in East Asia having resulted from in situ diversification. Thirteen additional movements into East Asia, primarily from Europe/Western Asia, occurred subsequently. One dispersal event from western North America was also inferred. In contrast, only six movements out of East Asia were found. Our results suggest that East Asia has acted primarily as a sink for Campanuloideae diversity, with Europe, Western Asia, and Africa representing major source areas.
Assuntos
Evolução Biológica , Campanulaceae/classificação , Campanulaceae/genética , Variação Genética , Núcleo Celular/genética , Ásia Oriental , Funções Verossimilhança , Filogenia , Filogeografia , Plastídeos/genética , Fatores de TempoRESUMO
PREMISE OF THE STUDY: The Campanulaceae are a diverse clade of flowering plants encompassing more than 2300 species in myriad habitats from tropical rainforests to arctic tundra. A robust, multigene phylogeny, including all major lineages, is presented to provide a broad, evolutionary perspective of this cosmopolitan clade. METHODS: We used a phylogenetic framework, in combination with divergence dating, ancestral range estimation, chromosome modeling, and morphological character reconstruction analyses to infer phylogenetic placement and timing of major biogeographic, genomic, and morphological changes in the history of the group and provide insights into the diversification of this clade across six continents. KEY RESULTS: Ancestral range estimation supports an out-of-Africa diversification following the Cretaceous-Tertiary extinction event. Chromosomal modeling, with corroboration from the distribution of synonymous substitutions among gene duplicates, provides evidence for as many as 20 genome-wide duplication events before large radiations. Morphological reconstructions support the hypothesis that switches in floral symmetry and anther dehiscence were important in the evolution of secondary pollen presentation mechanisms. CONCLUSIONS: This study provides a broad, phylogenetic perspective on the evolution of the Campanulaceae clade. The remarkable habitat diversity and cosmopolitan distribution of this lineage appears to be the result of a complex history of genome duplications and numerous long-distance dispersal events. We failed to find evidence for an ancestral polyploidy event for this clade, and our analyses indicate an ancestral base number of nine for the group. This study will serve as a framework for future studies in diverse areas of research in Campanulaceae.
Assuntos
Evolução Biológica , Campanulaceae/classificação , Campanulaceae/genética , Filogenia , Evolução Molecular , Genes de Plantas , Análise de Sequência de DNARESUMO
Northern Hemisphere forests changed drastically in the early Eocene with the diversification of the oak family (Fagaceae). Cooling climates over the next 20 million years fostered the spread of temperate biomes that became increasingly dominated by oaks and their chestnut relatives. Here we use phylogenomic analyses of nuclear and plastid genomes to investigate the timing and pattern of major macroevolutionary events and ancient genome-wide signatures of hybridization across Fagaceae. Innovation related to seed dispersal is implicated in triggering waves of continental radiations beginning with the rapid diversification of major lineages and resulting in unparalleled transformation of forest dynamics within 15 million years following the K-Pg extinction. We detect introgression at multiple time scales, including ancient events predating the origination of genus-level diversity. As oak lineages moved into newly available temperate habitats in the early Miocene, secondary contact between previously isolated species occurred. This resulted in adaptive introgression, which may have further amplified the diversification of white oaks across Eurasia.
Assuntos
Genomas de Plastídeos , Quercus , Ecossistema , Florestas , Genomas de Plastídeos/genética , FilogeniaRESUMO
The Mediterranean Basin harbors a remarkable amount of biodiversity, a high proportion of which is endemic to this region. Here, we present an in-depth study of an angiosperm species complex, in which cryptic taxonomic diversity has been hypothesized. Specifically, we focus on four currently recognized species in the Roucela complex, a well-supported clade in the Campanulaceae/Campanuloideae: Campanula creutzburgii, C. drabifolia, C. erinus, and C. simulans. This study takes a phylogenomic approach, utilizing near-complete plastomes and 130 nuclear loci, to uncover cryptic diversity and test hypotheses regarding hybridization and polyploidy within this clade. Genome size estimates recovered tetraploid and octoploid lineages within the currently recognized, widespread species C. erinus, showing an east-west geographic pattern. Though genomic data clearly differentiate these two cytotypes, we failed to discern morphological differences. The formation of a cryptic octoploid lineage, distributed across the eastern Mediterranean, is hypothesized to be the result of an allopolyploid event in which one parental morphology is retained. The tetraploid C. erinus and C. creutzburgii (also a tetraploid) are implicated as parental lineages. Our results highlight the utility of target-enrichment approaches for obtaining genomic datasets for thorough assessments of species diversity and the importance of carefully considering gene-tree discordance within such datasets.
Assuntos
Biodiversidade , Campanulaceae/classificação , Campanulaceae/genética , Filogenia , Poliploidia , Genoma de Planta , Hibridização Genética , Região do Mediterrâneo , Filogeografia , Análise de Sequência de DNARESUMO
In recent times, evolution has become a central tenet of taxonomy, but nomenclature has consistently been decoupled from the tree-thinking process, often leading to significant issues in reconciling traditional (Linnaean) names with clades in the Tree of Life. Recent evolutionary studies on the Roucela clade, a group of endemic plants found in the Mediterranean Basin, motivated the establishment of phylogenetic concepts to formally anchor clade names on the Campanuloideae (Campanulaceae) tree. These concepts facilitate communication of clades that approximate traditionally defined groups, in addition to naming newly discovered cryptic diversity in a phylogenetic framework.
RESUMO
PREMISE OF THE STUDY: The One Thousand Plant Transcriptomes Project (1KP, 1000+ assembled plant transcriptomes) provides an enormous resource for developing microsatellite loci across the plant tree of life. We developed loci from these transcriptomes and tested their utility. METHODS AND RESULTS: Using software packages and custom scripts, we identified microsatellite loci in 1KP transcriptomes. We assessed the potential for cross-amplification and whether loci were biased toward exons, as compared to markers derived from genomic DNA. We characterized over 5.7 million simple sequence repeat (SSR) loci from 1334 plant transcriptomes. Eighteen percent of loci substantially overlapped with open reading frames (ORFs), and electronic PCR revealed that over half the loci would amplify successfully in conspecific taxa. Transcriptomic SSRs were approximately three times more likely to map to translated regions than genomic SSRs. CONCLUSIONS: We believe microsatellites still have a place in the genomic age-they remain effective and cost-efficient markers. The loci presented here are a valuable resource for researchers.
RESUMO
Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.
RESUMO
At the intersection of geological activity, climatic fluctuations, and human pressure, the Mediterranean Basin - a hotspot of biodiversity - provides an ideal setting for studying endemism, evolution, and biogeography. Here, we focus on the Roucela complex (Campanula subgenus Roucela), a group of 13 bellflower species found primarily in the eastern Mediterranean Basin. Plastid and low-copy nuclear markers were employed to reconstruct evolutionary relationships and estimate divergence times within the Roucela complex using both concatenation and species tree analyses. Niche modeling, ancestral range estimation, and diversification analyses were conducted to provide further insights into patterns of endemism and diversification through time. Diversification of the Roucela clade appears to have been primarily the result of vicariance driven by the breakup of an ancient landmass. We found geologic events such as the formation of the mid-Aegean trench and the Messinian Salinity Crisis to be historically important in the evolutionary history of this group. Contrary to numerous past studies, the onset of the Mediterranean climate has not promoted diversification in the Roucela complex and, in fact, may be negatively affecting these species. This study highlights the diversity and complexity of historical processes driving plant evolution in the Mediterranean Basin.
RESUMO
BACKGROUND: The Campanuloideae (Campanulaceae) are a highly diverse clade of angiosperms found mostly in the Northern Hemisphere, with the highest diversity in temperate areas of the Old World. Chloroplast markers have greatly improved our understanding of this clade but many relationships remain unclear primarily due to low levels of molecular evolution and recent and rapid divergence. Furthermore, focusing solely on maternally inherited markers such as those from the chloroplast genome may obscure processes such as hybridization. In this study we explore the phylogenetic utility of two low-copy nuclear loci from the pentatricopeptide repeat gene family (PPR). Rapidly evolving nuclear loci may provide increased phylogenetic resolution in clades containing recently diverged or closely related taxa. We present results based on both chloroplast and low-copy nuclear loci and discuss the utility of such markers to resolve evolutionary relationships and infer hybridization events within the Campanuloideae clade. RESULTS: The inclusion of low-copy nuclear genes into the analyses provides increased phylogenetic resolution in two species-rich clades containing recently diverged taxa. We also obtain support for the placement of two early diverging lineages (Jasione and Musschia-Gadellia clades) that have previously been unresolved. Furthermore, phylogenetic analyses of PPR loci revealed potential hybridization events for a number of taxa (e.g., Campanula pelviformis and Legousia species). These loci offer greater overall topological support than obtained with plastid DNA alone. CONCLUSION: This study represents the first inclusion of low-copy nuclear genes for phylogenetic reconstruction in Campanuloideae. The two PPR loci were easy to sequence, required no cloning, and the sequence alignments were straightforward across the entire Campanuloideae clade. Although potentially complicated by incomplete lineage sorting, these markers proved useful for understanding the processes of reticulate evolution and resolving relationships at a wide range of phylogenetic levels. Our results stress the importance of including multiple, independent loci in phylogenetic analyses.
Assuntos
Campanulaceae/classificação , Genes de Plantas , Campanulaceae/genética , DNA de Cloroplastos/genética , DNA de Plantas/genética , Evolução Molecular , Dosagem de Genes , Marcadores Genéticos , Variação Genética , Família Multigênica , Filogenia , Proteínas de Plantas/genética , Alinhamento de Sequência , Homologia de Sequência do Ácido Nucleico , Especificidade da EspécieRESUMO
BACKGROUND: Speciose clades usually harbor species with a broad spectrum of adaptive strategies and complex distribution patterns, and thus constitute ideal systems to disentangle biotic and abiotic causes underlying species diversification. The delimitation of such study systems to test evolutionary hypotheses is difficult because they often rely on artificial genus concepts as starting points. One of the most prominent examples is the bellflower genus Campanula with some 420 species, but up to 600 species when including all lineages to which Campanula is paraphyletic. We generated a large alignment of petD group II intron sequences to include more than 70% of described species as a reference. By comparison with partial data sets we could then assess the impact of selective taxon sampling strategies on phylogenetic reconstruction and subsequent evolutionary conclusions. METHODOLOGY/PRINCIPAL FINDINGS: Phylogenetic analyses based on maximum parsimony (PAUP, PRAP), Bayesian inference (MrBayes), and maximum likelihood (RAxML) were first carried out on the large reference data set (D680). Parameters including tree topology, branch support, and age estimates, were then compared to those obtained from smaller data sets resulting from "classification-guided" (D088) and "phylogeny-guided sampling" (D101). Analyses of D088 failed to fully recover the phylogenetic diversity in Campanula, whereas D101 inferred significantly different branch support and age estimates. CONCLUSIONS/SIGNIFICANCE: A short genomic region with high phylogenetic utility allowed us to easily generate a comprehensive phylogenetic framework for the speciose Campanula clade. Our approach recovered 17 well-supported and circumscribed sub-lineages. Knowing these will be instrumental for developing more specific evolutionary hypotheses and guide future research, we highlight the predictive value of a mass taxon-sampling strategy as a first essential step towards illuminating the detailed evolutionary history of diverse clades.