Results 1 - 20 of 51
1.
AoB Plants ; 15(6): plad074, 2023 Dec.
Article in English | MEDLINE | ID: mdl-38130422

ABSTRACT

High-throughput sequencing of herbarium specimens' DNA with short-read platforms has helped explore many biological questions. Here, for the first time, we investigate the potential of using herbarium specimens as a resource for long-read DNA sequencing technologies. We use target capture of 48 low-copy nuclear loci in 12 herbarium specimens of Silene as a basis for long-read sequencing using SMRT PacBio Sequel. The samples were collected between 1932 and 2019. A simple optimization of the size selection protocol enabled the retrieval of both long DNA fragments (>1 kb) and long on-target reads for nine of them. The limited sample size does not enable statistical evaluation of the influence of specimen age on DNA fragmentation, but our results confirm that younger samples, that is, those collected after 1990, are less fragmented and have better sequencing success than specimens collected before this date. Specimens collected between 1990 and 2019 yielded between 167 and 3403 on-target reads > 1 kb. They enabled recovery of between 34 and 48 loci (i.e. all loci). Three samples from specimens collected before 1990 did not yield on-target reads > 1 kb. The four other samples collected before this date yielded up to 144 reads and recovered up to 25 loci. Young herbarium specimens seem promising for long-read sequencing. However, older ones have partly failed. Further exploration would be necessary to statistically test and understand the potential of older material in the quest for long reads. We would encourage greatly expanding the sample size and comparing different taxonomic groups.
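The two per-sample figures reported in the abstract above (number of on-target reads > 1 kb and number of loci recovered) can be illustrated with a minimal sketch. This is not the authors' pipeline: the input layout (read ID, assigned locus or None, read length), the function name, and the toy reads are all hypothetical, chosen only to show the counting logic.

```python
from collections import defaultdict

MIN_LENGTH_BP = 1000  # the "> 1 kb" threshold quoted in the abstract


def summarize_long_reads(reads, min_length=MIN_LENGTH_BP):
    """Count on-target reads longer than min_length and the loci they cover.

    reads: iterable of (read_id, locus, length_bp); locus is None for
    off-target reads. This tuple layout is an assumption of the sketch.
    Returns (number_of_long_on_target_reads, number_of_loci_recovered).
    """
    per_locus = defaultdict(int)
    for _read_id, locus, length_bp in reads:
        # Keep only reads that map to a target locus and exceed the threshold.
        if locus is not None and length_bp > min_length:
            per_locus[locus] += 1
    return sum(per_locus.values()), len(per_locus)


# Hypothetical toy reads, not data from the study.
toy_reads = [
    ("r1", "locus01", 1500),  # on-target and long: counted
    ("r2", "locus01", 800),   # on-target but too short
    ("r3", None, 2500),       # long but off-target
    ("r4", "locus02", 1001),  # on-target and long: counted
    ("r5", "locus03", 1000),  # exactly 1 kb, not strictly longer
]

n_reads, n_loci = summarize_long_reads(toy_reads)
print(n_reads, n_loci)
```

A locus counts as recovered here as soon as one long on-target read covers it; the study may have applied a stricter criterion.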

2.
PhytoKeys ; 195: 143-160, 2022.
Article in English | MEDLINE | ID: mdl-36761355

ABSTRACT

Silene neglecta has been misunderstood and confused with S. nocturna, although several morphological characters (petal shape, calyx indumentum, hairiness of stamen filaments, seed size, seed-coat surface and shape) allow separation of these species. Moreover, S. mutabilis (which has been considered conspecific with S. neglecta) and S. martinolii (an alleged endemic species to south-western Sardinia) are considered here as taxonomic synonyms of S. nocturna and S. neglecta, respectively. These taxonomic conclusions are strongly supported by multivariate morphometric analyses of 21 characters.

3.
Mol Phylogenet Evol ; 162: 107199, 2021 09.
Article in English | MEDLINE | ID: mdl-33984468

ABSTRACT

Silene (Caryophyllaceae) is distributed predominantly in the Northern Hemisphere, where it is most diverse around the Mediterranean Basin. The genus is also well represented in North Africa, extending into tropical, sub-Saharan and southern Africa. Eight native species are recognized in southern Africa, taxonomically placed in two sections: Elisanthe and Silene s.l. Although the taxonomy of the southern African taxa has recently been revised, their phylogenetic relationships and biogeographic history remain unclear. This study aims to infer the phylogenetic position and geographic origins of the southern African taxa. We generated DNA sequences of nuclear and plastid loci from several individuals belonging to all eight species of Silene recognized from southern Africa, and combined our DNA sequences with existing data representing species from major clades (i.e. sections) based on the recently revised Silene infrageneric taxonomy. We used a Bayesian coalescent species tree continuous diffusion approach to co-estimate the species tree and the ancestral areas of representative members of the genus. Our results show that the perennial southern African members of section Elisanthe form a strongly-supported clade with the Eurasian annual S. noctiflora and the Central Asian perennial S. turkestanica. The rest of the perennial species form a strongly-supported clade together with the annual S. aethiopica, which is nested in a larger Mediterranean clade comprising mostly annual species classified in section Silene s.l. Estimates of ancestral areas indicate a late Pleistocene dispersal to southern Africa from central and East Africa for the sub-Saharan members of section Silene s.l. The Elisanthe clade is inferred to have colonized southern Africa through long-distance dispersal from Eurasia during the late Pleistocene. Our findings support the hypothesis of a relatively recent colonization into southern Africa resulting from two independent dispersal events during the Pleistocene.


Subject(s)
Phylogeny , Phylogeography , Silene/classification , Silene/genetics , Bayes Theorem , Cell Nucleus/genetics , Humans , Plastids/genetics , South Africa
4.
Mol Phylogenet Evol ; 154: 106994, 2021 01.
Article in English | MEDLINE | ID: mdl-33250446

ABSTRACT

Species delimitation has advanced from a purely phenotypic exercise to a branch of science that integrates multiple sources of data to identify independently evolving lineages that can be treated as species. We here test species limits in the avian Lesser Short-toed Lark Alaudala rufescens-Sand Lark A. raytal complex, which has an intricate taxonomic history, ranging from a single to three recognised species, with different inclusiveness in different treatments. Our integrative taxonomic approach is based on a combination of DNA sequences, plumage, biometrics, songs, song-flights, geographical distributions, habitat, and bioclimatic data, and uses various methods, including a species delimitation program (STACEY) based on the multispecies coalescent model. We propose that four species should be recognised: Lesser Short-toed Lark A. rufescens (sensu stricto), Heine's Short-toed Lark A. heinei, Asian Short-toed Lark A. cheleensis and Sand Lark A. raytal. There is also some evidence suggesting lineage separation within A. cheleensis and A. raytal, but additional data are required to evaluate this. The species delimitation based on STACEY agrees well with the non-genetic data. Although computer-based species delimitation programs can be useful in identifying independently evolving lineages, we stress that whenever possible, species hypotheses proposed by these programs should be tested by independent, non-genetic data. Our results highlight the difficulty and subjectivity of delimiting lineages and species, especially at early stages in the speciation process.


Subject(s)
Passeriformes/classification , Phylogeny , Animals , Bayes Theorem , Choice Behavior , Climate , Cytochromes b/genetics , Discriminant Analysis , Ecosystem , Feathers/anatomy & histology , Flight, Animal/physiology , Geography , Humidity , Passeriformes/anatomy & histology , Passeriformes/physiology , Rain , Species Specificity , Temperature , Vocalization, Animal/physiology
5.
PhytoKeys ; 159: 1-34, 2020.
Article in English | MEDLINE | ID: mdl-32973386

ABSTRACT

A putatively monophyletic group of annual Silene species is revised taxonomically and described as the new section S. sect. Arenosae. The species of this section were previously treated as a part of a widely circumscribed and polyphyletic S. sect. Rigidulae. Silene sect. Arenosae as circumscribed here consists of nine species. Members of the section show a predominantly E Mediterranean to SW Asian distribution pattern from Turkey southward to Egypt and eastward to Iran and Pakistan, although most of the species have a limited distribution range. The species of S. sect. Arenosae are characterized by narrowly lanceolate calyx teeth, which are often highly polymorphic, and lanceolate to oblanceolate (non-spathulate) basal leaves. The provided taxonomic revision is based on morphological characters and supported by phylogenetic analyses of two nuclear loci (nrITS and an intron of the RPB2 gene) and one chloroplast locus (the intron of the rps16 gene). The species descriptions are formalized using a novel implementation of the Prometheus Description Model.

6.
PeerJ ; 7: e6399, 2019.
Article in English | MEDLINE | ID: mdl-30783571

ABSTRACT

Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). We note that macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which phylogeny building increasingly relies, and we suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress.

7.
Sci Rep ; 9(1): 1045, 2019 01 31.
Article in English | MEDLINE | ID: mdl-30705300

ABSTRACT

Switches in heterogamety are known to occur in both animals and plants. Although plant sex determination systems probably often evolved more recently than those in several well-studied animals, including mammals, and have had less time for switches to occur, we previously detected a switch in heterogamety in the plant genus Silene: section Otites has both female and male heterogamety, whereas S. latifolia and its close relatives, in a different section of the genus, Melandrium (subgenus Behenantha), all have male heterogamety. Here we analyse the evolution of sex chromosomes in section Otites, which is estimated to have evolved only about 0.55 MYA. Our study confirms female heterogamety in S. otites and newly reveals female heterogamety in S. borysthenica. Sequence analyses and genetic mapping show that the sex-linked regions of these two species are the same, but the region in S. colpophylla, a close relative with male heterogamety, is different. The sex chromosome pairs of S. colpophylla and S. otites each correspond to an autosome of the other species, and both differ from the XY pair in S. latifolia. Silene section Otites species are suitable for detailed studies of the events involved in such changes, and our phylogenetic analysis suggests a possible change from female to male heterogamety within this section. Our analyses suggest a possibility that has so far not been considered, change in heterogamety through hybridization, in which a male-determining chromosome from one species is introgressed into another one, and over-rides its previous sex-determining system.


Subject(s)
Chromosomes, Plant/genetics , Silene/genetics , Bayes Theorem , Genetic Linkage/genetics , Phylogeny
8.
Syst Biol ; 68(1): 32-46, 2019 01 01.
Article in English | MEDLINE | ID: mdl-29771371

ABSTRACT

Advances in high-throughput sequencing techniques now allow relatively easy and affordable sequencing of large portions of the genome, even for nonmodel organisms. Many phylogenetic studies reduce costs by focusing their sequencing efforts on a selected set of targeted loci, commonly enriched using sequence capture. The advantage of this approach is that it recovers a consistent set of loci, each with high sequencing depth, which leads to more confidence in the assembly of target sequences. High sequencing depth can also be used to identify phylogenetically informative allelic variation within sequenced individuals, but allele sequences are infrequently assembled in phylogenetic studies. Instead, many scientists perform their phylogenetic analyses using contig sequences which result from the de novo assembly of sequencing reads into contigs containing only canonical nucleobases, and this may reduce both statistical power and phylogenetic accuracy. Here, we develop an easy-to-use pipeline to recover allele sequences from sequence capture data, and we use simulated and empirical data to demonstrate the utility of integrating these allele sequences to analyses performed under the multispecies coalescent model. Our empirical analyses of ultraconserved element locus data collected from the South American hummingbird genus Topaza demonstrate that phased allele sequences carry sufficient phylogenetic information to infer the genetic structure, lineage divergence, and biogeographic history of a genus that diversified during the last 3 myr. The phylogenetic results support the recognition of two species and suggest a high rate of gene flow across large distances of rainforest habitats but rare admixture across the Amazon River. Our simulations provide evidence that analyzing allele sequences leads to more accurate estimates of tree topology and divergence times than the more common approach of using contig sequences.


Subject(s)
Alleles , Classification/methods , Conserved Sequence/genetics , Phylogeny , Animals , Birds/classification , Birds/genetics , Computer Simulation , Ecosystem
9.
Front Genet ; 10: 1407, 2019.
Article in English | MEDLINE | ID: mdl-32153629

ABSTRACT

High-throughput DNA sequencing techniques enable time- and cost-effective sequencing of large portions of the genome. Instead of sequencing and annotating whole genomes, many phylogenetic studies focus sequencing effort on large sets of pre-selected loci, which further reduces costs and bioinformatic challenges while increasing coverage. One common approach that enriches loci before sequencing is often referred to as target sequence capture. This technique has been shown to be applicable to phylogenetic studies of greatly varying evolutionary depth. Moreover, it has proven to produce powerful, large multi-locus DNA sequence datasets suitable for phylogenetic analyses. However, target capture requires careful considerations, which may greatly affect the success of experiments. Here we provide a simple flowchart for designing phylogenomic target capture experiments. We discuss necessary decisions from the identification of target loci to the final bioinformatic processing of sequence data. We outline challenges and solutions related to the taxonomic scope, sample quality, and available genomic resources of target capture projects. We hope this review will serve as a useful roadmap for designing and carrying out successful phylogenetic target capture studies.

10.
PeerJ ; 6: e4388, 2018.
Article in English | MEDLINE | ID: mdl-29576938

ABSTRACT

Several studies have demonstrated the contribution of northern immigrants to the flora of the tropical Andes, the world's richest and most diverse biodiversity hotspot. However, much less is known about the biogeographic history and diversification of Andean groups with southern origins, although it has been suggested that northern and southern groups have contributed roughly equally to the high Andean (i.e., páramo) flora. Here we infer the evolutionary history of the southern hemisphere plant genus Gunnera, a lineage with a rich fossil history and an important ecological role as an early colonising species characteristic of wet, montane environments. Our results show striking contrasts in species diversification: some species may have persisted for some 90 million years, whereas others originated less than 2 Ma ago. The outstanding longevity of the group is likely linked to a high degree of niche conservatism across its highly disjunct range, whereby Gunnera tracks damp and boggy soils in cool habitats. Colonisation of the northern Andes is related to Quaternary climate change, with subsequent rapid diversification appearing to be driven by their ability to take advantage of environmental opportunities. This study demonstrates the composite origin of a mega-diverse biota.

11.
BMC Evol Biol ; 18(1): 9, 2018 Jan 27.
Article in English | MEDLINE | ID: mdl-29374461

ABSTRACT

BACKGROUND: Whole genome duplication plays a central role in plant evolution. There are two main classes of polyploids: autopolyploids, which arise within one species by doubling of similar homologous genomes, and allopolyploids (hybrid polyploids), which arise via hybridization and subsequent doubling of nonhomologous (homoeologous) genomes. The distinction between polyploid origins can be made using gene phylogenies, if alleles from each genome can be correctly retrieved. We examined whether two closely related tetraploid Mediterranean shrubs (Medicago arborea and M. strasseri) have an allopolyploid origin, a question that has remained unsolved despite substantial previous research. We sequenced and analyzed ten low-copy nuclear genes from these and related species, phasing all alleles. To test the effect of allele phasing on the ability to recover the evolutionary origin of polyploids, we compared these results to analyses using unphased sequences. RESULTS: In eight of the gene trees the alleles inferred from the tetraploids formed two clades, in a non-sister relationship. Each of these clades was more closely related to alleles sampled from other species of Medicago, a pattern typical of allopolyploids. However, we also observed that alleles from one of the remaining genes formed two clades that were sister to one another, as is expected for autopolyploids. Trees inferred from unphased sequences were very different, with the tetraploids often placed in poorly supported and different positions compared to results obtained using phased alleles. CONCLUSIONS: The complex phylogenetic history of M. arborea and M. strasseri is explained predominantly by shared allotetraploidy. We also observed that an increase in woodiness is correlated with polyploidy in this group of species and present a new possibility that woodiness could be a transgressive phenotype. Correctly phased homoeologues are likely to be critical for inferring the hybrid origin of allopolyploid species, when most genes retain more than one homoeologue. Ignoring homoeologous variation by merging the homoeologues can obscure the signal of hybrid polyploid origins and produce inaccurate results.


Subject(s)
Alleles , Medicago/genetics , Polyploidy , Base Sequence , Evolution, Molecular , Genes, Plant , Hybridization, Genetic , Phylogeny , Population Density , Species Specificity
12.
Front Genet ; 9: 639, 2018.
Article in English | MEDLINE | ID: mdl-30619464

ABSTRACT

The Andes are an important biogeographic region in South America extending for about 8000 km from Venezuela to Argentina. They are, along with the Patagonian steppes, the main distribution area of ca. 18 polyploid species of Silene sect. Physolychnis. Using nuclear ITS and plastid psbE-petG and matK sequences, flow cytometric ploidy level estimations and chromosome counts, and including 13 South American species, we explored the origin and diversification of this group. Our data suggest a single, late Pliocene or early Pleistocene migration of the North American S. verecunda lineage to South America, which was followed by dispersal and diversification of this tetraploid lineage in the Andes, other Argentinian mountain ranges and the Patagonian steppes. Later in the Pleistocene, South American populations hybridized with the S. uralensis lineage, which led to allopolyploidisation and the origin of the decaploid S. chilensis and S. echegarayi, which occur at high elevations. Additionally, we show that the morphological differentiation in leaf shape correlated with divergent habitats (high elevation Andes vs. lower elevation Patagonian steppes) is also supported phylogenetically, especially in the ITS tree. Lastly, the species boundaries among the narrow-leaved Patagonian steppe species are poorly resolved and need more thorough taxonomic revision.

13.
J Plant Res ; 130(1): 107-116, 2017 Jan.
Article in English | MEDLINE | ID: mdl-27864639

ABSTRACT

Limosella is a small aquatic genus of Scrophulariaceae of twelve species, of which one is distributed in northern circumpolar regions, two in southern circumpolar regions, two in the Americas, one endemic to Australia, and six in tropical or southern Africa or both. The Australasian L. curdieana has always been considered distinct but its close phylogenetic relationships have never been inferred. Here, we investigated the following alternative phylogenetic hypotheses based on comparative leaf morphology and habitat preferences or floral morphology: (1) L. curdieana is sister to the African L. grandiflora; or (2) it is closely related to a group of other African species and the northern circumpolar L. aquatica. We tested these hypotheses in a phylogenetic framework using DNA sequence data from four plastid DNA regions and the nuclear ITS region. These were analyzed using maximum parsimony and Bayesian inference. We obtained moderately resolved, partially conflicting phylogenies, supporting that accessions of L. grandiflora form the sister group to the rest of the genus and that L. curdieana groups with the African taxa, L. africana and L. major, and L. aquatica. Thus, the molecular evidence supports the second hypothesis. A biogeographic analysis suggests an out-of-southern Africa scenario and several dispersal events in the Southern Hemisphere. Past dispersal from southern Africa to Australasia is suggested, yet it cannot be excluded that a route via tropical Africa and temperate Asia has existed.


Subject(s)
Scrophulariaceae/genetics , Africa , Asia , Bayes Theorem , DNA, Intergenic/genetics , DNA, Plant/genetics , Evolution, Molecular , Phylogeny , Phylogeography , Plant Dispersal , Plastids/genetics , Scrophulariaceae/physiology , Sequence Analysis, DNA
14.
Syst Biol ; 66(2): 152-166, 2017 Mar 01.
Article in English | MEDLINE | ID: mdl-27616324

ABSTRACT

Rapidly growing biological data, including molecular sequences and fossils, hold an unprecedented potential to reveal how evolutionary processes generate and maintain biodiversity. However, researchers often have to develop their own idiosyncratic workflows to integrate and analyze these data for reconstructing time-calibrated phylogenies. In addition, divergence times estimated under different methods and assumptions, and based on data of various quality and reliability, should not be combined without proper correction. Here we introduce a modular framework termed SUPERSMART (Self-Updating Platform for Estimating Rates of Speciation and Migration, Ages, and Relationships of Taxa), and provide a proof of concept for dealing with the moving targets of evolutionary and biogeographical research. This framework assembles comprehensive data sets of molecular and fossil data for any taxa and infers dated phylogenies using robust species tree methods, also allowing for the inclusion of genomic data produced through next-generation sequencing techniques. We exemplify the application of our method by presenting phylogenetic and dating analyses for the mammal order Primates and for the plant family Arecaceae (palms). We believe that this framework will provide a valuable tool for a wide range of hypothesis-driven research questions in systematics, biogeography, and evolution. SUPERSMART will also accelerate the inference of a "Dated Tree of Life" where all node ages are directly comparable. [Bayesian phylogenetics; data mining; divide-and-conquer methods; GenBank; multilocus multispecies coalescent; next-generation sequencing; palms; primates; tree calibration.].


Subject(s)
Classification/methods , Fossils , Phylogeny , Age Factors , Animal Migration , Animals , Arecaceae/classification , Bayes Theorem , Primates/classification , Reproducibility of Results , Time
15.
Mol Phylogenet Evol ; 102: 1-8, 2016 09.
Article in English | MEDLINE | ID: mdl-27233442

ABSTRACT

Species delimitation is a major focus of biosystematics. In recent years, considerable progress has been achieved with the development of the multispecies coalescent (MSC) model, where species constitute the branches of the species tree or network. However, researchers are faced with the limitation that the MSC method of choice often requires a priori assignment of individuals to species. This not only introduces subjectivity into the analyses, but may also lead to meaningless species tree hypotheses, if the allele-to-species assignments are inaccurate. DISSECT is a recently introduced method that does not require a priori allele-to-species assignments, but instead examines the posterior probabilities of groupings (clusterings) of individuals under study. Using the DISSECT approach, we analysed genetic data from 75 individual plants belonging to the Silene aegyptiaca species complex that has previously been divided into 3-5 species. Marginal likelihood estimates from *BEAST analyses, run with predefined species classifications, strongly favour those compatible with the DISSECT result over those from morphology- and geography-based taxonomy. We found at least nine species, including several cryptic ones, for which no clear geographical or morphological patterns are correlated. However, the limited data and the possibility of unmodelled processes mean there is still much uncertainty about the true number of MSC species, and for taxonomic purposes, other criteria might be relevant. Nevertheless, we argue that the approach signifies an important step towards objective and testable species delimitations in any organismal group. In particular, it makes it possible to avoid biologically irrelevant species classifications.


Subject(s)
Phylogeny , Silene/classification , Software , Bayes Theorem , Geography , Likelihood Functions , Probability , Species Specificity
16.
New Phytol ; 211(2): 719-34, 2016 07.
Article in English | MEDLINE | ID: mdl-27037925

ABSTRACT

The flora on the isolated high African mountains or 'sky islands' is remarkable for its peculiar adaptations, local endemism and striking biogeographical connections to remote parts of the world. Ages of the plant lineages and the timing of their radiations have frequently been debated but remain contentious as there are few estimates based on explicit models and fossil-calibrated molecular clocks. We used the plastid region maturaseK (matK) and a Caryophylloflora paleogenica fossil to infer the age of the genus Lychnis, and constructed a data set of three plastid (matK; a ribosomal protein S16 (rps16); and an intergenic spacer (psbE-petL)) and two nuclear (internal transcribed spacer (ITS) and a region spanning exon 18-24 in the second largest subunit of RNA polymerase II (RPB2)) loci for joint estimation of the species tree and divergence time of the African representatives. The time of divergence of the African high-altitude Lychnis was placed in the late Miocene to early Pliocene. A single speciation event was inferred in the early Pliocene; subsequent speciation took place sporadically from the late Pliocene to the middle Pleistocene. We provide further support for a Eurasian origin of the African 'sky islands' floral elements, which seem to have been recruited via dispersals at different times: some old, as in Lychnis, and others very recent. We show that dispersal and diversification within Africa play an important role in shaping these isolated plant communities.


Subject(s)
Fossils , Lychnis/genetics , Radiometric Dating , Africa , Calibration , DNA, Plant/genetics , Genetic Loci , Geography , Phylogeny , Species Specificity
17.
Syst Biol ; 64(3): 448-71, 2015 May.
Article in English | MEDLINE | ID: mdl-25604357

ABSTRACT

There is a rising awareness that species trees are best inferred from multiple loci while taking into account processes affecting individual gene trees, such as substitution model error (failure of the model to account for the complexity of the data) and coalescent stochasticity (presence of incomplete lineage sorting [ILS]). Although most studies have been carried out in the context of dichotomous species trees, these processes operate also in more complex evolutionary histories involving multiple hybridizations and polyploidy. Recently, methods have been developed that accurately handle ILS in allopolyploids, but they are thus far restricted to networks of diploids and tetraploids. We propose a procedure that improves on this limitation by designing a workflow that assigns homoeologs to hypothetical diploid ancestral genomes prior to genome tree construction. Conflicting assignment hypotheses are evaluated against substitution model error and coalescent stochasticity. Incongruence that cannot be explained by stochastic mechanisms needs to be explained by other processes (e.g., homoploid hybridization or paralogy). The data can then be filtered to build multilabeled genome phylogenies using inference methods that can recover species trees, either in the face of substitution model error and coalescent stochasticity alone, or while simultaneously accounting for hybridization. Methods are already available for folding the resulting multilabeled genome phylogeny into a network. We apply the workflow to the reconstruction of the reticulate phylogeny of the plant genus Fumaria (Papaveraceae) with ploidal levels ranging from 2x to 14x. We describe the challenges in recovering nuclear NRPB2 homoeologs in high ploidy species while combining in vivo cloning and direct sequencing techniques. Using parametric bootstrapping simulations we assign nuclear homoeologs and chloroplast sequences (four concatenated loci) to their common hypothetical diploid ancestral genomes. As these assignments hinge on effective population size assumptions, we investigate how varying these assumptions impacts the recovered multilabeled genome phylogeny.


Subject(s)
Classification/methods , Fumaria/classification , Fumaria/genetics , Genome, Plant/genetics , Phylogeny , Polyploidy , Chloroplasts/genetics , Sequence Homology
18.
Syst Biol ; 64(1): 84-101, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25281848

ABSTRACT

Allopolyploidization accounts for a significant fraction of speciation events in many eukaryotic lineages. However, existing phylogenetic and dating methods require tree-like topologies and are unable to handle the network-like phylogenetic relationships of lineages containing allopolyploids. No explicit framework has so far been established for evaluating competing network topologies, and few attempts have been made to date phylogenetic networks. We used a four-step approach to generate a dated polyploid species network for the cosmopolitan angiosperm genus Viola L. (Violaceae Batsch). The genus contains ca. 600 species and both recent (neo-) and more ancient (meso-) polyploid lineages distributed over 16 sections. First, we obtained DNA sequences of three low-copy nuclear genes and one chloroplast region, from 42 species representing all 16 sections. Second, we obtained fossil-calibrated chronograms for each nuclear gene marker. Third, we determined the most parsimonious multilabeled genome tree and its corresponding network, resolved at the section (not the species) level. Reconstructing the "correct" network for a set of polyploids depends on recovering all homoeologs, i.e., all subgenomes, in these polyploids. Assuming the presence of Viola subgenome lineages that were not detected by the nuclear gene phylogenies ("ghost subgenome lineages") significantly reduced the number of inferred polyploidization events. We identified the most parsimonious network topology from a set of five competing scenarios differing in the interpretation of homoeolog extinctions and lineage sorting, based on (i) fewest possible ghost subgenome lineages, (ii) fewest possible polyploidization events, and (iii) least possible deviation from expected ploidy as inferred from available chromosome counts of the involved polyploid taxa. Finally, we estimated the homoploid and polyploid speciation times of the most parsimonious network. Homoploid speciation times were estimated by coalescent analysis of gene tree node ages. Polyploid speciation times were estimated by comparing branch lengths and speciation rates of lineages with and without ploidy shifts. Our analyses recognize Viola as an old genus (crown age 31 Ma) whose evolutionary history has been profoundly affected by allopolyploidy. Between 16 and 21 allopolyploidizations are necessary to explain the diversification of the 16 major lineages (sections) of Viola, suggesting that allopolyploidy has accounted for a high percentage (between 67% and 88%) of the speciation events at this level. The theoretical and methodological approaches presented here for (i) constructing networks and (ii) dating speciation events within a network, have general applicability for phylogenetic studies of groups where allopolyploidization has occurred. They make explicit use of a hitherto underexplored source of ploidy information from chromosome counts to help resolve phylogenetic cases where incomplete sequence data hampers network inference. Importantly, the coalescent-based method used herein circumvents the assumption of tree-like evolution required by most techniques for dating speciation events.


Subject(s)
Phylogeny , Viola/classification , Viola/genetics , Evolution, Molecular , Fossils , Polyploidy , Time
19.
Bioinformatics ; 31(7): 991-8, 2015 Apr 01.
Article in English | MEDLINE | ID: mdl-25422051

ABSTRACT

MOTIVATION: The multispecies coalescent model provides a formal framework for the assignment of individual organisms to species, where the species are modeled as the branches of the species tree. None of the available approaches so far have simultaneously co-estimated all the relevant parameters in the model, without restricting the parameter space by requiring a guide tree and/or prior assignment of individuals to clusters or species. RESULTS: We present DISSECT, which explores the full space of possible clusterings of individuals and species tree topologies in a Bayesian framework. It uses an approximation to avoid the need for reversible-jump Markov Chain Monte Carlo, in the form of a prior that is a modification of the birth-death prior for the species tree. It incorporates a spike near zero in the density for node heights. The model has two extra parameters: one controls the degree of approximation and the second controls the prior distribution on the numbers of species. It is implemented as part of BEAST and requires only a few changes from a standard *BEAST analysis. The method is evaluated on simulated data and demonstrated on an empirical dataset. The method is shown to be insensitive to the degree of approximation, but quite sensitive to the second parameter, suggesting that large numbers of sequences are needed to draw firm conclusions. AVAILABILITY AND IMPLEMENTATION: http://tree.bio.ed.ac.uk/software/beast/, http://www.indriid.com/dissectinbeast.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Algorithms , Bayes Theorem , Computational Biology/methods , Genetic Speciation , Gophers/genetics , Phylogeny , Silene/genetics , Animals , Computer Simulation , Markov Chains , Monte Carlo Method
20.
PLoS One ; 9(10): e109704, 2014.
Article in English | MEDLINE | ID: mdl-25329401

ABSTRACT

Next-generation sequencing technology has increased the capacity to generate molecular data for plant biological research, including phylogenetics, and can potentially contribute to resolving complex phylogenetic problems. The evolutionary history of Medicago L. (Leguminosae: Trifolieae) remains unresolved due to incongruence between published phylogenies. Identification of the processes causing this genealogical incongruence is essential for the inference of a correct species phylogeny of the genus and requires that more molecular data, preferably from low-copy nuclear (LCN) genes, are obtained across different species. Here we report the development of 50 novel LCN markers in Medicago and assess the phylogenetic properties of each marker. We used the genomic resources available for Medicago truncatula Gaertn., hybridisation-based gene enrichment (sequence capture) techniques and next-generation sequencing to generate sequences. This approach proves to be a cost-effective alternative to amplicon sequencing in phylogenetic studies at the genus or tribe level and allows for an increase in number and size of targeted loci. Substitution rate estimates for each of the 50 loci are provided, and an overview of the variation in substitution rates among a large number of low-copy nuclear genes in plants is presented for the first time. Aligned sequences of major species lineages of Medicago and its sister genus are made available and can be used in further probe development for sequence capture of the same markers.


Subject(s)
Genetic Loci , Genome, Plant , Medicago/genetics , Phylogeny , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA