RESUMEN
Natural hybridization can have a profound evolutionary impact, with consequences ranging from the extinction of rare taxa to the origin of new species. Natural hybridization is particularly common in plants; however, our understanding of the general factors that promote or prevent hybridization is hampered by the highly variable outcomes in different lineages. Here, we quantify the influence of different predictors on hybrid formation across species from an entire flora. We combine estimates of hybridization with ecological attributes and a new species-level phylogeny for over 1,100 UK flowering plant species. Our results show that genetic factors, particularly parental genetic distance, as well as phylogenetic position and ploidy, are key determinants of hybrid formation, whereas many other factors such as range overlap and genus size explain much less variation in hybrid formation. Overall, intrinsic genetic factors shape the evolutionary and ecological consequences of natural hybridization across species in a flora.
Asunto(s)
Evolución Biológica , Ploidias , Filogenia , Hibridación de Ácido Nucleico , Hibridación GenéticaRESUMEN
We present a complete generic-level phylogeny of the complex thalloid liverworts, a lineage that includes the model system Marchantia polymorpha. The complex thalloids are remarkable for their slow rate of molecular evolution and for being the only extant plant lineage to differentiate gas exchange tissues in the gametophyte generation. We estimated the divergence times and analyzed the evolutionary trends of morphological traits, including air chambers, rhizoids and specialized reproductive structures. A multilocus dataset was analyzed using maximum likelihood and Bayesian approaches. Relative rates were estimated using local clocks. Our phylogeny cements the early branching in complex thalloids. Marchantia is supported in one of the earliest divergent lineages. The rate of evolution in organellar loci is slower than for other liverwort lineages, except for two annual lineages. Most genera diverged in the Cretaceous. Marchantia polymorpha diversified in the Late Miocene, giving a minimum age estimate for the evolution of its sex chromosomes. The complex thalloid ancestor, excluding Blasiales, is reconstructed as a plant with a carpocephalum, with filament-less air chambers opening via compound pores, and without pegged rhizoids. Our comprehensive study of the group provides a temporal framework for the analysis of the evolution of critical traits essential for plants during land colonization.
Asunto(s)
Marchantia/anatomía & histología , Filogenia , Secuencia de Bases , Funciones de Verosimilitud , Mitocondrias/genética , Plastidios/genética , Factores de TiempoRESUMEN
We present a genome assembly from an individual specimen of Luzula sylvatica (great wood-rush; Tracheophyta; Magnoliopsida; Poales; Juncaceae). The genome sequence is 444.5 megabases in span. Most of the assembly is scaffolded into 6 chromosomal pseudomolecules. The mitochondrial and plastid genome assemblies have lengths of 633.36 kilobases and 201.32 kilobases in length, respectively.
RESUMEN
Biodiversity genomics research requires reliable organismal identification, which can be difficult based on morphology alone. DNA-based identification using DNA barcoding can provide confirmation of species identity and resolve taxonomic issues but is rarely used in studies generating reference genomes. Here, we describe the development and implementation of DNA barcoding for the Darwin Tree of Life Project (DToL), which aims to sequence and assemble high quality reference genomes for all eukaryotic species in Britain and Ireland. We present a standardised framework for DNA barcode sequencing and data interpretation that is then adapted for diverse organismal groups. DNA barcoding data from over 12,000 DToL specimens has identified up to 20% of samples requiring additional verification, with 2% of seed plants and 3.5% of animal specimens subsequently having their names changed. We also make recommendations for future developments using new sequencing approaches and streamlined bioinformatic approaches.
Identifying species based solely on their morphology can be difficult. DNA-based identification using DNA barcoding can aid species identification, but can be challenging to implement in biodiversity projects sampling diverse organismal groups. Here, we describe the development and implementation of DNA barcoding for the Darwin Tree of Life Project (DToL), which aims to sequence and assemble high quality reference genomes for all eukaryotic species in Britain and Ireland. We discuss how a standardised approach has been adapted by each partner to suit different organismal groups, show the efficacy of this approach for confirming species identities and resolving taxonomic issues, and make recommendations for future developments.
RESUMEN
Recent technological advances in long-read high-throughput sequencing and assembly methods have facilitated the generation of annotated chromosome-scale whole-genome sequence data for evolutionary studies; however, generating such data can still be difficult for many plant species. For example, obtaining high-molecular-weight DNA is typically impossible for samples in historical herbarium collections, which often have degraded DNA. The need to fast-freeze newly collected living samples to conserve high-quality DNA can be complicated when plants are only found in remote areas. Therefore, short-read reduced-genome representations, such as target capture and genome skimming, remain important for evolutionary studies. Here, we review the pros and cons of each technique for non-model plant taxa. We provide guidance related to logistics, budget, the genomic resources previously available for the target clade, and the nature of the study. Furthermore, we assess the available bioinformatic analyses, detailing best practices and pitfalls, and suggest pathways to combine newly generated data with legacy data. Finally, we explore the possible downstream analyses allowed by the type of data generated using each technique. We provide a practical guide to help researchers make the best-informed choice regarding reduced genome representation for evolutionary studies of non-model plants in cases where whole-genome sequencing remains impractical.
RESUMEN
BACKGROUND: PacBio HiFi sequencing provides highly accurate long-read sequencing datasets which are of great advantage for whole genome sequencing projects. One limitation of the method is the requirement for high quality, high molecular weight input DNA. This can be particularly challenging for plants that frequently contain common and species-specific secondary metabolites, which often interfere with downstream processes. Cape Primroses (genus Streptocarpus), are some of these recalcitrant plants and are selected here as material to develop a high quality, high molecular weight DNA extraction protocol for long read genome sequencing. RESULTS: We developed a DNA extraction method for PacBio HiFi sequencing for Streptocarpus grandis and Streptocarpus kentaniensis. A CTAB lysis buffer was employed to avoid guanidine, and the traditional chloroform and phenol purification steps were replaced with pre-lysis sample washes. Best cells/nucleus lysis was achieved with 4 h at 58 °C. The obtained high quality and high molecular weight DNAs were tested in PacBio SMRTBell™ library preparations, which resulted in circular consensus sequencing (CCS) reads from 17 to 27 Gb per cell, and a read length N50 from 14 to 17 kbp. To evaluate the quality of the reads for whole genome sequencing, they were assembled with HiFiasm into draft genomes, with N50 = 49 Mb and 23 Mb, and L50 = 10 and 11. The longest contigs were 95 Mb and 57 Mb respectively, showing good contiguity as these are longer than the theoretical chromosome length (genome size/chromosome number) of 78 Mb and 55 Mb, for S. grandis and S. kentaniensis respectively. CONCLUSIONS: DNA extraction is a critical step towards obtaining a complete genome assembly. Our DNA extraction method here provided the required high quality, high molecular weight DNA for successful standard-input PacBio HiFi library preparation. The contigs from those reads showed a high contiguity, providing a good starting draft assembly towards obtaining a complete genome. The results obtained here were highly promising, and demonstrated that the DNA extraction method developed here is compatible with PacBio HiFi sequencing and suitable for de novo whole genome sequencing projects of plants.
RESUMEN
Intentionally preserved biological material in natural history collections represents a vast repository of biodiversity. Advances in laboratory and sequencing technologies have made these specimens increasingly accessible for genomic analyses, offering a window into the genetic past of species and often permitting access to information that can no longer be sampled in the wild. Due to their age, preparation and storage conditions, DNA retrieved from museum and herbarium specimens is often poor in yield, heavily fragmented and biochemically modified. This not only poses methodological challenges in recovering nucleotide sequences, but also makes such investigations susceptible to environmental and laboratory contamination. In this paper, we review the practical challenges associated with making the recovery of DNA sequence data from museum collections more routine. We first review key operational principles and issues to address, to guide the decision-making process and dialogue between researchers and curators about when and how to sample museum specimens for genomic analyses. We then outline the range of steps that can be taken to reduce the likelihood of contamination including laboratory set-ups, workflows and working practices. We finish by presenting a series of case studies, each focusing on protocol practicalities for the application of different mainstream methodologies to museum specimens including: (i) shotgun sequencing of insect mitogenomes, (ii) whole genome sequencing of insects, (iii) genome skimming to recover plant plastid genomes from herbarium specimens, (iv) target capture of multi-locus nuclear sequences from herbarium specimens, (v) RAD-sequencing of bird specimens and (vi) shotgun sequencing of ancient bovid bone samples.
RESUMEN
DNA barcoding and metabarcoding provide new avenues for investigating biological systems. These techniques require well-curated reference libraries with extensive coverage. Generating an exhaustive national DNA barcode reference library can open up new avenues of research in ecology, evolution and conservation, yet few studies to date have created such a resource. In plant DNA barcoding, herbarium collections provide taxonomically robust material but also pose challenges in lab processing. Here, we present a national DNA barcoding resource covering all of the native flowering plants and conifers of the United Kingdom. This represents 1,482 plant species, with the majority of specimens (81%) sourced from herbaria. Using Sanger sequencing of the plant DNA barcode markers, rbcL, matK, and ITS2, at least one DNA barcode was retrieved from 98% of the UK flora. We sampled from multiple individuals, resulting in a species coverage for rbcL of 96% (4,477 sequences), 90% for matK (3,259 sequences) and 75% for ITS2 (2,585 sequences). Sequence recovery was lower for herbarium material compared to fresh collections, with the age of the specimen having a significant effect on the success of sequence recovery. Species level discrimination was highest with ITS2, however, the ability to successfully retrieve a sequence was lowest for this region. Analyses of the genetic distinctiveness of species across a complete flora showed DNA barcoding to be informative for all but the most taxonomically complex groups. The UK flora DNA barcode reference library provides an important resource for many applications that require plant identification from DNA.
Asunto(s)
Código de Barras del ADN Taxonómico , Magnoliopsida , Tracheophyta , ADN de Plantas/genética , Magnoliopsida/clasificación , Magnoliopsida/genética , Tracheophyta/clasificación , Tracheophyta/genética , Reino UnidoRESUMEN
Various historical processes have been put forth as drivers of patterns in the spatial distribution of Amazonian trees and their population genetic variation. We tested whether five widespread tree species show congruent phylogeographic breaks and similar patterns of demographic expansion, which could be related to proposed Pleistocene refugia or the presence of geological arches in western Amazonia. We sampled Otoba parvifolia/glycycarpa (Myristicaceae), Clarisia biflora, Poulsenia armata, Ficus insipida (all Moraceae), and Jacaratia digitata (Caricaceae) across the western Amazon Basin. Plastid DNA (trnH-psbA; 674 individuals from 34 populations) and nuclear ribosomal internal transcribed spacers (ITS; 214 individuals from 30 populations) were sequenced to assess genetic diversity, genetic differentiation, population genetic structure, and demographic patterns. Overall genetic diversity for both markers varied among species, with higher values in populations of shade-tolerant species than in pioneer species. Spatial analysis of molecular variance (SAMOVA) identified three genetically differentiated groups for the plastid marker for each species, but the areas of genetic differentiation were not concordant among species. Fewer SAMOVA groups were found for ITS, with no detectable genetic differentiation among populations in pioneers. The lack of spatially congruent phylogeographic breaks across species suggests no common biogeographic history of these Amazonian tree species. The idiosyncratic phylogeographic patterns of species could be due instead to species-specific responses to geological and climatic changes. Population genetic patterns were similar among species with similar biological features, indicating that the ecological characteristics of species impact large-scale phylogeography.