High-throughput sequencing of 5S-IGS in oaks: Exploring intragenomic variation and algorithms to recognize target species in pure and mixed samples.
Mol Ecol Resour
; 21(2): 495-510, 2021 Feb.
Article
em En
| MEDLINE
| ID: mdl-32997899
Measuring biological diversity is a crucial but difficult undertaking, as exemplified in oaks where complex patterns of morphological, ecological, biogeographical and genetic differentiation collide with traditional taxonomy, which measures biodiversity in number of species (or higher taxa). In this pilot study, we generated high-throughput sequencing amplicon data of the intergenic spacer of the 5S nuclear ribosomal DNA cistron (5S-IGS) in oaks, using six mock samples that differ in geographical origin, species composition and pool complexity. The potential of the marker for automated genotaxonomy applications was assessed using a reference data set of 1,770 5S-IGS cloned sequences, covering the entire taxonomic breadth and distribution range of western Eurasian Quercus, and applying similarity (blast) and evolutionary approaches (maximum-likelihood trees and Evolutionary Placement Algorithm). Both methods performed equally well, allowing correct identification of species in sections Ilex and Cerris in the pure and mixed samples, and main lineages shared by species of sect. Quercus. Application of different cut-off thresholds revealed that medium- to high-abundance (>10 or 25) sequences suffice for a net species identification of samples containing one or a few individuals. Lower thresholds identify phylogenetic correspondence with all target species in highly mixed samples (analogous to environmental bulk samples) and include rare variants pointing towards reticulation, incomplete lineage sorting, pseudogenic 5S units and in situ (natural) contamination. Our pipeline is highly promising for future assessments of intraspecific and interpopulation diversity, and of the genetic resources of natural ecosystems, which are fundamental to empower fast and solid biodiversity conservation programmes worldwide.
Palavras-chave
Texto completo:
1
Bases de dados:
MEDLINE
Assunto principal:
Genoma de Planta
/
Quercus
Idioma:
En
Revista:
Mol Ecol Resour
Ano de publicação:
2021
Tipo de documento:
Article
País de afiliação:
Itália