ABSTRACT
The spillover of viruses from wildlife into agricultural animals or humans has profound socioeconomic and public health impact. Vampire bats, found throughout South America, feed directly on humans and other animals and are an important reservoir for zoonotic viruses, including rabies virus. This has resulted in considerable effort in understanding both the ecology of bat-borne viruses and the composition and associated correlates of the structure of entire virus communities in wildlife, particularly in the context of disease control interventions. In a From the Cover article in this issue of Molecular Ecology, Bergner et al. (2019) set out to reveal virus community dynamics in vampire bats by interrogating factors that affect the structure, diversity and richness of these communities. Due to the linkage of metagenomic sequence data with community ecology, this study represents an important advance in the field of virus ecology.
Subjects
Biota, Chiroptera/virology, Ecology, Metagenomics, Viruses/genetics, Animals, Humans, Rabies virus/genetics
ABSTRACT
Bacteria that engage in long-standing associations with particular hosts are expected to evolve host-specific adaptations that limit their capacity to thrive in other environments. Consistent with this, many gut symbionts seem to have a limited host range, based on community profiling and phylogenomics. However, few studies have experimentally investigated host specialization of gut symbionts and the underlying mechanisms have largely remained elusive. Here, we studied host specialization of a dominant gut symbiont of social bees, Lactobacillus Firm5. We show that Firm5 strains isolated from honey bees and bumble bees separate into deep-branching host-specific phylogenetic lineages. Despite their divergent evolution, colonization experiments show that bumble bee strains are capable of colonizing the honey bee gut. However, they were less successful than honey bee strains, and competition with honey bee strains completely abolished their colonization. In contrast, honey bee strains of divergent phylogenetic lineages were able to coexist within individual bees. This suggests that both host selection and interbacterial competition play important roles in host specialization. Using comparative genomics of 27 Firm5 isolates, we found that the genomes of honey bee strains harbour more carbohydrate-related functions than bumble bee strains, possibly providing a competitive advantage in the honey bee gut. Remarkably, most of the genes encoding carbohydrate-related functions were not conserved among the honey bee strains, which suggests that honey bees can support a metabolically more diverse community of Firm5 strains than bumble bees. These findings advance our understanding of the genomic changes underlying host specialization.
Subjects
Bees/microbiology, Gastrointestinal Microbiome/physiology, Bacterial Genome, Lactobacillus/genetics, Symbiosis/genetics, Animals, Bacteriocins/genetics, Bacterial Genes, Glycoside Hydrolases/genetics, Lactobacillus/isolation & purification, Phylogeny, Switzerland
ABSTRACT
We recently published the first genomic diversity study of Trypanosoma congolense, a major aetiological agent of Animal African Trypanosomiasis. We demonstrated striking levels of SNP and indel diversity in the Eastern province of Zambia as a consequence of hybridization between divergent trypanosome lineages. We concluded that these and earlier findings in T. congolense challenge the predominant clonal evolution (PCE) model. In a recent comment, Tibayrenc and Ayala claim that there are many features in T. congolense supporting their theory of clonality. While we can follow the reasoning of the authors, we also identify major limitations in their theory and interpretations that resulted in incorrect conclusions. First, we argue that each T. congolense subgroup should be analysed independently as they may represent different (sub)species rather than "near-clades". Second, the authors neglect major findings of two robust population genetic studies on Savannah T. congolense that provide clear evidence of frequent recombination. Third, we reveal additional events of introgressive hybridization in T. congolense by analysing the maxicircle coding region using next-generation sequencing analyses. Finally, we pinpoint two important misinterpretations by the authors and show that there are no spatially and temporally widespread clones in T. congolense. We stand by our earlier conclusions that the clonal framework is unlikely to accurately model the population structure of T. congolense. Other theoretical frameworks such as Maynard Smith's epidemic model may better represent the complex ancestry seen in T. congolense, where clones delimited in space and time arise against a background of recombination.
Subjects
Trypanosoma congolense, African Trypanosomiasis, Animals, Clonal Evolution, Genomics, Zambia
ABSTRACT
Providing science and society with an integrated, up-to-date, high quality, open, reproducible and sustainable plant tree of life would be a huge service that is now coming within reach. However, synthesizing the growing body of DNA sequence data in the public domain and disseminating the trees to a diverse audience are often not straightforward due to numerous informatics barriers. While big synthetic plant phylogenies are being built, they remain static and become quickly outdated as new data are published and tree-building methods improve. Moreover, the body of existing phylogenetic evidence is hard to navigate and access for non-experts. We propose that our community of botanists, tree builders, and informaticians should converge on a modular framework for data integration and phylogenetic analysis, allowing easy collaboration, updating, data sourcing and flexible analyses. With support from major institutions, this pipeline should be re-run at regular intervals, storing trees and their metadata long-term. Providing the trees to a diverse global audience through user-friendly front ends and application development interfaces should also be a priority. Interactive interfaces could be used to solicit user feedback and thus improve data quality and to coordinate the generation of new data. We conclude by outlining a number of steps that we suggest the scientific community should take to achieve global phylogenetic synthesis.
Subjects
Information Dissemination, Information Management, Phylogeny, Plants/genetics, Plant DNA, Humans, Information Technology, DNA Sequence Analysis
ABSTRACT
Major gaps remain in our understanding of the ecology, evolution, biodiversity, biogeography, extinction risk, and adaptive potential of reef building corals. One of the central challenges remains that there are few informative genetic markers for studying boundaries between species, and variation within species. Reduced representation sequencing approaches, such as RADseq (Restriction site Associated DNA sequencing) have great potential for resolving such relationships. However, it is necessary to identify loci in order to make inferences for endosymbiotic organisms such as corals. Here, we examined twenty-one coral holobiont ezRAD libraries from Hawai'i, focusing on P. lobata and P. compressa, two species with contrasting morphology and habitat preference that previous studies have not resolved. We used a combination of de novo assembly and reference mapping approaches to identify and compare loci: we used reference mapping to extract and compare nearly complete mitochondrial genomes, ribosomal arrays, and histone genes. We used de novo clustering and phylogenomic methods to compare the complete holobiont data set with coral and symbiont subsets that map to transcriptomic data. In addition, we used reference assemblies to examine genetic structure from SNPs (Single Nucleotide Polymorphisms). All approaches resolved outgroup taxa but failed to resolve P. lobata and P. compressa as distinct, with mito-nuclear discordance and shared mitochondrial haplotypes within the species complex. The holobiont and 'coral transcriptomic' datasets were highly concordant, revealing stronger genetic structure between sites than between coral morphospecies. These results suggest that either branching morphology is a polymorphic trait, or that these species frequently hybridize. This study provides examples of several approaches to acquire, identify, and compare loci across metagenomic samples such as the coral holobiont while providing insights into the nature of coral variability.
Subjects
Anthozoa/genetics, Gene Flow/genetics, Genomics/methods, Genetic Hybridization, Animals, Mitochondrial Genome, Geography, Hawaii, Likelihood Functions, Phenotype, Phylogeny, Single Nucleotide Polymorphism/genetics, Principal Component Analysis, Sequence Alignment, Species Specificity
ABSTRACT
Ectomycorrhizal (ECM) fungi, symbiotic mutualists of many dominant tree and shrub species, exhibit a biogeographic pattern counter to the established latitudinal diversity gradient of most macroflora and fauna. However, an evolutionary basis for this pattern has not been explicitly tested in a diverse lineage. In this study, we reconstructed a mega-phylogeny of a cosmopolitan and hyperdiverse genus of ECM fungi, Russula, sampling from annotated collections and utilizing publicly available sequences deposited in GenBank. Metadata from molecular operational taxonomic unit cluster sets were examined to infer the distribution and plant association of the genus. This allowed us to test for differences in patterns of diversification between tropical and extratropical taxa, as well as how their associations with different plant lineages may be a driver of diversification. Results show that Russula is most species-rich at temperate latitudes and ancestral state reconstruction shows that the genus initially diversified in temperate areas. Migration into and out of the tropics characterizes the early evolution of the genus, and these transitions have been frequent since this time. We propose the 'generalized diversification rate' hypothesis to explain the reversed latitudinal diversity gradient pattern in Russula as we detect a higher net diversification rate in extratropical lineages. Patterns of diversification with plant associates support host switching and host expansion as driving diversification, with a higher diversification rate in lineages associated with Pinaceae and frequent transitions to association with angiosperms.
Subjects
Basidiomycota/classification, Biological Evolution, Mycorrhizae/classification, Phylogeny, Climate, Cluster Analysis, Fungal DNA/genetics, Likelihood Functions, Magnoliopsida/microbiology, Phylogeography, DNA Sequence Analysis
ABSTRACT
As global exploitation of available resources increases, operations extend towards sensitive and previously protected ecosystems. It is important to monitor such areas in order to detect, understand and remediate environmental responses to stressors. The natural heterogeneity and complexity of communities means that accurate monitoring requires high resolution, both temporally and spatially, as well as more complete assessments of taxa. Increased resolution and taxonomic coverage is economically challenging using current microscopy-based monitoring practices. Alternatively, DNA sequencing-based methods have been suggested for cost-efficient monitoring, offering additional insights into ecosystem function and disturbance. Here, we applied DNA metabarcoding of eukaryotic communities in marine sediments, in areas of offshore drilling on the Norwegian continental shelf. Forty-five samples, collected from seven drilling sites in the Troll/Oseberg region, were assessed, using the small subunit ribosomal RNA gene as a taxonomic marker. In agreement with results based on classical morphology-based monitoring, we were able to identify changes in sediment communities surrounding oil platforms. In addition to overall changes in community structure, we identified several potential indicator taxa, responding to pollutants associated with drilling fluids. These included the metazoan orders Macrodasyida, Macrostomida and Ceriantharia, as well as several ciliates and other protist taxa, typically not targeted by environmental monitoring programmes. Analysis of a co-occurrence network to study the distribution of taxa across samples provided a framework for better understanding the impact of anthropogenic activities on the benthic food web, generating novel, testable hypotheses of trophic interactions structuring benthic communities.
Subjects
Biodiversity, Taxonomic DNA Barcoding, Environmental Monitoring, Oil and Gas Fields, Animals, Ciliophora, Ecosystem, Food Chain, Geologic Sediments
ABSTRACT
It has long been known that adaptive evolution can occur through genetic mutations in DNA sequence, but it is unclear whether adaptive evolution can occur through analogous epigenetic mechanisms, such as through DNA methylation. If epigenetic variation contributes directly to evolution, species under threat of disease, invasive competition, climate change or other stresses would have greater stores of variation from which to draw. We looked for evidence of natural selection acting on variably methylated DNA sites using population genomic analysis across three climatologically distinct populations of valley oaks. We found patterns of genetic and epigenetic differentiations that indicate local adaptation is operating on large portions of the oak genome. While CHG methyl polymorphisms are not playing a significant role and would make poor targets for natural selection, our findings suggest that CpG methyl polymorphisms as a whole are involved in local adaptation, either directly or through linkage to regions under selection.
Subjects
Physiological Adaptation/genetics, CpG Islands, DNA Methylation, Population Genetics, Quercus/genetics, California, Plant DNA/genetics, Genetic Epigenesis, Genetic Polymorphism, Genetic Selection, DNA Sequence Analysis
ABSTRACT
Insect guts are often colonized by multispecies microbial communities that play integral roles in nutrition, digestion and defence. Community composition can differ across host species with increasing dietary and genetic divergence, yet gut microbiota can also vary between conspecific hosts and across an individual's lifespan. Through exploration of such intraspecific variation and its correlates, molecular profiling of microbial communities can generate and test hypotheses on the causes and consequences of symbioses. In this study, we used 454 pyrosequencing and TRFLP to achieve these goals in an herbivorous ant, Cephalotes varians, exploring variation in bacterial communities across colonies, populations and workers reared on different diets. C. varians bacterial communities were dominated by 16 core species present in over two-thirds of the sampled colonies. Core species comprised multiple genotypes, or strains, and hailed from ant-specific clades containing relatives from other Cephalotes species. Yet three were detected in environmental samples, suggesting the potential for environmental acquisition. In spite of their prevalence and long-standing relationships with Cephalotes ants, the relative abundance and genotypic composition of core species varied across colonies. Diet-induced plasticity is a likely cause, but only pollen-based diets had consistent effects, altering the abundance of two types of bacteria. Additional factors, such as host age, genetics, chance or natural selection, must therefore shape natural variation. Future studies on these possibilities and on bacterial contributions to the use of pollen, a widespread food source across Cephalotes, will be important steps in developing C. varians as a model for studying widespread social insect-bacteria symbioses.
Subjects
Ants/microbiology, Bacteria/classification, Digestive System/microbiology, Symbiosis, Animals, Bacteria/genetics, Bacterial DNA/genetics, Diet, Environment, Florida, Genotype, Molecular Sequence Data, Phylogeny, Restriction Fragment Length Polymorphism, 16S Ribosomal RNA/genetics, DNA Sequence Analysis
ABSTRACT
Gametic isolation is thought to play an important role in the evolution of reproductive isolation in broadcast-spawning marine invertebrates. However, it is unclear whether gametic isolation commonly evolves early in the speciation process or only accumulates after other reproductive barriers are already in place. It is also unknown whether gametic isolation is an effective barrier to introgression following speciation. Here, we used whole-genome sequencing data and multiple complementary phylogenomic approaches to test whether the well-documented gametic incompatibilities among the strongylocentrotid sea urchins have limited introgression. We quantified phylogenetic discordance, inferred reticulate phylogenetic networks, and applied the Δ statistic using gene tree topologies reconstructed from multiple sequence alignments of protein-coding single-copy orthologs. In addition, we conducted ABBA-BABA tests on genome-wide single nucleotide variants and reconstructed a phylogeny of mitochondrial genomes. Our results revealed strong mito-nuclear discordance and considerable nonrandom gene tree discordance that cannot be explained by incomplete lineage sorting alone. Eight of the nine species examined demonstrated a history of introgression with at least one other species or ancestral lineage, indicating that introgression was common during the diversification of the strongylocentrotid urchins. There was strong support for introgression between four extant species pairs (Strongylocentrotus pallidus ↔ S. droebachiensis, S. intermedius ↔ S. pallidus, S. purpuratus ↔ S. fragilis, and Mesocentrotus franciscanus ↔ Pseudocentrotus depressus) and additional evidence for introgression on internal branches of the phylogeny. Our results suggest that the existing gametic incompatibilities among the strongylocentrotid urchin species have not been a complete barrier to hybridization and introgression following speciation. Their continued divergence in the face of widespread introgression indicates that other reproductive isolating barriers likely exist and may have been more critical in establishing reproductive isolation early in speciation.
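For readers unfamiliar with the ABBA-BABA test named in this abstract, the following is a minimal Python sketch of Patterson's D computed from site-pattern counts for a four-taxon arrangement (((P1, P2), P3), outgroup). The function name `patterson_d` and the toy sequences are illustrative assumptions, not the authors' pipeline. Real analyses work on genome-wide variant calls and assess significance with a block jackknife, which is omitted here.

```python
def patterson_d(p1, p2, p3, outgroup):
    """Return (D, n_abba, n_baba) for four aligned sequences of equal length.

    The outgroup base is treated as the ancestral allele "A" and any
    other base as the derived allele "B".
    """
    if not (len(p1) == len(p2) == len(p3) == len(outgroup)):
        raise ValueError("sequences must be aligned to the same length")

    abba = baba = 0
    for a, b, c, o in zip(p1, p2, p3, outgroup):
        # Skip gaps and ambiguity codes.
        if any(base not in "ACGT" for base in (a, b, c, o)):
            continue
        if a == o and b == c and b != o:
            abba += 1  # P2 and P3 share the derived allele
        elif b == o and a == c and a != o:
            baba += 1  # P1 and P3 share the derived allele

    total = abba + baba
    d = (abba - baba) / total if total else 0.0
    return d, abba, baba


# Toy alignment (hypothetical data): three ABBA sites, one BABA site.
d, n_abba, n_baba = patterson_d("AAACA", "CCCAA", "CCCCA", "AAAAA")
print(d, n_abba, n_baba)
```

Under incomplete lineage sorting alone the two patterns are expected in equal numbers, so D should be close to zero; a consistent excess of one pattern is the signal usually read as introgression between P3 and either P1 or P2.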
ABSTRACT
The Holy Grail of an Insect Tree of Life can only be 'discovered' through extensive collaboration among taxon specialists, phylogeneticists and centralized frameworks such as Open Tree of Life, but insufficient effort from stakeholders has so far hampered this promising approach. The resultant unavailability of synthesis phylogenies is an unfortunate situation given the numerous practical usages of phylogenies in the near term and against the backdrop of the ongoing biodiversity crisis. To resolve this issue, we establish a new online hub that centralizes the collation of relevant phylogenetic data and provides the resultant synthesis molecular phylogenies. This is achieved through key developments in a proposed pipeline for the construction of a species-level insect phylogeny. The functionality of the framework is demonstrated through the construction of a highly supported, species-comprehensive phylogeny of Diptera, built from integrated omics data, COI DNA barcodes, and a compiled database of over 100 standardized, published Diptera phylogenies. Machine-readable forms of the phylogeny (and subsets thereof) are publicly available at insectphylo.org, a new public repository for species-comprehensive phylogenies for biological research.
Subjects
Diptera, Insects, Animals, Phylogeny, Insects/genetics, Diptera/genetics, DNA, Biodiversity
ABSTRACT
Microbiome studies are often limited by a lack of statistical power due to small sample sizes and a large number of features. This problem is exacerbated in correlative studies of multi-omic datasets. Statistical power can be increased by finding and summarizing modules of correlated observations, which is one dimensionality reduction method. Additionally, modules provide biological insight as correlated groups of microbes can have relationships among themselves. To address these challenges, we developed SCNIC: Sparse Cooccurrence Network Investigation for compositional data. SCNIC is open-source software that can generate correlation networks and detect and summarize modules of highly correlated features. Modules can be formed using either the Louvain Modularity Maximization (LMM) algorithm or a Shared Minimum Distance algorithm (SMD) that we newly describe here and relate to LMM using simulated data. We applied SCNIC to two published datasets and we achieved increased statistical power and identified microbes that not only differed across groups, but also correlated strongly with each other, suggesting shared environmental drivers or cooperative relationships among them. SCNIC provides an easy way to generate correlation networks, identify modules of correlated features and summarize them for downstream statistical analysis. Although SCNIC was designed considering properties of microbiome data, such as compositionality and sparsity, it can be applied to a variety of data types including metabolomics data and used to integrate multiple data types. SCNIC allows for the identification of functional microbial relationships at scale while increasing statistical power through feature reduction.
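The idea of finding and summarizing modules of correlated features can be illustrated with a short Python sketch. This is not SCNIC's code, and it implements neither LMM nor the SMD algorithm as published: it is a simplified greedy grouping in which a feature joins a module only if its correlation with every current member reaches a threshold. Pearson correlation is used as a stand-in for a compositionality-aware measure, and the names `find_modules`, `summarize` and `min_r` are assumptions made for illustration.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def find_modules(table, min_r=0.8):
    """Greedily group features whose pairwise correlations are all >= min_r.

    table maps a feature name to its list of abundances across samples.
    Only modules with at least two members are returned.
    """
    names = sorted(table)
    r = {frozenset(pair): pearson(table[pair[0]], table[pair[1]])
         for pair in combinations(names, 2)}
    modules = []
    for name in names:
        for module in modules:
            if all(r[frozenset((name, member))] >= min_r for member in module):
                module.append(name)
                break
        else:
            modules.append([name])  # no compatible module, start a new one
    return [m for m in modules if len(m) > 1]


def summarize(table, modules):
    """Collapse each module into one feature by summing its members per sample."""
    return {f"module_{i}": [sum(vals) for vals in zip(*(table[f] for f in m))]
            for i, m in enumerate(modules)}


# Hypothetical abundance table: features "a" and "b" rise together.
table = {"a": [1, 2, 3, 4], "b": [2, 4, 6, 8], "c": [4, 3, 2, 1], "d": [3, 1, 4, 1]}
modules = find_modules(table)
print(modules, summarize(table, modules))
```

Collapsing a module into a single summed feature reduces the number of hypotheses tested downstream, which is the route to increased statistical power that the abstract describes.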
Subjects
Microbiota, Software, Algorithms
ABSTRACT
The availability of phylogenetic data has greatly expanded in recent years. As a result, a new era in phylogenetic analysis is dawning, one in which the methods we use to analyse and assess our data are the bottleneck to producing valuable phylogenetic hypotheses, rather than the need to acquire more data. This makes the ability to accurately appraise and evaluate new methods of phylogenetic analysis and phylogenetic artefact identification more important than ever. Incongruence in phylogenetic reconstructions based on different datasets may be due to two major sources: biological and methodological. Biological sources comprise processes like horizontal gene transfer, hybridization and incomplete lineage sorting, while methodological ones include falsely assigned data or violations of the assumptions of the underlying model. While the former provides interesting insights into the evolutionary history of the investigated groups, the latter should be avoided or minimized as far as possible. However, errors introduced by methodology must first be excluded or minimized to be able to conclude that biological sources are the cause. Fortunately, a variety of useful tools exist to help detect such misassignments and model violations and to apply ameliorating measures. Still, the number of methods and their theoretical underpinning can be overwhelming and opaque. Here, we present a practical and comprehensive review of recent developments in techniques to detect artefacts arising from model violations and poorly assigned data. The advantages and disadvantages of the different methods to detect such misleading signals in phylogenetic reconstructions are also discussed. As there is no one-size-fits-all solution, this review can serve as a guide in choosing the most appropriate detection methods depending on both the actual dataset and the computational power available to the researcher. Ultimately, this informed selection will have a positive impact on the broader field, allowing us to better understand the evolutionary history of the group of interest.
ABSTRACT
Despite reproducing without sexual recombination, Meloidogyne incognita is an adaptive and versatile phytoparasitic nematode. This species displays a global distribution, can parasitize a large range of plants, and can overcome plant resistance in a few generations. The mechanisms underlying this adaptability remain poorly known. At the whole-genome level, only a few single nucleotide variations have been observed across different geographical isolates with distinct ranges of compatible hosts. Exploring other factors possibly involved in genomic plasticity is thus important. Transposable elements (TEs), by their repetitive nature and mobility, can passively and actively impact the genome dynamics. This is particularly expected in polyploid hybrid genomes such as that of M. incognita. Here, we have annotated the TE content of M. incognita, analyzed the statistical properties of this TE landscape, and used whole-genome pool-seq data to estimate the mobility of these TEs across twelve geographical isolates, presenting variations in ranges of compatible host plants. DNA transposons are more abundant than retrotransposons, and the high similarity of TE copies to their consensus sequences suggests they have been at least recently active. We have identified loci in the genome where the frequencies of presence of a TE showed substantial variations across the different isolates. Overall, variations in TE frequencies across isolates followed their phylogenetic divergence, suggesting TEs participate in the species diversification. Compared with the M. incognita reference genome, we detected isolate- and lineage-specific de novo insertion of some TEs, including within genic regions or in the upstream regulatory regions. We validated by PCR the insertion of some of these TEs inside genic regions, confirming TE movements have possible functional impacts. Overall, we show DNA transposons can drive genomic plasticity in M. incognita, and their role in the genome evolution of other parthenogenetic animals deserves further investigation.
ABSTRACT
Cancer is a potentially lethal disease, in which patients with nearly identical genetic backgrounds can develop a similar pathology through distinct combinations of genetic alterations. We aimed to reconstruct the evolutionary process underlying tumour initiation, using the combination of convergence and discrepancies observed across 2,742 cancer genomes from nine tumour types. We developed a framework using the repeatability of cancer development to score the local malignant adaptation (LMA) of genetic clones, as their potential to malignantly progress and invade their environment of origin. Using this framework, we found that premalignant skin and colorectal lesions appeared specifically adapted to their local environment, yet insufficiently for full cancerous transformation. We found that metastatic clones were more adapted to the site of origin than to the invaded tissue, suggesting that genetics may be more important for local progression than for the invasion of distant organs. In addition, we used network analyses to investigate evolutionary properties at the system-level, highlighting that different dynamics of malignant progression can be modelled by such a framework in tumour-type-specific fashion. We find that occurrence-based methods can be used to specifically recapitulate the process of cancer initiation and progression, as well as to evaluate the adaptation of genetic clones to given environments. The repeatability observed in the evolution of most tumour types could therefore be harnessed to better predict the trajectories likely to be taken by tumours and preneoplastic lesions in the future.
ABSTRACT
Vibrio vulnificus causes human sickness throughout the world via the consumption of undercooked seafood or exposure to contaminated water. Previous attempts at phylogenetic analyses of V. vulnificus have proven unsuccessful, mainly due to the poorly understood impact of factors on its divergence. In this study, we used advanced statistical and phylogenetic methods to strengthen the classification of V. vulnificus. This updated classification included the impact of geographical and host factors. The results demonstrate the existence of hierarchies and multidimensional effects in the classification of V. vulnificus, from the molecular level using biotypes, to the distributional level using geographical location, to the adaptational level through host immune response. These findings have implications for the classification of bacteria, bacterial evolution, and public health.
ABSTRACT
High-throughput sequencing using the Illumina HiSeq platform is a pervasive and critical molecular ecology resource, and has provided the data underlying many recent advances. A recent study has suggested that "index switching," where reads are misattributed to the wrong sample, may be higher in new versions of the HiSeq platform. This has the potential to invalidate both published and in-progress work across the field. Here, we test for evidence of index switching in an exemplar whole-genome shotgun data set sequenced on both the Illumina HiSeq 2500, which should not have the problem, and the Illumina HiSeq X, which may. We leverage unbalanced heterozygotes, which may be produced by index switching, and ask whether the undersequenced allele is more likely to be found in other samples in the same lane than expected based on the allele frequency. Although we validate the sensitivity of this method using simulations, we find that neither the HiSeq 2500 nor the HiSeq X has evidence of index switching. This suggests that, thankfully, index switching may not be a ubiquitous problem in HiSeq X sequence data. Lastly, we provide scripts for applying our method so that index switching can be tested for in other data sets.
Subjects
High-Throughput Nucleotide Sequencing/methods, High-Throughput Nucleotide Sequencing/standards, DNA Sequence Analysis/methods, DNA Sequence Analysis/standards, Sensitivity and Specificity
ABSTRACT
Structure_threader is a program to parallelize multiple runs of genetic clustering software that does not make use of multithreading technology (structure, fastStructure and MavericK) on multicore computers. Our approach was benchmarked across multiple systems and displayed great speed improvements relative to the single-threaded implementation, scaling very close to linearly with the number of physical cores used. Structure_threader was compared to previous software written for the same task, ParallelStructure and StrAuto, and proved to be the faster wrapper (up to 25% faster) under all tested scenarios. Furthermore, Structure_threader can perform several automatic and convenient operations, assisting the user in assessing the most biologically likely value of 'K' via implementations such as the "Evanno" or "Thermodynamic Integration" tests and automatically drawing the "meanQ" plots (static or interactive) for each value of K (or even combined plots). Structure_threader is written in Python 3 and licensed under the GPLv3. It can be downloaded free of charge at https://github.com/StuntsPT/Structure_threader.
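The general approach described here, running many independent single-threaded jobs side by side, can be sketched in a few lines of Python. This is an illustration of the pattern only and is not taken from the Structure_threader source. The helper names (`run_job`, `run_all`, `demo_cmd`) are hypothetical, and the demo command simply invokes the Python interpreter as a placeholder for whichever clustering program and arguments a user would actually supply.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_job(job):
    """Run one external command and return (K, replicate, exit code, stdout)."""
    k, rep, cmd = job
    done = subprocess.run(cmd, capture_output=True, text=True)
    return k, rep, done.returncode, done.stdout.strip()


def run_all(make_cmd, k_values, replicates, workers):
    """Launch one job per (K, replicate) pair, at most `workers` at a time.

    Threads are enough here because each thread only waits on a separate
    operating-system process, which does the actual computation.
    """
    jobs = [(k, rep, make_cmd(k, rep))
            for k in k_values
            for rep in range(1, replicates + 1)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_job, jobs))  # results keep submission order


def demo_cmd(k, rep):
    # Placeholder command (assumption): replace with the real program call.
    return [sys.executable, "-c", f"print('K={k} rep={rep}')"]


if __name__ == "__main__":
    for result in run_all(demo_cmd, k_values=[1, 2, 3], replicates=2, workers=4):
        print(result)
```

Because the runs share no state, throughput in a scheme like this is limited mainly by the number of physical cores, which is consistent with the near-linear scaling reported in the abstract.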
Subjects
Cluster Analysis, Computational Biology/methods, Software, Genotyping Techniques/methods
ABSTRACT
megasat is software that enables genotyping of microsatellite loci using next-generation sequencing data. Microsatellites are amplified in large multiplexes, and then sequenced in pooled amplicons. megasat reads sequence files and automatically scores microsatellite genotypes. It uses fuzzy matches to allow for sequencing errors and applies decision rules to account for amplification artefacts, including nontarget amplification products, replication slippage during PCR (amplification stutter) and differential amplification of alleles. An important feature of megasat is the generation of histograms of the length-frequency distributions of amplification products for each locus and each individual. These histograms, analogous to electropherograms traditionally used to score microsatellite genotypes, enable rapid evaluation and editing of automatically scored genotypes. megasat is written in Perl, runs on Windows, Mac OS X and Linux systems, and includes a simple graphical user interface. We demonstrate megasat using data from guppy, Poecilia reticulata. We genotype 1024 guppies at 43 microsatellites per run on an Illumina MiSeq sequencer. We evaluated the accuracy of automatically called genotypes using two methods, based on pedigree and repeat genotyping data, and obtained estimates of mean genotyping error rates of 0.021 and 0.012. In both estimates, three loci accounted for a disproportionate fraction of genotyping errors; conversely, 26 loci were scored with 0-1 detected error (error rate ≤0.007). Our results show that with appropriate selection of loci, automated genotyping of microsatellite loci can be achieved with very high throughput, low genotyping error and very low genotyping costs.
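A length-frequency histogram of the kind described above, together with a very simple genotype call, can be sketched in Python as follows. This is a toy illustration and not megasat's algorithm: flank matching is exact rather than fuzzy, the only decision rule is a peak-height ratio, and the names `length_histogram`, `call_genotype`, `min_depth` and `min_ratio` as well as the example reads are assumptions.

```python
from collections import Counter


def length_histogram(reads, left_flank, right_flank):
    """Count the lengths of the region between two flanking sequences."""
    counts = Counter()
    for read in reads:
        start = read.find(left_flank)
        if start == -1:
            continue  # flank missing, treat as nontarget product
        start += len(left_flank)
        end = read.find(right_flank, start)
        if end == -1:
            continue
        counts[end - start] += 1
    return counts


def call_genotype(hist, min_depth=10, min_ratio=0.3):
    """Call a diploid genotype from a length histogram.

    Returns None when depth is too low. The second-highest peak is accepted
    as a second allele only if it reaches min_ratio of the highest peak;
    smaller peaks are treated as stutter or noise.
    """
    if sum(hist.values()) < min_depth:
        return None
    ranked = sorted(hist.items(), key=lambda item: (-item[1], item[0]))
    top_len, top_n = ranked[0]
    if len(ranked) > 1 and ranked[1][1] >= min_ratio * top_n:
        return tuple(sorted((top_len, ranked[1][0])))
    return (top_len, top_len)


# Hypothetical reads for one locus in one individual: CA repeats between flanks.
reads = (["AAGG" + "CA" * 5 + "TTCC"] * 6
         + ["AAGG" + "CA" * 7 + "TTCC"] * 5
         + ["AAGG" + "CA" * 4 + "TTCC"] * 1)
hist = length_histogram(reads, "AAGG", "TTCC")
print(dict(hist), call_genotype(hist))
```

Inspecting the histogram alongside the automatic call mirrors the role the abstract assigns to these plots, which is to let a user check and, where needed, edit a scored genotype.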
Subjects
Computational Biology/methods, Genotype, Genotyping Techniques/methods, Microsatellite Repeats, Nucleic Acid Amplification Techniques/methods, DNA Sequence Analysis/methods, Animals, Poecilia/classification, Poecilia/genetics, Software
ABSTRACT
The measurement of species diversity represents a powerful tool for assessing the impacts of human activities on marine ecosystems. Traditionally, the impact of fish farming on the coastal environment is evaluated by monitoring the dynamics of macrobenthic infaunal populations. However, taxonomic sorting and morphology-based identification of the macrobenthos demand highly trained specialists and are extremely time-consuming and costly, making them unsuitable for large-scale biomonitoring efforts involving numerous samples. Here, we propose to alleviate this laborious task by developing protist metabarcoding tools based on next-generation sequencing (NGS) of environmental DNA and RNA extracted from sediment samples. In this study, we analysed the response of benthic foraminiferal communities to the variation of environmental gradients associated with salmon farms in Scotland. We investigated the foraminiferal diversity based on ribosomal minibarcode sequences generated by the Illumina NGS technology. We compared the molecular data with morphospecies counts and with environmental gradients, including distance to cages and redox used as a proxy for sediment oxygenation. Our study revealed high variations between foraminiferal communities collected in the vicinity of fish farms and at distant locations. We found evidence for species richness decrease in impacted sites, especially visible in the RNA data. We also detected some candidate bioindicator foraminiferal species. Based on this proof-of-concept study, we conclude that NGS metabarcoding using foraminifera and other protists has potential to become a new tool for surveying the impact of aquaculture and other industrial activities in the marine environment.