RESUMO
Brownfields are unused sites that contain hazardous substances due to previous commercial or industrial use. The sites are inhospitable for many organisms, but some fungi and microbes can tolerate and thrive in the nutrient-depleted and contaminated soils. However, few studies have characterized the impacts of long-term contamination on soil microbiome composition and diversity at brownfields. This study focuses on an urban brownfield-a former rail yard in Los Angeles that is contaminated with heavy metals, volatile organic compounds, and petroleum-derived pollutants. We anticipate that heavy metals and organic pollutants will shape soil microbiome diversity and that several candidate fungi and bacteria will be tolerant to the contaminants. We sequence three gene markers (16S ribosomal RNA, 18S ribosomal RNA, and the fungal internal transcribed spacer (FITS)) in 55 soil samples collected at five depths to (1) profile the composition of the soil microbiome across depths; (2) determine the extent to which hazardous chemicals predict microbiome variation; and (3) identify microbial taxonomic groups that may metabolize these contaminants. Detected contaminants in the samples included heavy metals, petroleum hydrocarbons, polycyclic aromatic hydrocarbons, and volatile organic compounds. Bacterial, eukaryotic, and fungal communities all varied with depth and with concentrations of arsenic, chromium, cobalt, and lead. 18S rRNA microbiome richness and fungal richness were positively correlated with lead and cobalt levels, respectively. Furthermore, bacterial Paenibacillus and Iamia, eukaryotic Actinochloris, and fungal Alternaria were enriched in contaminated soils compared to uncontaminated soils and represent taxa of interest for future bioremediation research. Based on our results, we recommend incorporating DNA-based multi-marker microbial community profiling at multiple sites and depths in brownfield site assessment standard methods and restoration.
Assuntos
Poluentes Ambientais , Metais Pesados , Microbiota , Petróleo , Poluentes do Solo , Compostos Orgânicos Voláteis , Solo/química , Compostos Orgânicos Voláteis/metabolismo , Poluentes do Solo/análise , Metais Pesados/metabolismo , Bactérias , Cobalto/metabolismo , Microbiologia do Solo , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/metabolismo , Biodegradação AmbientalRESUMO
Ecosystems globally are under threat from ongoing anthropogenic environmental change. Effective conservation management requires more thorough biodiversity surveys that can reveal system-level patterns and that can be applied rapidly across space and time. Using modern ecological models and community science, we integrate environmental DNA and Earth observations to produce a time snapshot of regional biodiversity patterns and provide multi-scalar community-level characterization. We collected 278 samples in spring 2017 from coastal, shrub, and lowland forest sites in California, a complex ecosystem and biodiversity hotspot. We recovered 16,118 taxonomic entries from eDNA analyses and compiled associated traditional observations and environmental data to assess how well they predicted alpha, beta, and zeta diversity. We found that local habitat classification was diagnostic of community composition and distinct communities and organisms in different kingdoms are predicted by different environmental variables. Nonetheless, gradient forest models of 915 families recovered by eDNA analysis and using BIOCLIM variables, Sentinel-2 satellite data, human impact, and topographical features as predictors, explained 35% of the variance in community turnover. Elevation, sand percentage, and photosynthetic activities (NDVI32) were the top predictors. In addition to this signal of environmental filtering, we found a positive relationship between environmentally predicted families and their numbers of biotic interactions, suggesting environmental change could have a disproportionate effect on community networks. Together, these analyses show that coupling eDNA with environmental predictors including remote sensing data has capacity to test proposed Essential Biodiversity Variables and create new landscape biodiversity baselines that span the tree of life.
Assuntos
DNA Ambiental , Ecossistema , Biodiversidade , California , Código de Barras de DNA Taxonômico , Monitoramento AmbientalRESUMO
The sequencing revolution requires accurate taxonomic classification of DNA sequences. Key to making accurate taxonomic assignments are curated, comprehensive reference barcode databases. However, the generation and curation of such databases has remained challenging given the large and continuously growing volumes of both DNA sequence data and novel reference barcode targets. Monitoring and research applications require a greater diversity of specialized gene regions and targeted taxa then are currently curated by professional staff. Thus there is a growing need for an easy to implement computational tool that can generate comprehensive metabarcoding reference libraries for any bespoke locus. We address this need by reimagining CRUX from the Anacapa Toolkit and present the rCRUX package in R which, like it's predecessor, relies on sequence homology and PCR primer compatibility instead of keyword-searches to avoid limitations of user-defined metadata. The typical workflow involves searching for plausible seed amplicons (get_seeds_local() or get_seeds_remote()) by simulating in silico PCR to acquire a set of sequences analogous to PCR products containing a user-defined set of primer sequences. Next, these seeds are used to iteratively blast search seed sequences against a local copy of the National Center for Biotechnology Information (NCBI) formatted nt database using a taxonomic-rank based stratified random sampling approach ( blast_seeds() ). This results in a comprehensive set of sequence matches. This database is dereplicated and cleaned (derep_and_clean_db()) by identifying identical reference sequences and collapsing the taxonomic path to the lowest taxonomic agreement across all matching reads. This results in a curated, comprehensive database of primer-specific reference barcode sequences from NCBI. Databases can then be compared (compare_db()) to determine read and taxonomic overlap. We demonstrate that rCRUX provides more comprehensive reference databases for the MiFish Universal Teleost 12S, Taberlet trnl, fungal ITS, and Leray CO1 loci than CRABS, MetaCurator, RESCRIPt, and ecoPCR reference databases. We then further demonstrate the utility of rCRUX by generating 24 reference databases for 20 metabarcoding loci, many of which lack dedicated reference database curation efforts. The rCRUX package provides a simple to use tool for the generation of curated, comprehensive reference databases for user-defined loci, facilitating accurate and effective taxonomic classification of metabarcoding and DNA sequence efforts broadly.
RESUMO
Key to making accurate taxonomic assignments are curated, comprehensive reference barcode databases. However, the generation and curation of such databases has remained challenging given the large and continuously growing volumes of DNA sequence data and novel reference barcode targets. Monitoring and research applications require a greater diversity of specialized gene regions and targeted taxa to meet taxonomic classification goals then are currently curated by professional staff. Thus, there is a growing need for an easy to implement tool that can generate comprehensive metabarcoding reference libraries for any bespoke locus. We address this need by reimagining CRUX from the Anacapa Toolkit and present the rCRUX package in R. The typical workflow involves searching for plausible seed amplicons (get_seeds_local() or get_seeds_remote()) by simulating in silico PCR to acquire seed sequences containing a user-defined primer set. Next these seeds are used to iteratively blast search seed sequences against a local NCBI formatted database using a taxonomic rank based stratified random sampling approach (blast_seeds()) that results in a comprehensive set of sequence matches. This database is dereplicated and cleaned (derep_and_clean_db()) by identifying identical reference sequences and collapsing the taxonomic path to the lowest taxonomic agreement across all matching reads. This results in a curated, comprehensive database of primer specific reference barcode sequences from NCBI. We demonstrate that rCRUX provides more comprehensive reference databases for the MiFish Universal Teleost 12S, Taberlet trnl, and fungal ITS locus than CRABS, METACURATOR, RESCRIPt, and ECOPCR reference databases. We then further demonstrate the utility of rCRUX by generating 16 reference databases for metabarcoding loci that lack dedicated reference database curation efforts. The rCRUX package provides a simple to use tool for the generation of curated, comprehensive reference databases for user-defined loci, facilitating accurate and effective taxonomic classification of metabarcoding and DNA sequence efforts broadly.
RESUMO
Environmental DNA (eDNA) metabarcoding has gained growing attention as a strategy for monitoring biodiversity in ecology. However, taxa identifications produced through metabarcoding require sophisticated processing of high-throughput sequencing data from taxonomically informative DNA barcodes. Various sets of universal and taxon-specific primers have been developed, extending the usability of metabarcoding across archaea, bacteria and eukaryotes. Accordingly, a multitude of metabarcoding data analysis tools and pipelines have also been developed. Often, several developed workflows are designed to process the same amplicon sequencing data, making it somewhat puzzling to choose one among the plethora of existing pipelines. However, each pipeline has its own specific philosophy, strengths and limitations, which should be considered depending on the aims of any specific study, as well as the bioinformatics expertise of the user. In this review, we outline the input data requirements, supported operating systems and particular attributes of thirty-two amplicon processing pipelines with the goal of helping users to select a pipeline for their metabarcoding projects.
Assuntos
Dermatite Atópica/imunologia , Dermatite Atópica/microbiologia , Microbiota/imunologia , Pele/imunologia , Pele/microbiologia , Adolescente , Adulto , Fatores Etários , Estudos de Casos e Controles , Criança , Pré-Escolar , Dermatite Atópica/patologia , Feminino , Humanos , Masculino , Microbiota/genética , Pessoa de Meia-Idade , Pele/patologia , Adulto JovemRESUMO
Environmental DNA (eDNA) metabarcoding is a powerful tool that can enhance marine ecosystem/biodiversity monitoring programs. Here we outline five important steps managers and researchers should consider when developing eDNA monitoring program: (1) select genes and primers to target taxa; (2) assemble or develop comprehensive barcode reference databases; (3) apply rigorous site occupancy based decontamination pipelines; (4) conduct pilot studies to define spatial and temporal variance of eDNA; and (5) archive samples, extracts, and raw sequence data. We demonstrate the importance of each of these considerations using a case study of eDNA metabarcoding in the Ports of Los Angeles and Long Beach. eDNA metabarcoding approaches detected 94.1% (16/17) of species observed in paired trawl surveys while identifying an additional 55 native fishes, providing more comprehensive biodiversity inventories. Rigorous benchmarking of eDNA metabarcoding results improved ecological interpretation and confidence in species detections while providing archived genetic resources for future analyses. Well designed and validated eDNA metabarcoding approaches are ideally suited for biomonitoring applications that rely on the detection of species, including mapping invasive species fronts and endangered species habitats as well as tracking range shifts in response to climate change. Incorporating these considerations will enhance the utility and efficacy of eDNA metabarcoding for routine biomonitoring applications.
Assuntos
DNA Ambiental , Ecossistema , DNA Ambiental/genética , Código de Barras de DNA Taxonômico/métodos , Monitoramento Ambiental/métodos , BiodiversidadeRESUMO
DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross-validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus-specific validation efforts.
Assuntos
DNA Ambiental , Ecossistema , Animais , Biodiversidade , Código de Barras de DNA Taxonômico , Peixes/genética , Água do MarRESUMO
BACKGROUND: Avian influenza virus (AIV) is an important public health issue because pandemic influenza viruses in people have contained genes from viruses that infect birds. The H5 and H7 AIV subtypes have periodically mutated from low pathogenicity to high pathogenicity form. Analysis of the geographic distribution of AIV can identify areas where reassortment events might occur and how high pathogenicity influenza might travel if it enters wild bird populations in the US. Modelling the number of AIV cases is important because the rate of co-infection with multiple AIV subtypes increases with the number of cases and co-infection is the source of reassortment events that give rise to new strains of influenza, which occurred before the 1968 pandemic. Aquatic birds in the orders Anseriformes and Charadriiformes have been recognized as reservoirs of AIV since the 1970s. However, little is known about influenza prevalence in terrestrial birds in the order Passeriformes. Since passerines share the same habitat as poultry, they may be more effective transmitters of the disease to humans than aquatic birds. We analyze 152 passerine species including the American Robin (Turdus migratorius) and Swainson's Thrush (Catharus ustulatus). METHODS: We formulate a regression model to predict AIV cases throughout the US at the county scale as a function of 12 environmental variables, sampling effort, and proximity to other counties with influenza outbreaks. Our analysis did not distinguish between types of influenza, including low or highly pathogenic forms. RESULTS: Analysis of 13,046 cloacal samples collected from 225 bird species in 41 US states between 2005 and 2008 indicates that the average prevalence of influenza in passerines is greater than the prevalence in eight other avian orders. Our regression model identifies the Great Plains and the Pacific Northwest as high-risk areas for AIV. Highly significant predictors of AIV include the amount of harvested cropland and the first day of the year when a county is snow free. CONCLUSIONS: Although the prevalence of influenza in waterfowl has long been appreciated, we show that 22 species of song birds and perching birds (order Passeriformes) are influenza reservoirs in the contiguous US.
Assuntos
Vírus da Influenza A/classificação , Vírus da Influenza A/isolamento & purificação , Influenza Aviária/epidemiologia , Influenza Aviária/virologia , Passeriformes/virologia , Medição de Risco , Animais , Cloaca/virologia , Geografia , Modelos Estatísticos , Prevalência , Estados UnidosRESUMO
The microbiome plays an important role in human physiology. The composition of the human microbiome has been described at the phylum, class, genus, and species levels, however, it is largely unknown at the strain level. The importance of strain-level differences in microbial communities has been increasingly recognized in understanding disease associations. Current methods for identifying strain populations often require deep metagenomic sequencing and a comprehensive set of reference genomes. In this study, we developed a method, metagenomic multi-locus sequence typing (MG-MLST), to determine strain-level composition in a microbial community by combining high-throughput sequencing with multi-locus sequence typing (MLST). We used a commensal bacterium, Propionibacterium acnes, as an example to test the ability of MG-MLST in identifying the strain composition. Using simulated communities, MG-MLST accurately predicted the strain populations in all samples. We further validated the method using MLST gene amplicon libraries and metagenomic shotgun sequencing data of clinical skin samples. MG-MLST yielded consistent results of the strain composition to those obtained from nearly full-length 16S rRNA clone libraries and metagenomic shotgun sequencing analysis. When comparing strain-level differences between acne and healthy skin microbiomes, we demonstrated that strains of RT2/6 were highly associated with healthy skin, consistent with previous findings. In summary, MG-MLST provides a quantitative analysis of the strain populations in the microbiome with diversity and richness. It can be applied to microbiome studies to reveal strain-level differences between groups, which are critical in many microorganism-related diseases.
RESUMO
All organisms release their DNA into the environment through processes such as excretion and the senescence of tissues and limbs. This DNA, often referred to as environmental DNA (eDNA) or sedimentary ancient DNA (sedaDNA), can be recovered from both present-day and ancient soils, fecal samples, bodies of water and lake cores, and even air. While eDNA is a potentially useful record of past and present biodiversity, several challenges complicate data generation and interpretation of results. Most importantly, eDNA samples tend to be highly taxonomically mixed, and the target organism or group of organisms may be present at very low abundance within this mixture. To overcome this challenge, enrichment approaches are often used to target specific taxa of interest. Here, we describe a protocol to amplify metabarcodes or short, variable loci that identify lineages within broad taxonomic groups (e.g., plants, mammals), using the polymerase chain reaction (PCR) with established generic "barcode" primers. We also provide a catalog of animal and plant barcode primers that, because they target relatively short fragments of DNA, are potentially suitable for use with degraded DNA.
Assuntos
Código de Barras de DNA Taxonômico/métodos , DNA Antigo/análise , Monitoramento Ambiental/métodos , Sedimentos Geológicos/análise , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Técnicas de Amplificação de Ácido Nucleico/métodos , Reação em Cadeia da Polimerase/métodos , Animais , HumanosRESUMO
Environmental DNA (eDNA) metabarcoding is becoming a core tool in ecology and conservation biology, and is being used in a growing number of education, biodiversity monitoring, and public outreach programs in which professional research scientists engage community partners in primary research. Results from eDNA analyses can engage and educate natural resource managers, students, community scientists, and naturalists, but without significant training in bioinformatics, it can be difficult for this diverse audience to interact with eDNA results. Here we present the R package ranacapa, at the core of which is a Shiny web app that helps perform exploratory biodiversity analyses and visualizations of eDNA results. The app requires a taxonomy-by-sample matrix and a simple metadata file with descriptive information about each sample. The app enables users to explore the data with interactive figures and presents results from simple community ecology analyses. We demonstrate the value of ranacapa to two groups of community partners engaging with eDNA metabarcoding results.
Assuntos
DNA/análise , Meio Ambiente , Internet , Software , Estatística como Assunto , Biodiversidade , Currículo , Código de Barras de DNA Taxonômico , Microbiologia/educação , Análise de Componente PrincipalRESUMO
Propionibacterium acnes is a major skin commensal and is associated with acne vulgaris, the most common skin disease. Here we report the draft genome sequences of two P. acnes strains, the type strain ATCC6919 and an antibiotic-resistant strain, HL411PA1.
RESUMO
Efficient influenza A viral surveillance of wild and domestic birds requires rapid viral detection and quantitation of high and low quality samples. Current influenza A qPCR-based detection protocols specified by CDC, OIE and USDA utilize fluorogenic hydrolysis probe based real-time reverse transcription PCR (RRT-PCR) assays for detection and quantitation. The sequence diversity of this virus, even in the conserved matrix gene M1, makes primer and probe designs challenging. In this report it was determined that false RRT-PCR positives are possible with this method. This is particularly problematic when surveying non-cultured or inactivated avian tracheal and cloacal mucosal samples with low concentrations of virus and large proportions of background nucleic acids. This report presents a modification of a one-step RRT-PCR detection method for influenza A using SYBR green intercalating dye-based target amplification detection. High Resolution Melting (HRM), amplicon size quantitation and sequence verification is used to screen for non-target amplification (false positives). The resulting protocol has the sensitivity of hydrolysis probe methods, allows for flexible primer design and verification of target amplification, and provides high confidence in positive results. A multiplex subtype detection method using the RRT-PCR HRM assay is also demonstrated. Overall, this method is both time and cost effective while providing an extra measure of confidence in surveillance results through the implementation of target verification.