RESUMEN
BACKGROUND: The application of reduced metagenomic sequencing approaches holds promise as a middle ground between targeted amplicon sequencing and whole metagenome sequencing approaches but has not been widely adopted as a technique. A major barrier to adoption is the lack of read simulation software built to handle characteristic features of these novel approaches. Reduced metagenomic sequencing (RMS) produces unique patterns of fragmentation per genome that are sensitive to restriction enzyme choice, and the non-uniform size selection of these fragments may introduce novel challenges to taxonomic assignment as well as relative abundance estimates. RESULTS: Through the development and application of simulation software, readsynth, we compare simulated metagenomic sequencing libraries with existing RMS data to assess the influence of multiple library preparation and sequencing steps on downstream analytical results. Based on read depth per position, readsynth achieved 0.79 Pearson's correlation and 0.94 Spearman's correlation to these benchmarks. Application of a novel estimation approach, fixed length taxonomic ratios, improved quantification accuracy of simulated human gut microbial communities when compared to estimates of mean or median coverage. CONCLUSIONS: We investigate the possible strengths and weaknesses of applying the RMS technique to profiling microbial communities via simulations with readsynth. The choice of restriction enzymes and size selection steps in library prep are non-trivial decisions that bias downstream profiling and quantification. The simulations investigated in this study illustrate the possible limits of preparing metagenomic libraries with a reduced representation sequencing approach, but also allow for the development of strategies for producing and handling the sequence data produced by this promising application.
Asunto(s)
Metagenoma , Metagenómica , Programas Informáticos , Metagenoma/genética , Metagenómica/métodos , Humanos , Análisis de Secuencia de ADN/métodos , Microbioma Gastrointestinal/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodosRESUMEN
BACKGROUND: Muscadine grape (Vitis rotundifolia) is resistant to many of the pathogens that negatively impact the production of common grape (V. vinifera), including the bacterial pathogen Xylella fastidiosa subsp. fastidiosa (Xfsf), which causes Pierce's Disease (PD). Previous studies in common grape have indicated Xfsf delays host immune response with a complex O-chain antigen produced by the wzy gene. Muscadine cultivars range from tolerant to completely resistant to Xfsf, but the mechanism is unknown. RESULTS: We assembled and annotated a new, long-read genome assembly for 'Carlos', a cultivar of muscadine that exhibits tolerance, to build upon the existing genetic resources available for muscadine. We used these resources to construct an initial pan-genome for three cultivars of muscadine and one cultivar of common grape. This pan-genome contains a total of 34,970 synteny-constrained entries containing genes of similar structure. Comparison of resistance gene content between the 'Carlos' and common grape genomes indicates an expansion of resistance (R) genes in 'Carlos.' We further identified genes involved in Xfsf response by transcriptome sequencing 'Carlos' plants inoculated with Xfsf. We observed 234 differentially expressed genes with functions related to lipid catabolism, oxidation-reduction signaling, and abscisic acid (ABA) signaling as well as seven R genes. Leveraging public data from previous experiments of common grape inoculated with Xfsf, we determined that most differentially expressed genes in the muscadine response were not found in common grape, and three of the R genes identified as differentially expressed in muscadine do not have an ortholog in the common grape genome. CONCLUSIONS: Our results support the utility of a pan-genome approach to identify candidate genes for traits of interest, particularly disease resistance to Xfsf, within and between muscadine and common grape.
Asunto(s)
Vitis , Xylella , Vitis/microbiología , Resistencia a la Enfermedad/genética , Xylella/genética , Cromosomas , Enfermedades de las Plantas/genética , Enfermedades de las Plantas/microbiologíaRESUMEN
Online, open access databases for biological knowledge serve as central repositories for research communities to store, find and analyze integrated, multi-disciplinary datasets. With increasing volumes, complexity and the need to integrate genomic, transcriptomic, metabolomic, proteomic, phenomic and environmental data, community databases face tremendous challenges in ongoing maintenance, expansion and upgrades. A common infrastructure framework using community standards shared by many databases can reduce development burden, provide interoperability, ensure use of common standards and support long-term sustainability. Tripal is a mature, open source platform built to meet this need. With ongoing improvement since its first release in 2009, Tripal provides full functionality for searching, browsing, loading and curating numerous types of data and is a primary technology powering at least 31 publicly available databases spanning plants, animals and human data, primarily storing genomics, genetics and breeding data. Tripal software development is managed by a shared, inclusive governance structure including both project management and advisory teams. Here, we report on the most important and innovative aspects of Tripal after 11 years development, including integration of diverse types of biological data, successful collaborative projects across member databases, and support for implementing FAIR principles.
Asunto(s)
Cruzamiento , Biología Computacional/métodos , Bases de Datos Genéticas , Genómica/métodos , Plantas/genética , Programas Informáticos , Productos Agrícolas/genética , Variación Genética , Filogenia , Plantas/metabolismo , Proteómica , Navegador WebRESUMEN
The Genome Database for Rosaceae (GDR, https://www.rosaceae.org) is an integrated web-based community database resource providing access to publicly available genomics, genetics and breeding data and data-mining tools to facilitate basic, translational and applied research in Rosaceae. The volume of data in GDR has increased greatly over the last 5 years. The GDR now houses multiple versions of whole genome assembly and annotation data from 14 species, made available by recent advances in sequencing technology. Annotated and searchable reference transcriptomes, RefTrans, combining peer-reviewed published RNA-Seq as well as EST datasets, are newly available for major crop species. Significantly more quantitative trait loci, genetic maps and markers are available in MapViewer, a new visualization tool that better integrates with other pages in GDR. Pathways can be accessed through the new GDR Cyc Pathways databases, and synteny among the newest genome assemblies from eight species can be viewed through the new synteny browser, SynView. Collated single-nucleotide polymorphism diversity data and phenotypic data from publicly available breeding datasets are integrated with other relevant data. Also, the new Breeding Information Management System allows breeders to upload, manage and analyze their private breeding data within the secure GDR server with an option to release data publicly.
Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Genoma de Planta/genética , Genómica/métodos , Rosaceae/genética , Biología Computacional/estadística & datos numéricos , Perfilación de la Expresión Génica/métodos , Genes de Plantas/genética , Almacenamiento y Recuperación de la Información/métodos , Internet , Fitomejoramiento/métodos , Sitios de Carácter Cuantitativo/genética , Rosaceae/clasificación , Especificidad de la Especie , Sintenía , Factores de Tiempo , Interfaz Usuario-ComputadorRESUMEN
BACKGROUND: Climate plays an essential role in forest health, and climate change may increase forest productivity losses due to abiotic and biotic stress. Increased temperature leads to the increased formation of ozone (O3). Ozone is formed by the interaction of sunlight, molecular oxygen and by the reactions of chemicals commonly found in industrial and automobile emissions such as nitrogen oxides and volatile organic compounds. Although it is well known that productivity of Northern red oak (Quercus rubra) (NRO), an ecologically and economically important species in the forests of eastern North America, is reduced by exposure to O3, limited information is available on its responses to exogenous stimuli at the level of gene expression. RESULTS: RNA sequencing yielded more than 323 million high-quality raw sequence reads. De novo assembly generated 52,662 unigenes, of which more than 42,000 sequences could be annotated through homology-based searches. A total of 4140 differential expressed genes (DEGs) were detected in response to O3 stress, as compared to their respective controls. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of the O3-response DEGs revealed perturbation of several biological pathways including energy, lipid, amino acid, carbohydrate and terpenoid metabolism as well as plant-pathogen interaction. CONCLUSION: This study provides the first reference transcriptome for NRO and initial insights into the genomic responses of NRO to O3. Gene expression profiling reveals altered primary and secondary metabolism of NRO seedlings, including known defense responses such as terpenoid biosynthesis.
Asunto(s)
Perfilación de la Expresión Génica , Ozono/metabolismo , Quercus/genética , Quercus/metabolismo , Estrés Fisiológico , Transcriptoma , Vías Biosintéticas , Biología Computacional/métodos , Regulación de la Expresión Génica de las Plantas , Ontología de Genes , Interacciones Huésped-Patógeno , Anotación de Secuencia Molecular , Transducción de SeñalRESUMEN
BACKGROUND: Plants have evolved intimate interactions with soil microbes for a range of beneficial functions including nutrient acquisition, pathogen resistance and stress tolerance. Further understanding of this system is a promising way to advance sustainable agriculture by exploiting the versatile benefits offered by the plant microbiome. The rhizosphere is the interface between plant and soil, and functions as the first step of plant defense and root microbiome recruitment. It features a specialized microbial community, intensive microbe-plant and microbe-microbe interactions, and complex signal communication. To decipher the rhizosphere microbiome assembly of soybean (Glycine max), we comprehensively characterized the soybean rhizosphere microbial community using 16S rRNA gene sequencing and evaluated the structuring influence from both host genotype and soil source. RESULTS: Comparison of the soybean rhizosphere to bulk soil revealed significantly different microbiome composition, microbe-microbe interactions and metabolic capacity. Soil type and soybean genotype cooperatively modulated microbiome assembly with soil type predominantly shaping rhizosphere microbiome assembly while host genotype slightly tuned this recruitment process. The undomesticated progenitor species, Glycine soja, had higher rhizosphere diversity in both soil types tested in comparison to the domesticated soybean genotypes. Rhizobium, Novosphingobium, Phenylobacterium, Streptomyces, Nocardioides, etc. were robustly enriched in soybean rhizosphere irrespective of the soil tested. Co-occurrence network analysis revealed dominant soil type effects and genotype specific preferences for key microbe-microbe interactions. Functional prediction results demonstrated converged metabolic capacity in the soybean rhizosphere between soil types and among genotypes, with pathways related to xenobiotic degradation, plant-microbe interactions and nutrient transport being greatly enriched in the rhizosphere. CONCLUSION: This comprehensive comparison of the soybean microbiome between soil types and genotypes expands our understanding of rhizosphere microbe assembly in general and provides foundational information for soybean as a legume crop for this assembly process. The cooperative modulating role of the soil type and host genotype emphasizes the importance of integrated consideration of soil condition and plant genetic variability for future development and application of synthetic microbiomes. Additionally, the detection of the tuning role by soybean genotype in rhizosphere microbiome assembly provides a promising way for future breeding programs to integrate host traits participating in beneficial microbiota assembly.
Asunto(s)
Bacterias/aislamiento & purificación , Glycine max/genética , Microbiología del Suelo , Bacterias/clasificación , Bacterias/genética , Genotipo , Microbiota , Filogenia , Raíces de Plantas/genética , Raíces de Plantas/crecimiento & desarrollo , Raíces de Plantas/microbiología , Rizosfera , Suelo/química , Glycine max/crecimiento & desarrollo , Glycine max/microbiologíaRESUMEN
Chloroplasts retain part of their ancestral genomes and the machinery for expression of those genomes. The nucleus-encoded chloroplast RNA helicase INCREASED SIZE EXCLUSION LIMIT2 (ISE2) is required for chloroplast ribosomal RNA processing and chloro-ribosome assembly. To further elucidate ISE2's role in chloroplast translation, two independent approaches were used to identify its potential protein partners. Both a yeast two-hybrid screen and a pull-down assay identified plastid ribosomal protein L15, uL15c (formerly RPL15), as interacting with ISE2. The interaction was confirmed in vivo by co-immunoprecipitation. Interestingly, we found that rpl15 null mutants do not complete embryogenesis, indicating that RPL15 is an essential gene for autotrophic growth of Arabidopsis thaliana. Arabidopsis and Nicotiana benthamiana plants with reduced expression of RPL15 developed chlorotic leaves, had reduced photosynthetic capacity and exhibited defective chloroplast development. Processing of chloroplast ribosomal RNAs and assembly of ribosomal subunits were disrupted by reduced expression of RPL15. Chloroplast translation was also decreased, reducing accumulation of chloroplast-encoded proteins, in such plants compared to wild-type plants. Notably, knockdown of RPL15 expression increased intercellular trafficking, a phenotype also observed in plants with reduced ISE2 expression. This finding provides further evidence for chloroplast function in modulating intercellular trafficking via plasmodesmata.
Asunto(s)
Proteínas de Arabidopsis/metabolismo , Arabidopsis/genética , Proteínas de Cloroplastos/metabolismo , ARN Helicasas/metabolismo , Proteínas Ribosómicas/metabolismo , Arabidopsis/fisiología , Arabidopsis/ultraestructura , Proteínas de Arabidopsis/genética , Proteínas de Cloroplastos/genética , Cloroplastos/metabolismo , Cloroplastos/ultraestructura , Genes Reporteros , Fotosíntesis , Plasmodesmos/metabolismo , Transporte de Proteínas , ARN Helicasas/genética , ARN del Cloroplasto/genética , ARN Ribosómico/genética , Proteínas Ribosómicas/genética , Nicotiana/genética , Nicotiana/fisiología , Nicotiana/ultraestructuraRESUMEN
Powdery mildews (PMs) are important plant pathogens causing widespread damage. Here, we report the first draft genome of Erysiphe pulchra, the causative agent of PM of flowering dogwood, Cornus florida. The assembled genome was 63.5 Mbp and resulted in formation of 19,442 contigs (N50 = 11,686 bp) that contained an estimated 6,860 genes with a genome coverage of 62×. We found 102 candidate secreted effector proteins (CSEPs) in E. pulchra similar to E. necator genes that are potentially involved in disease development. This draft genome is an initial step for understanding the evolutionary history of the PMs and will also provide insight into evolutionary strategies that led to the wide host expansion and environmental adaptations so effectively employed by the PM lineages.
Asunto(s)
Ascomicetos , Genoma Fúngico , Ascomicetos/genética , Genómica/tendencias , Enfermedades de las Plantas/microbiologíaRESUMEN
INCREASED SIZE EXCLUSION LIMIT2 (ISE2) is a chloroplast-localized RNA helicase that is indispensable for proper plant development. Chloroplasts in leaves with reduced ISE2 expression have previously been shown to exhibit reduced thylakoid contents and increased stromal volume, indicative of defective development. It has recently been reported that ISE2 is required for the splicing of group II introns from chloroplast transcripts. The current study extends these findings, and presents evidence for ISE2's role in multiple aspects of chloroplast RNA processing beyond group II intron splicing. Loss of ISE2 from Arabidopsis thaliana leaves resulted in defects in C-to-U RNA editing, altered accumulation of chloroplast transcripts and chloroplast-encoded proteins, and defective processing of chloroplast ribosomal RNAs. Potential ISE2 substrates were identified by RNA immunoprecipitation followed by next-generation sequencing (RIP-seq), and the diversity of RNA species identified supports ISE2's involvement in multiple aspects of chloroplast RNA metabolism. Comprehensive phylogenetic analyses revealed that ISE2 is a non-canonical Ski2-like RNA helicase that represents a separate sub-clade unique to green photosynthetic organisms, consistent with its function as an essential protein. Thus ISE2's evolutionary conservation may be explained by its numerous roles in regulating chloroplast gene expression.
Asunto(s)
Arabidopsis/enzimología , Arabidopsis/metabolismo , ARN Helicasas/metabolismo , ARN del Cloroplasto/metabolismo , Arabidopsis/genética , Cloroplastos/genética , Cloroplastos/metabolismo , Regulación de la Expresión Génica de las Plantas , Intrones/genética , Plasmodesmos/metabolismo , Edición de ARN/genética , ARN Helicasas/genéticaRESUMEN
Analysis of transcriptome sequence data from eggs and second-stage juveniles (J2s) of sugar beet cyst nematode (SBCN, Heterodera schachtii) identified the full-length genome of a positive-sense single-stranded RNA virus, provisionally named sugar beet cyst nematode virus 1 (SBCNV1). The SBCNV1 sequence was detected in both eggs and J2s, indicating its possible vertical transmission. The 9503-nucleotide genome sequence contains a single long open reading frame, which was predicted to encode a polyprotein with conserved domains for picornaviral structural proteins proximal to its amino terminus and RNA helicase, cysteine proteinase and RNA-dependent RNA polymerase (RdRp) conserved domains proximal to its carboxyl terminus, hallmarks of viruses belonging to the order Picornavirales. Phylogenetic analysis of the predicted SBCNV1 RdRp amino acid sequence indicated that the SBCNV1 sequence is most closely related to members of the family Secoviridae, which includes genera of nematode-transmitted plant-infecting viruses. SBCNV1 represents the first fully sequenced viral genome from SBCN.
Asunto(s)
Beta vulgaris/parasitología , Picornaviridae/clasificación , Picornaviridae/aislamiento & purificación , Transcriptoma , Tylenchoidea/virología , Animales , Genoma Viral , Anotación de Secuencia Molecular , Sistemas de Lectura Abierta , Filogenia , Picornaviridae/genética , ARN Polimerasa Dependiente del ARN/genética , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN , Homología de Secuencia de Aminoácido , Tylenchoidea/genética , Tylenchoidea/crecimiento & desarrollo , Proteínas Virales/genéticaRESUMEN
Mastitis is a detrimental disease in the dairy industry that decreases milk quality and costs upwards of $2 billion annually. Often, mastitis results from bacteria entering the gland through the teat opening. Streptococcus uberis is responsible for a high percentage of subclinical and clinical mastitis. Following an intramammary experimental challenge with S. uberis on Holstein cows (n = 40), milk samples were collected and somatic cell counts (SCC) were determined by the Dairy Herd Improvement Association Laboratory. Traditional genome-wide association studies (GWAS) have utilized test day SCC or SCC lactation averages to identify loci of interest. Our approach utilizes SCC collected following a S. uberis experimental challenge to generate three novel phenotypes: (1) area under the curve (AUC) of SCC for 0-7 days and (2) 0-28 days post-challenge; and (3) when SCC returned to below 200,000 cells/mL post-challenge (< 21 days, 21-28 days, or > 28 days). Polymorphisms were identified using Illumina's BovineSNP50 v2 DNA BeadChip. Associations were tested using Plink software and identified 16 significant (p < 1.0 × 10-4) single-nucleotide polymorphisms (SNPs) across the phenotypes. Most significant SNPs were in genes linked to cell signaling, migration, and apoptosis. Several have been recognized in relation to infectious processes (ATF7, SGK1, and PACRG), but others less so (TRIO, GLRA1, CELSR2, TIAM2, CPE). Further investigation of these genes and their roles in inflammation (e.g., SCC) can provide potential targets that influence resolution of mammary gland infection. Likewise, further investigation of the identified SNP with mastitis and other disease phenotypes can provide greater insight to the potential of these SNP as genetic markers.
Asunto(s)
Leucocitos/fisiología , Mastitis Bovina/genética , Mastitis Bovina/microbiología , Polimorfismo de Nucleótido Simple , Infecciones Estreptocócicas/veterinaria , Animales , Bovinos , Femenino , Estudio de Asociación del Genoma Completo , Leche/citología , Fenotipo , Infecciones Estreptocócicas/genética , Infecciones Estreptocócicas/microbiología , Streptococcus/patogenicidadRESUMEN
BACKGROUND: Interspecific hybrid bermudagrass [Cynodon dactylon (L.) Pers. x C. transvaalensis Burtt-Davy] is one of the most widely used grasses on golf courses, with cultivars derived from 'Tifgreen' or 'Tifdwarf' particularly used for putting greens. Many bermudagrass cultivars established for putting greens can be genetically unstable and lead to the occurrence of undesirable off-type grasses that vary in phenotype. The objective of this research was to genetically and phenotypically differentiate off-type grasses and hybrid cultivars. Beginning in 2013, off-type and desirable hybrid bermudagrass samples were collected from golf course putting greens in the southeastern United States and genetically and phenotypically characterized using genotyping-by-sequencing and morphology. RESULTS: Genotyping-by-sequencing determined that 11% (5) of off-type and desirable samples from putting greens were genetically divergent from standard cultivars such as Champion, MiniVerde, Tifdwarf, TifEagle, and Tifgreen. In addition, genotyping-by-sequencing was unable to genetically distinguish all standard cultivars from one another due to their similar origin and clonal propagation; however, over 90,000 potentially informative nucleotide variants were identified among the triploid hybrid cultivars. CONCLUSIONS: Although few genetic differences were found in this research, samples harvested from golf course putting greens had variable morphology and were clustered into three distinct phenotypic groups. The majority of off-type grasses in hybrid bermudagrass putting greens were genetically similar with variable morphological traits. Off-type grasses within golf course putting greens have the potential to compromise putting surface functionality and aesthetics.
Asunto(s)
Cynodon/genética , Hibridación Genética , ADN de Plantas/genética , Variación Genética , Genotipo , Golf , Fenotipo , Análisis de Secuencia de ADNRESUMEN
BACKGROUND: Blueberries are one of the few horticultural crops adapted to grow in acidic soils. Neutral to basic soil pH is detrimental to all commonly cultivated blueberry species, including Vaccinium corymbosum (VC). In contrast, the wild species V. arboreum (VA) is able to tolerate a wider range of soil pH. To assess the molecular mechanisms involved in near neutral pH stress response, plants from pH-sensitive VC (tetraploid) and pH-tolerant VA (diploid) were grown at near neutral pH 6.5 and at the preferred pH of 4.5. RESULTS: Transcriptome sequencing of root RNA was performed for 4 biological replications per species x pH level interaction, for a total of 16 samples. Reads were mapped to the reference genome from diploid V. corymbosum, transforming ~55% of the reads to gene counts. A quasi-likelihood F test identified differential expression due to pH stress in 337 and 4867 genes in VA and VC, respectively. Both species shared regulation of genes involved in nutrient homeostasis and cell wall metabolism. VA and VC exhibited differential regulation of signaling pathways related to abiotic/biotic stress, cellulose and lignin biosynthesis, and nutrient uptake. CONCLUSIONS: The specific responses in VA likely facilitate tolerance to higher soil pH. In contrast, response in VC, despite affecting a greater number of genes, is not effective overcoming the stress induced by pH. Further inspection of those genes with differential expression that are specific in VA may provide insight on the mechanisms towards tolerance.
Asunto(s)
Arándanos Azules (Planta)/genética , Arándanos Azules (Planta)/fisiología , Perfilación de la Expresión Génica , Raíces de Plantas/genética , ARN de Planta/genética , Análisis de Secuencia de ARN , Estrés Fisiológico/genética , Adaptación Fisiológica/genética , Anotación de Secuencia Molecular , Fenotipo , Suelo/químicaRESUMEN
BACKGROUND: Restriction site associated DNA sequencing (RADseq) has the potential to be a broadly applicable, low-cost approach for high-quality genetic linkage mapping in forest trees lacking a reference genome. The statistical inference of linear order must be as accurate as possible for the correct ordering of sequence scaffolds and contigs to chromosomal locations. Accurate maps also facilitate the discovery of chromosome segments containing allelic variants conferring resistance to the biotic and abiotic stresses that threaten forest trees worldwide. We used ddRADseq for genetic mapping in the tree Quercus rubra, with an approach optimized to produce a high-quality map. Our study design also enabled us to model the results we would have obtained with less depth of coverage. RESULTS: Our sequencing design produced a high sequencing depth in the parents (248×) and a moderate sequencing depth (15×) in the progeny. The digital normalization method of generating a de novo reference and the SAMtools SNP variant caller yielded the most SNP calls (78,725). The major drivers of map inflation were multiple SNPs located within the same sequence (77% of SNPs called). The highest quality map was generated with a low level of missing data (5%) and a genome-wide threshold of 0.025 for deviation from Mendelian expectation. The final map included 849 SNP markers (1.8% of the 78,725 SNPs called). Downsampling the individual FASTQ files to model lower depth of coverage revealed that sequencing the progeny using 96 samples per lane would have yielded too few SNP markers to generate a map, even if we had sequenced the parents at depth 248×. CONCLUSIONS: The ddRADseq technology produced enough high-quality SNP markers to make a moderately dense, high-quality map. The success of this project was due to high depth of coverage of the parents, moderate depth of coverage of the progeny, a good framework map, an optimized bioinformatics pipeline, and rigorous premapping filters. The ddRADseq approach is useful for the construction of high-quality genetic maps in organisms lacking a reference genome if the parents and progeny are sequenced at sufficient depth. Technical improvements in reduced representation sequencing (RRS) approaches are needed to reduce the amount of missing data.
Asunto(s)
Mapeo Cromosómico/métodos , Enzimas de Restricción del ADN/metabolismo , Quercus/genética , Análisis de Secuencia de ADN , Técnicas de Genotipaje , Polimorfismo de Nucleótido SimpleRESUMEN
Forest trees are an unparalleled group of organisms in their combined ecological, economic and societal importance. With widespread distributions, predominantly random mating systems and large population sizes, most tree species harbour extensive genetic variation both within and among populations. At the same time, demographic processes associated with Pleistocene climate oscillations and land-use change have affected contemporary range-wide diversity and may impinge on the potential for future adaptation. Understanding how these adaptive and neutral processes have shaped the genomes of trees species is therefore central to their management and conservation. As for many other taxa, the advent of high-throughput sequencing methods is expected to yield an understanding of the interplay between the genome and environment at a level of detail and depth not possible only a few years ago. An international conference entitled 'Genomics and Forest Tree Genetics' was held in May 2016, in Arcachon (France), and brought together forest geneticists with a wide range of research interests to disseminate recent efforts that leverage contemporary genomic tools to probe the population, quantitative and evolutionary genomics of trees. An important goal of the conference was to discuss how such data can be applied to both genome-enabled breeding and the conservation of forest genetic resources under land use and climate change. Here, we report discoveries presented at the meeting and discuss how the ecological genomic toolkit can be used to address both basic and applied questions in tree biology.
Asunto(s)
Conservación de los Recursos Naturales , Genómica/métodos , Fitomejoramiento , Árboles/genética , Cambio Climático , Congresos como Asunto , Bosques , FranciaRESUMEN
BACKGROUND: To develop a set of transcriptome sequences to support research on environmental stress responses in green ash (Fraxinus pennsylvanica), we undertook deep RNA sequencing of green ash tissues under various stress treatments. The treatments, including emerald ash borer (EAB) feeding, heat, drought, cold and ozone, were selected to mimic the increasing threats of climate change and invasive pests faced by green ash across its native habitat. RESULTS: We report the generation and assembly of RNA sequences from 55 green ash samples into 107,611 putative unique transcripts (PUTs). 52,899 open reading frames were identified. Functional annotation of the PUTs by comparison to the Uniprot protein database identified matches for 63 % of transcripts and for 98 % of transcripts with ORFs. Further functional annotation identified conserved protein domains and assigned gene ontology terms to the PUTs. Examination of transcript expression across different RNA libraries revealed that expression patterns clustered based on tissues regardless of stress treatment. The transcripts from stress treatments were further examined to identify differential expression. Tens to hundreds of differentially expressed PUTs were identified for each stress treatment. A set of 109 PUTs were found to be consistently up or down regulated across three or more different stress treatments, representing basal stress response candidate genes in green ash. In addition, 1956 simple sequence repeats were identified in the PUTs, of which we identified 465 high quality DNA markers and designed flanking PCR primers. CONCLUSIONS: North American native ash trees have suffered extensive mortality due to EAB infestation, creating a need to breed or select for resistant green ash genotypes. Stress from climate change is an additional concern for longevity of native ash populations. The use of genomics could accelerate management efforts. The green ash transcriptome we have developed provides important sequence information, genetic markers and stress-response candidate genes.
Asunto(s)
Fraxinus/genética , Genes de Plantas , Estrés Fisiológico/genética , Transcriptoma , Cambio Climático , Análisis por Conglomerados , Biología Computacional/métodos , Sequías , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Ontología de Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Repeticiones de Microsatélite , Missouri , Anotación de Secuencia Molecular , Especificidad de Órganos/genéticaRESUMEN
BACKGROUND: The ATP-binding cassette (ABC) transporter gene superfamily is ubiquitous among extant organisms and prominently represented in plants. ABC transporters act to transport compounds across cellular membranes and are involved in a diverse range of biological processes. Thus, the applicability to biotechnology is vast, including cancer resistance in humans, drug resistance among vertebrates, and herbicide and other xenobiotic resistance in plants. In addition, plants appear to harbor the highest diversity of ABC transporter genes compared with any other group of organisms. This study applied transcriptome analysis to survey the kingdom-wide ABC transporter diversity in plants and suggest biotechnology applications of this diversity. RESULTS: We utilized sequence similarity-based informatics techniques to infer the identity of ABC transporter gene candidates from 1295 phylogenetically-diverse plant transcriptomes. A total of 97,149 putative (approximately 25 % were full-length) ABC transporter gene members were identified; each RNA-Seq library (plant sample) had 88 ± 30 gene members. As expected, simpler organisms, such as algae, had fewer unique members than vascular land plants. Differences were also noted in the richness of certain ABC transporter subfamilies. Land plants had more unique ABCB, ABCC, and ABCG transporter gene members on average (p < 0.005), and green algae, red algae, and bryophytes had significantly more ABCF transporter gene members (p < 0.005). Ferns had significantly fewer ABCA transporter gene members than all other plant groups (p < 0.005). CONCLUSIONS: We present a transcriptomic overview of ABC transporter gene members across all major plant groups. An increase in the number of gene family members present in the ABCB, ABCC, and ABCD transporter subfamilies may indicate an expansion of the ABC transporter superfamily among green land plants, which include all crop species. The striking difference between the number of ABCA subfamily transporter gene members between ferns and other plant taxa is surprising and merits further investigation. Discussed is the potential exploitation of ABC transporters in plant biotechnology, with an emphasis on crops.
Asunto(s)
Transportadoras de Casetes de Unión a ATP/genética , Genes de Plantas/genética , Variación Genética/genética , Genoma de Planta/genética , Proteínas de Plantas/genética , Plantas/genética , Biotecnología/tendencias , Mapeo Cromosómico/métodos , Minería de Datos/métodos , Bases de Datos de Proteínas , Especificidad de la EspecieRESUMEN
BACKGROUND: Chinese chestnut (Castanea mollissima) has emerged as a model species for the Fagaceae family with extensive genomic resources including a physical map, a dense genetic map and quantitative trait loci (QTLs) for chestnut blight resistance. These resources enable comparative genomics analyses relative to model plants. We assessed the degree of conservation between the chestnut genome and other well annotated and assembled plant genomic sequences, focusing on the QTL regions of most interest to the chestnut breeding community. RESULTS: The integrated physical and genetic map of Chinese chestnut has been improved to now include 858 shared sequence-based markers. The utility of the integrated map has also been improved through the addition of 42,970 BAC (bacterial artificial chromosome) end sequences spanning over 26 million bases of the estimated 800 Mb chestnut genome. Synteny between chestnut and ten model plant species was conducted on a macro-syntenic scale using sequences from both individual probes and BAC end sequences across the chestnut physical map. Blocks of synteny with chestnut were found in all ten reference species, with the percent of the chestnut physical map that could be aligned ranging from 10 to 39 %. The integrated genetic and physical map was utilized to identify BACs that spanned the three previously identified QTL regions conferring blight resistance. The clones were pooled and sequenced, yielding 396 sequence scaffolds covering 13.9 Mbp. Comparative genomic analysis on a microsytenic scale, using the QTL-associated genomic sequence, identified synteny from chestnut to other plant genomes ranging from 5.4 to 12.9 % of the genome sequences aligning. CONCLUSIONS: On both the macro- and micro-synteny levels, the peach, grape and poplar genomes were found to be the most structurally conserved with chestnut. Interestingly, these results did not strictly follow the expectation that decreased phylogenetic distance would correspond to increased levels of genome preservation, but rather suggest the additional influence of life-history traits on preservation of synteny. The regions of synteny that were detected provide an important tool for defining and cataloging genes in the QTL regions for advancing chestnut blight resistance research.
Asunto(s)
Genoma de Planta , Magnoliopsida/genética , Sintenía/genética , Biología Computacional , Evolución Molecular , Genómica/métodos , Magnoliopsida/clasificación , Filogenia , Mapeo Físico de Cromosoma , Sitios de Carácter CuantitativoRESUMEN
BACKGROUND: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed sequence tags (ESTs) are a source of SSRs that can be used to develop markers to facilitate plant breeding and for more basic research across genera and higher plant orders. METHODS: Leaf and meristem tissue from 'Heritage' red raspberry (Rubus idaeus) and 'Bristol' black raspberry (R. occidentalis) were utilized for RNA extraction. After conversion to cDNA and library construction, ESTs were sequenced, quality verified, assembled and scanned for SSRs. Primers flanking the SSRs were designed and a subset tested for amplification, polymorphism and transferability across species. ESTs containing SSRs were functionally annotated using the GenBank non-redundant (nr) database and further classified using the gene ontology database. RESULTS: To accelerate development of EST-SSRs in the genus Rubus (Rosaceae), 1149 and 2358 cDNA sequences were generated from red raspberry and black raspberry, respectively. The cDNA sequences were screened using rigorous filtering criteria which resulted in the identification of 121 and 257 SSR loci for red and black raspberry, respectively. Primers were designed from the surrounding sequences resulting in 131 and 288 primer pairs, respectively, as some sequences contained more than one SSR locus. Sequence analysis revealed that the SSR-containing genes span a diversity of functions and share more sequence identity with strawberry genes than with other Rosaceous species. CONCLUSION: This resource of Rubus-specific, gene-derived markers will facilitate the construction of linkage maps composed of transferable markers for studying and manipulating important traits in this economically important genus.