Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 24
Filtrar
1.
Nucleic Acids Res ; 49(D1): D1496-D1501, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33264401

RESUMEN

SoyBase, a USDA genetic and genomics database, holds professionally curated soybean genetic and genomic data, which is integrated and made accessible to researchers and breeders. The site holds several reference genome assemblies, as well as genetic maps, thousands of mapped traits, expression and epigenetic data, pedigree information, and extensive variant and genotyping data sets. SoyBase displays include genetic, genomic, and epigenetic maps of the soybean genome. Gene expression data is presented in the genome viewer as heat maps and pictorial and tabular displays in gene report pages. Millions of sequence variants have been added, representing variations across various collections of cultivars. This variant data is explorable using new interactive tools to visualize the distribution of those variants across the genome, between selected accessions. SoyBase holds several reference-quality soybean genome assemblies, accessible via various query tools and browsers, including a new visualization system for exploring the soybean pan-genome. SoyBase also serves as a nexus of announcements pertinent to the greater soybean research community. The database also includes a soybean-specific anatomic and biochemical trait ontology. The database can be accessed at https://soybase.org.


Asunto(s)
Bases de Datos Genéticas , Regulación de la Expresión Génica de las Plantas , Genoma de Planta , Genotipo , Glycine max/genética , Proteínas de Plantas/genética , Mapeo Cromosómico , Productos Agrícolas , Epigénesis Genética , Estudios de Asociación Genética , Internet , Anotación de Secuencia Molecular , Filogenia , Fitomejoramiento/métodos , Proteínas de Plantas/metabolismo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Carácter Cuantitativo Heredable , Estándares de Referencia , Programas Informáticos , Glycine max/clasificación , Glycine max/metabolismo , Estados Unidos , United States Department of Agriculture
2.
PLoS Comput Biol ; 14(12): e1006472, 2018 12.
Artículo en Inglés | MEDLINE | ID: mdl-30589835

RESUMEN

As sequencing prices drop, genomic data accumulates-seemingly at a steadily increasing pace. Most genomic data potentially have value beyond the initial purpose-but only if shared with the scientific community. This, of course, is often easier said than done. Some of the challenges in sharing genomic data include data volume (raw file sizes and number of files), complexities, formats, nomenclatures, metadata descriptions, and the choice of a repository. In this paper, we describe 10 quick tips for sharing open genomic data.


Asunto(s)
Bases de Datos Genéticas/tendencias , Difusión de la Información/métodos , Almacenamiento y Recuperación de la Información/métodos , Bases de Datos Factuales/estadística & datos numéricos , Bases de Datos Factuales/tendencias , Bases de Datos Genéticas/estadística & datos numéricos , Genómica , Programas Informáticos , Interfaz Usuario-Computador
3.
Database (Oxford) ; 20232023 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-37971715

RESUMEN

Over the last couple of decades, there has been a rapid growth in the number and scope of agricultural genetics, genomics and breeding databases and resources. The AgBioData Consortium (https://www.agbiodata.org/) currently represents 44 databases and resources (https://www.agbiodata.org/databases) covering model or crop plant and animal GGB data, ontologies, pathways, genetic variation and breeding platforms (referred to as 'databases' throughout). One of the goals of the Consortium is to facilitate FAIR (Findable, Accessible, Interoperable, and Reusable) data management and the integration of datasets which requires data sharing, along with structured vocabularies and/or ontologies. Two AgBioData working groups, focused on Data Sharing and Ontologies, respectively, conducted a Consortium-wide survey to assess the current status and future needs of the members in those areas. A total of 33 researchers responded to the survey, representing 37 databases. Results suggest that data-sharing practices by AgBioData databases are in a fairly healthy state, but it is not clear whether this is true for all metadata and data types across all databases; and that, ontology use has not substantially changed since a similar survey was conducted in 2017. Based on our evaluation of the survey results, we recommend (i) providing training for database personnel in a specific data-sharing techniques, as well as in ontology use; (ii) further study on what metadata is shared, and how well it is shared among databases; (iii) promoting an understanding of data sharing and ontologies in the stakeholder community; (iv) improving data sharing and ontologies for specific phenotypic data types and formats; and (v) lowering specific barriers to data sharing and ontology use, by identifying sustainability solutions, and the identification, promotion, or development of data standards. Combined, these improvements are likely to help AgBioData databases increase development efforts towards improved ontology use, and data sharing via programmatic means. Database URL  https://www.agbiodata.org/databases.


Asunto(s)
Manejo de Datos , Fitomejoramiento , Animales , Genómica/métodos , Bases de Datos Factuales , Difusión de la Información
4.
Plant Physiol ; 157(1): 355-71, 2011 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-21791600

RESUMEN

Inoculation of soybean (Glycine max) plants with Phakopsora pachyrhizi, the causal organism of Asian soybean rust, elicits a biphasic response characterized by a burst of differential gene expression in the first 12 h. A quiescent period occurs from 24 to 48 h after inoculation, in which P. pachyrhizi continues to develop but does not elicit strong host responses, followed by a second phase of intense gene expression. To correlate soybean responses with P. pachyrhizi growth and development, we inoculated the soybean cultivar Ankur (accession PI462312), which carries the Rpp3 resistance gene, with avirulent and virulent isolates of P. pachyrhizi. The avirulent isolate Hawaii 94-1 elicits hypersensitive cell death that limits fungal growth on Ankur and results in an incompatible response, while the virulent isolate Taiwan 80-2 grows extensively, sporulates profusely, and produces a compatible reaction. Inoculated leaves were collected over a 288-h time course for microarray analysis of soybean gene expression and microscopic analysis of P. pachyrhizi growth and development. The first burst in gene expression correlated with appressorium formation and penetration of epidermal cells, while the second burst of gene expression changes followed the onset of haustoria formation in both compatible and incompatible interactions. The proliferation of haustoria coincided with the inhibition of P. pachyrhizi growth in the incompatible interaction or the beginning of accelerated growth in the compatible interaction. The temporal relationships between P. pachyrhizi growth and host responses provide an important context in which to view interacting gene networks that mediate the outcomes of their interactions.


Asunto(s)
Basidiomycota/fisiología , Regulación de la Expresión Génica de las Plantas , Glycine max/microbiología , Basidiomycota/patogenicidad , Interacciones Huésped-Patógeno , Fotosíntesis , Reguladores del Crecimiento de las Plantas/metabolismo , Transducción de Señal , Glycine max/metabolismo , Glycine max/fisiología , Transcripción Genética
5.
Plant Physiol ; 156(1): 240-53, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21321255

RESUMEN

Mutagenized populations have become indispensable resources for introducing variation and studying gene function in plant genomics research. In this study, fast neutron (FN) radiation was used to induce deletion mutations in the soybean (Glycine max) genome. Approximately 120,000 soybean seeds were exposed to FN radiation doses of up to 32 Gray units to develop over 23,000 independent M2 lines. Here, we demonstrate the utility of this population for phenotypic screening and associated genomic characterization of striking and agronomically important traits. Plant variation was cataloged for seed composition, maturity, morphology, pigmentation, and nodulation traits. Mutants that showed significant increases or decreases in seed protein and oil content across multiple generations and environments were identified. The application of comparative genomic hybridization (CGH) to lesion-induced mutants for deletion mapping was validated on a midoleate x-ray mutant, M23, with a known FAD2-1A (for fatty acid desaturase) gene deletion. Using CGH, a subset of mutants was characterized, revealing deletion regions and candidate genes associated with phenotypes of interest. Exome resequencing and sequencing of PCR products confirmed FN-induced deletions detected by CGH. Beyond characterization of soybean FN mutants, this study demonstrates the utility of CGH, exome sequence capture, and next-generation sequencing approaches for analyses of mutant plant genomes. We present this FN mutant soybean population as a valuable public resource for future genetic screens and functional genomics research.


Asunto(s)
Hibridación Genómica Comparativa/métodos , Genoma de Planta/genética , Genómica , Glycine max/genética , Proteínas de Plantas/genética , Exoma/genética , Neutrones Rápidos , Secuenciación de Nucleótidos de Alto Rendimiento , Semillas/genética , Análisis de Secuencia de ADN , Eliminación de Secuencia
6.
Nucleic Acids Res ; 38(Database issue): D843-6, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-20008513

RESUMEN

SoyBase, the USDA-ARS soybean genetic database, is a comprehensive repository for professionally curated genetics, genomics and related data resources for soybean. SoyBase contains the most current genetic, physical and genomic sequence maps integrated with qualitative and quantitative traits. The quantitative trait loci (QTL) represent more than 18 years of QTL mapping of more than 90 unique traits. SoyBase also contains the well-annotated 'Williams 82' genomic sequence and associated data mining tools. The genetic and sequence views of the soybean chromosomes and the extensive data on traits and phenotypes are extensively interlinked. This allows entry to the database using almost any kind of available information, such as genetic map symbols, soybean gene names or phenotypic traits. SoyBase is the repository for controlled vocabularies for soybean growth, development and trait terms, which are also linked to the more general plant ontologies. SoyBase can be accessed at http://soybase.org.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Genética , Genoma de Planta , Genómica/métodos , Glycine max/genética , Glycine max/fisiología , Cromosomas de las Plantas , Biología Computacional/tendencias , Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información/métodos , Internet , Modelos Genéticos , Sitios de Carácter Cuantitativo , Programas Informáticos , Estados Unidos
7.
Genome ; 54(1): 10-8, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21217801

RESUMEN

Studies have indicated that exon and intron size and intergenic distance are correlated with gene expression levels and expression breadth. Previous reports on these correlations in plants and animals have been conflicting. In this study, next-generation sequence data, which has been shown to be more sensitive than previous expression profiling technologies, were generated and analyzed from 14 tissues. Our results revealed a novel dichotomy. At the low expression level, an increase in expression breadth correlated with an increase in transcript size because of an increase in the number of exons and introns. No significant changes in intron or exon sizes were noted. Conversely, genes expressed at the intermediate to high expression levels displayed a decrease in transcript size as their expression breadth increased. This was due to smaller exons, with no significant change in the number of exons. Taking advantage of the known gene space of soybean, we evaluated the positioning of genes and found significant clustering of similarly expressed genes. Identifying the correlations between the physical parameters of individual genes could lead to uncovering the role of regulation owing to nucleotide composition, which might have potential impacts in discerning the role of the noncoding regions.


Asunto(s)
Exones/genética , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Glycine max/genética , Intrones/genética , Animales , ADN Intergénico/genética , Perfilación de la Expresión Génica
8.
Plants (Basel) ; 10(11)2021 Nov 18.
Artículo en Inglés | MEDLINE | ID: mdl-34834856

RESUMEN

Seeds, especially those of certain grasses and legumes, provide the majority of the protein and carbohydrates for much of the world's population. Therefore, improvements in seed quality and yield are important drivers for the development of new crop varieties to feed a growing population. Quantitative Trait Loci (QTL) have been identified for many biologically interesting and agronomically important traits, including many seed quality traits. QTL can help explain the genetic architecture of the traits and can also be used to incorporate traits into new crop cultivars during breeding. Despite the important contributions that QTL have made to basic studies and plant breeding, knowing the exact gene(s) conditioning each QTL would greatly improve our ability to study the underlying genetics, biochemistry and regulatory networks. The data sets needed for identifying these genes are increasingly available and often housed in species- or clade-specific genetics and genomics databases. In this demonstration, we present a generalized walkthrough of how such databases can be used in these studies using SoyBase, the USDA soybean Genetics and Genomics Database, as an example.

9.
Sci Data ; 8(1): 50, 2021 02 08.
Artículo en Inglés | MEDLINE | ID: mdl-33558550

RESUMEN

We report characteristics of soybean genetic diversity and structure from the resequencing of 481 diverse soybean accessions, comprising 52 wild (Glycine soja) selections and 429 cultivated (Glycine max) varieties (landraces and elites). This data was used to identify 7.8 million SNPs, to predict SNP effects relative to genic regions, and to identify the genetic structure, relationships, and linkage disequilibrium. We found evidence of distinct, mostly independent selection of lineages by particular geographic location. Among cultivated varieties, we identified numerous highly conserved regions, suggesting selection during domestication. Comparisons of these accessions against the whole U.S. germplasm genotyped with the SoySNP50K iSelect BeadChip revealed that over 95% of the re-sequenced accessions have a high similarity to their SoySNP50K counterparts. Probable errors in seed source or genotype tracking were also identified in approximately 5% of the accessions.


Asunto(s)
Genoma de Planta , Glycine max/genética , Polimorfismo de Nucleótido Simple , Productos Agrícolas/genética , Fabaceae/genética , Genotipo , Geografía , Desequilibrio de Ligamiento , Selección Genética
10.
BMC Genomics ; 11: 113, 2010 Feb 17.
Artículo en Inglés | MEDLINE | ID: mdl-20163715

RESUMEN

BACKGROUND: Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop. DESCRIPTION: Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I) and 6,029 DNA transposons (Class II) with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95%) of these elements (particularly a few hundred low-copy-number families) are first described in this study. CONCLUSION: SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually curated transposable element database for any individual plant genome completely sequenced to date. Transposable elements previously identified in legumes, the third largest family of flowering plants, are relatively scarce. Thus this database will facilitate structural, evolutionary, functional, and epigenetic analyses of transposable elements in soybean and other legume species.


Asunto(s)
Elementos Transponibles de ADN , Bases de Datos de Ácidos Nucleicos , Genoma de Planta , Glycine max/genética , ADN de Plantas/genética , Retroelementos , Análisis de Secuencia de ADN
11.
BMC Plant Biol ; 10: 160, 2010 Aug 05.
Artículo en Inglés | MEDLINE | ID: mdl-20687943

RESUMEN

BACKGROUND: Next generation sequencing is transforming our understanding of transcriptomes. It can determine the expression level of transcripts with a dynamic range of over six orders of magnitude from multiple tissues, developmental stages or conditions. Patterns of gene expression provide insight into functions of genes with unknown annotation. RESULTS: The RNA Seq-Atlas presented here provides a record of high-resolution gene expression in a set of fourteen diverse tissues. Hierarchical clustering of transcriptional profiles for these tissues suggests three clades with similar profiles: aerial, underground and seed tissues. We also investigate the relationship between gene structure and gene expression and find a correlation between gene length and expression. Additionally, we find dramatic tissue-specific gene expression of both the most highly-expressed genes and the genes specific to legumes in seed development and nodule tissues. Analysis of the gene expression profiles of over 2,000 genes with preferential gene expression in seed suggests there are more than 177 genes with functional roles that are involved in the economically important seed filling process. Finally, the Seq-atlas also provides a means of evaluating existing gene model annotations for the Glycine max genome. CONCLUSIONS: This RNA-Seq atlas extends the analyses of previous gene expression atlases performed using Affymetrix GeneChip technology and provides an example of new methods to accommodate the increase in transcriptome data obtained from next generation sequencing. Data contained within this RNA-Seq atlas of Glycine max can be explored at http://www.soybase.org/soyseq.


Asunto(s)
Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Glycine max/genética , Glycine max/metabolismo , Análisis por Conglomerados , MicroARNs/genética , ARN Mensajero/genética , ARN de Planta/genética , Análisis de Secuencia de ARN
12.
BMC Bioinformatics ; 10: 309, 2009 Sep 23.
Artículo en Inglés | MEDLINE | ID: mdl-19775460

RESUMEN

BACKGROUND: SSWAP (Simple Semantic Web Architecture and Protocol; pronounced "swap") is an architecture, protocol, and platform for using reasoning to semantically integrate heterogeneous disparate data and services on the web. SSWAP was developed as a hybrid semantic web services technology to overcome limitations found in both pure web service technologies and pure semantic web technologies. RESULTS: There are currently over 2400 resources published in SSWAP. Approximately two dozen are custom-written services for QTL (Quantitative Trait Loci) and mapping data for legumes and grasses (grains). The remaining are wrappers to Nucleic Acids Research Database and Web Server entries. As an architecture, SSWAP establishes how clients (users of data, services, and ontologies), providers (suppliers of data, services, and ontologies), and discovery servers (semantic search engines) interact to allow for the description, querying, discovery, invocation, and response of semantic web services. As a protocol, SSWAP provides the vocabulary and semantics to allow clients, providers, and discovery servers to engage in semantic web services. The protocol is based on the W3C-sanctioned first-order description logic language OWL DL. As an open source platform, a discovery server running at http://sswap.info (as in to "swap info") uses the description logic reasoner Pellet to integrate semantic resources. The platform hosts an interactive guide to the protocol at http://sswap.info/protocol.jsp, developer tools at http://sswap.info/developer.jsp, and a portal to third-party ontologies at http://sswapmeet.sswap.info (a "swap meet"). CONCLUSION: SSWAP addresses the three basic requirements of a semantic web services architecture (i.e., a common syntax, shared semantic, and semantic discovery) while addressing three technology limitations common in distributed service systems: i.e., i) the fatal mutability of traditional interfaces, ii) the rigidity and fragility of static subsumption hierarchies, and iii) the confounding of content, structure, and presentation. SSWAP is novel by establishing the concept of a canonical yet mutable OWL DL graph that allows data and service providers to describe their resources, to allow discovery servers to offer semantically rich search engines, to allow clients to discover and invoke those resources, and to allow providers to respond with semantically tagged data. SSWAP allows for a mix-and-match of terms from both new and legacy third-party ontologies in these graphs.


Asunto(s)
Biología Computacional/métodos , Difusión de la Información/métodos , Semántica , Programas Informáticos , Bases de Datos Factuales , Almacenamiento y Recuperación de la Información , Internet , Interfaz Usuario-Computador
13.
BMC Genomics ; 10: 376, 2009 Aug 13.
Artículo en Inglés | MEDLINE | ID: mdl-19678937

RESUMEN

BACKGROUND: Soybeans grown in the upper Midwestern United States often suffer from iron deficiency chlorosis, which results in yield loss at the end of the season. To better understand the effect of iron availability on soybean yield, we identified genes in two near isogenic lines with changes in expression patterns when plants were grown in iron sufficient and iron deficient conditions. RESULTS: Transcriptional profiles of soybean (Glycine max, L. Merr) near isogenic lines Clark (PI548553, iron efficient) and IsoClark (PI547430, iron inefficient) grown under Fe-sufficient and Fe-limited conditions were analyzed and compared using the Affymetrix GeneChip Soybean Genome Array. There were 835 candidate genes in the Clark (PI548553) genotype and 200 candidate genes in the IsoClark (PI547430) genotype putatively involved in soybean's iron stress response. Of these candidate genes, fifty-eight genes in the Clark genotype were identified with a genetic location within known iron efficiency QTL and 21 in the IsoClark genotype. The arrays also identified 170 single feature polymorphisms (SFPs) specific to either Clark or IsoClark. A sliding window analysis of the microarray data and the 7X genome assembly coupled with an iterative model of the data showed the candidate genes are clustered in the genome. An analysis of 5' untranslated regions in the promoter of candidate genes identified 11 conserved motifs in 248 differentially expressed genes, all from the Clark genotype, representing 129 clusters identified earlier, confirming the cluster analysis results. CONCLUSION: These analyses have identified the first genes with expression patterns that are affected by iron stress and are located within QTL specific to iron deficiency stress. The genetic location and promoter motif analysis results support the hypothesis that the differentially expressed genes are co-regulated. The combined results of all analyses lead us to postulate iron inefficiency in soybean is a result of a mutation in a transcription factor(s), which controls the expression of genes required in inducing an iron stress response.


Asunto(s)
Genoma de Planta , Glycine max/genética , Deficiencias de Hierro , Análisis de Secuencia por Matrices de Oligonucleótidos , Enfermedades de las Plantas/genética , Análisis por Conglomerados , ADN de Plantas/genética , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas , Genes de Plantas , Genotipo , Hierro/metabolismo , Regiones Promotoras Genéticas , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ADN , Glycine max/metabolismo
14.
Database (Oxford) ; 20182018 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-30239679

RESUMEN

The future of agricultural research depends on data. The sheer volume of agricultural biological data being produced today makes excellent data management essential. Governmental agencies, publishers and science funders require data management plans for publicly funded research. Furthermore, the value of data increases exponentially when they are properly stored, described, integrated and shared, so that they can be easily utilized in future analyses. AgBioData (https://www.agbiodata.org) is a consortium of people working at agricultural biological databases, data archives and knowledgbases who strive to identify common issues in database development, curation and management, with the goal of creating database products that are more Findable, Accessible, Interoperable and Reusable. We strive to promote authentic, detailed, accurate and explicit communication between all parties involved in scientific data. As a step toward this goal, we present the current state of biocuration, ontologies, metadata and persistence, database platforms, programmatic (machine) access to data, communication and sustainability with regard to data curation. Each section describes challenges and opportunities for these topics, along with recommendations and best practices.


Asunto(s)
Agricultura , Bases de Datos Genéticas , Genómica , Cruzamiento , Ontología de Genes , Metadatos , Encuestas y Cuestionarios
15.
Mol Plant Microbe Interact ; 20(8): 887-99, 2007 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-17722693

RESUMEN

Asian soybean rust (ASR), caused by Phakopsora pachyrhizi, is now established in all major soybean-producing countries. Currently, there is little information about the molecular basis of ASR-soybean interactions, which will be needed to assist future efforts to develop effective resistance. Toward this end, abundance changes of soybean mRNAs were measured over a 7-day ASR infection time course in mock-inoculated and infected leaves of a soybean accession (PI230970) carrying the Rpp2 resistance gene and a susceptible genotype (Embrapa-48). The expression profiles of differentially expressed genes (ASR-infected compared with the mock-inoculated control) revealed a biphasic response to ASR in each genotype. Within the first 12 h after inoculation (hai), which corresponds to fungal germination and penetration of the epidermal cells, differential gene expression changes were evident in both genotypes. mRNA expression of these genes mostly returned to levels found in mock-inoculated plants by 24 hai. In the susceptible genotype, gene expression remained unaffected by rust infection until 96 hai, a time period when rapid fungal growth began. In contrast, gene expression in the resistant genotype diverged from the mock-inoculated control earlier, at 72 h, demonstrating that Rpp2-mediated defenses were initiated prior to this time. These data suggest that ASR initially induces a nonspecific response that is transient or is suppressed when early steps in colonization are completed in both soybean genotypes. The race-specific resistance phenotype of Rpp2 is manifested in massive gene expression changes after the initial response prior to the onset of rapid fungal growth that occurs in the susceptible genotype.


Asunto(s)
Basidiomycota/fisiología , Glycine max/microbiología , Enfermedades de las Plantas/genética , ARN Mensajero/metabolismo , Análisis por Conglomerados , Perfilación de la Expresión Génica , Genotipo , Inmunidad Innata/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Glycine max/genética , Glycine max/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/metabolismo
16.
BMC Genomics ; 8: 330, 2007 Sep 19.
Artículo en Inglés | MEDLINE | ID: mdl-17880721

RESUMEN

BACKGROUND: Soybean, Glycine max (L.) Merr., is a well documented paleopolyploid. What remains relatively under characterized is the level of sequence identity in retained homeologous regions of the genome. Recently, the Department of Energy Joint Genome Institute and United States Department of Agriculture jointly announced the sequencing of the soybean genome. One of the initial concerns is to what extent sequence identity in homeologous regions would have on whole genome shotgun sequence assembly. RESULTS: Seventeen BACs representing approximately 2.03 Mb were sequenced as representative potential homeologous regions from the soybean genome. Genetic mapping of each BAC shows that 11 of the 20 chromosomes are represented. Sequence comparisons between homeologous BACs shows that the soybean genome is a mosaic of retained paleopolyploid regions. Some regions appear to be highly conserved while other regions have diverged significantly. Large-scale "batch" reassembly of all 17 BACs combined showed that even the most homeologous BACs with upwards of 95% sequence identity resolve into their respective homeologous sequences. Potential assembly errors were generated by tandemly duplicated pentatricopeptide repeat containing genes and long simple sequence repeats. Analysis of a whole-genome shotgun assembly of 80,000 randomly chosen JGI-DOE sequence traces reveals some new soybean-specific repeat sequences. CONCLUSION: This analysis investigated both the structure of the paleopolyploid soybean genome and the potential effects retained homeology will have on assembling the whole genome shotgun sequence. Based upon these results, homeologous regions similar to those characterized here will not cause major assembly issues.


Asunto(s)
Genes Duplicados/genética , Genoma de Planta/genética , Glycine max/genética , Mapeo Físico de Cromosoma/métodos , Poliploidía , Secuencias Repetitivas de Ácidos Nucleicos , Análisis de Secuencia de ADN/métodos , Secuencia de Bases/genética , Cromosomas Artificiales Bacterianos/genética , Cromosomas de las Plantas/genética , Evolución Molecular , Marcadores Genéticos , Repeticiones de Microsatélite , Filogenia , Polimorfismo Genético/genética , Programas Informáticos , Especificidad de la Especie , Sintenía/genética
17.
BMC Genomics ; 7: 204, 2006 Aug 09.
Artículo en Inglés | MEDLINE | ID: mdl-16899135

RESUMEN

BACKGROUND: Large scale gene analysis of most organisms is hampered by incomplete genomic sequences. In many organisms, such as soybean, the best source of sequence information is the existence of expressed sequence tag (EST) libraries. Soybean has a large (1115 Mbp) genome that has yet to be fully sequenced. However it does have the 6th largest EST collection comprised of ESTs from a variety of soybean genotypes. Many EST libraries were constructed from RNA extracted from various genetic backgrounds, thus gene identification from these sources is complicated by the existence of both gene and allele sequence differences. We used the ESTminer suite of programs to identify potential soybean gene transcripts from a single genetic background allowing us to observe functional classifications between gene families as well as structural differences between genes and gene paralogs within families. The identification of potential gene sequences (pHaps) from soybean allows us to begin to get a picture of the genomic history of the organism as well as begin to observe the evolutionary fates of gene copies in this highly duplicated genome. RESULTS: We identified approximately 45,000 potential gene sequences (pHaps) from EST sequences of Williams/Williams82, an inbred genotype of soybean (Glycine max L. Merr.) using a redundancy criterion to identify reproducible sequence differences between related genes within gene families. Analysis of these sequences revealed single base substitutions and single base indels are the most frequently observed form of sequence variation between genes within families in the dataset. Genomic sequencing of selected loci indicate that intron-like intervening sequences are numerous and are approximately 220 bp in length. Functional annotation of gene sequences indicate functional classifications are not randomly distributed among gene families containing few or many genes. CONCLUSION: The predominance of single nucleotide insertion/deletions and substitution events between genes within families (individual genes and gene paralogs) is consistent with a model of gene amplification followed by single base random mutational events expected under the classical model of duplicated gene evolution. Molecular functions of small and large gene families appear to be non-randomly distributed possibly indicating a difference in retention of duplicates or local expansion.


Asunto(s)
Etiquetas de Secuencia Expresada , Genoma de Planta , Glycine max/genética , Familia de Multigenes , Evolución Molecular , Duplicación de Gen , Variación Genética , Análisis de Secuencia de ADN , Análisis de Secuencia de ARN
18.
Plant Methods ; 11: 10, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25774204

RESUMEN

BACKGROUND: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework. RESULTS: We developed a workflow to curate and standardize existing phenotype datasets for six plant species, encompassing both model species and crop plants with established genetic resources. Our effort focused on mutant phenotypes associated with genes of known sequence in Arabidopsis thaliana (L.) Heynh. (Arabidopsis), Zea mays L. subsp. mays (maize), Medicago truncatula Gaertn. (barrel medic or Medicago), Oryza sativa L. (rice), Glycine max (L.) Merr. (soybean), and Solanum lycopersicum L. (tomato). We applied the same ontologies, annotation standards, formats, and best practices across all six species, thereby ensuring that the shared dataset could be used for cross-species querying and semantic similarity analyses. Curated phenotypes were first converted into a common format using taxonomically broad ontologies such as the Plant Ontology, Gene Ontology, and Phenotype and Trait Ontology. We then compared ontology-based phenotypic descriptions with an existing classification system for plant phenotypes and evaluated our semantic similarity dataset for its ability to enhance predictions of gene families, protein functions, and shared metabolic pathways that underlie informative plant phenotypes. CONCLUSIONS: The use of ontologies, annotation standards, shared formats, and best practices for cross-taxon phenotype data analyses represents a novel approach to plant phenomics that enhances the utility of model genetic organisms and can be readily applied to species with fewer genetic resources and less well-characterized genomes. In addition, these tools should enhance future efforts to explore the relationships among phenotypic similarity, gene function, and sequence similarity in plants, and to make genotype-to-phenotype predictions relevant to plant biology, crop improvement, and potentially even human health.

19.
J Wildl Dis ; 38(2): 463-7, 2002 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-12038150

RESUMEN

Members of the genus Basidiobolus, a saprophytic fungus, have been associated with the digestive tracts of a wide variety of amphibians and reptiles. To elucidate the relationship of Basidiobolus sp. with amphibians in central Florida (USA), we document the occurrence of the fungus in the digestive tracts of Bufo terrestris, Buffo quercicus, Hyla femoralis, Hyla cinerea, Hyla gratiosa, Hyla squirella, Osteopilus septentrionalis, and Rana utricularia. Species that occupy terrestrial habitats (B. terrestris, B. quercicus, and R. utricularis) were found to harbor Basidiobolus spp. more frequently (83, 78, and 91%, respectively) than those that occupied a more arboreal habitat (H. cinerea, H. squirella, H. femoralis, H. gratiosa, and O. septentrionalis (50, 56, 55, 56, and 70%, respectively).


Asunto(s)
Anuros , Entomophthorales/aislamiento & purificación , Heces/microbiología , Cigomicosis/veterinaria , Animales , Entomophthorales/patogenicidad , Florida/epidemiología , Virulencia , Cigomicosis/epidemiología , Cigomicosis/microbiología
20.
Funct Plant Biol ; 40(10): 1029-1047, 2013 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-32481171

RESUMEN

Rpp4 (Resistance to Phakopsora pachyrhizi 4) confers resistance to Phakopsora pachyrhizi Sydow, the causal agent of Asian soybean rust (ASR). By combining expression profiling and virus induced gene silencing (VIGS), we are developing a genetic framework for Rpp4-mediated resistance. We measured gene expression in mock-inoculated and P. pachyrhizi-infected leaves of resistant soybean accession PI459025B (Rpp4) and the susceptible cultivar (Williams 82) across a 12-day time course. Unexpectedly, two biphasic responses were identified. In the incompatible reaction, genes induced at 12h after infection (hai) were not differentially expressed at 24 hai, but were induced at 72 hai. In contrast, genes repressed at 12 hai were not differentially expressed from 24 to 144 hai, but were repressed 216 hai and later. To differentiate between basal and resistance-gene (R-gene) mediated defence responses, we compared gene expression in Rpp4-silenced and empty vector-treated PI459025B plants 14 days after infection (dai) with P. pachyrhizi. This identified genes, including transcription factors, whose differential expression is dependent upon Rpp4. To identify differentially expressed genes conserved across multiple P. pachyrhizi resistance pathways, Rpp4 expression datasets were compared with microarray data previously generated for Rpp2 and Rpp3-mediated defence responses. Fourteen transcription factors common to all resistant and susceptible responses were identified, as well as fourteen transcription factors unique to R-gene-mediated resistance responses. These genes are targets for future P. pachyrhizi resistance research.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA