Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
Mais filtros












Base de dados
Intervalo de ano de publicação
1.
Methods Mol Biol ; 1374: 187-202, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-26519406

RESUMO

MaizeGDB is the community database for biological information about the crop plant Zea mays. Genomic, genetic, sequence, gene product, functional characterization, literature reference, and person/organization contact information are among the datatypes stored at MaizeGDB. At the project's website ( http://www.maizegdb.org ) are custom interfaces enabling researchers to browse data and to seek out specific information matching explicit search criteria. In addition, pre-compiled reports are made available for particular types of data and bulletin boards are provided to facilitate communication and coordination among members of the community of maize geneticists.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Genômica/métodos , Zea mays/genética , Navegador
2.
Plant Methods ; 11: 10, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25774204

RESUMO

BACKGROUND: Plant phenotype datasets include many different types of data, formats, and terms from specialized vocabularies. Because these datasets were designed for different audiences, they frequently contain language and details tailored to investigators with different research objectives and backgrounds. Although phenotype comparisons across datasets have long been possible on a small scale, comprehensive queries and analyses that span a broad set of reference species, research disciplines, and knowledge domains continue to be severely limited by the absence of a common semantic framework. RESULTS: We developed a workflow to curate and standardize existing phenotype datasets for six plant species, encompassing both model species and crop plants with established genetic resources. Our effort focused on mutant phenotypes associated with genes of known sequence in Arabidopsis thaliana (L.) Heynh. (Arabidopsis), Zea mays L. subsp. mays (maize), Medicago truncatula Gaertn. (barrel medic or Medicago), Oryza sativa L. (rice), Glycine max (L.) Merr. (soybean), and Solanum lycopersicum L. (tomato). We applied the same ontologies, annotation standards, formats, and best practices across all six species, thereby ensuring that the shared dataset could be used for cross-species querying and semantic similarity analyses. Curated phenotypes were first converted into a common format using taxonomically broad ontologies such as the Plant Ontology, Gene Ontology, and Phenotype and Trait Ontology. We then compared ontology-based phenotypic descriptions with an existing classification system for plant phenotypes and evaluated our semantic similarity dataset for its ability to enhance predictions of gene families, protein functions, and shared metabolic pathways that underlie informative plant phenotypes. CONCLUSIONS: The use of ontologies, annotation standards, shared formats, and best practices for cross-taxon phenotype data analyses represents a novel approach to plant phenomics that enhances the utility of model genetic organisms and can be readily applied to species with fewer genetic resources and less well-characterized genomes. In addition, these tools should enhance future efforts to explore the relationships among phenotypic similarity, gene function, and sequence similarity in plants, and to make genotype-to-phenotype predictions relevant to plant biology, crop improvement, and potentially even human health.

3.
PLoS Biol ; 13(1): e1002033, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25562316

RESUMO

Despite a large and multifaceted effort to understand the vast landscape of phenotypic data, their current form inhibits productive data analysis. The lack of a community-wide, consensus-based, human- and machine-interpretable language for describing phenotypes and their genomic and environmental contexts is perhaps the most pressing scientific bottleneck to integration across many key fields in biology, including genomics, systems biology, development, medicine, evolution, ecology, and systematics. Here we survey the current phenomics landscape, including data resources and handling, and the progress that has been made to accurately capture relevant data descriptions for phenotypes. We present an example of the kind of integration across domains that computable phenotypes would enable, and we call upon the broader biology community, publishers, and relevant funding agencies to support efforts to surmount today's data barriers and facilitate analytical reproducibility.


Assuntos
Estudos de Associação Genética , Animais , Biologia Computacional , Curadoria de Dados , Bases de Dados Factuais/normas , Interação Gene-Ambiente , Genômica , Humanos , Fenótipo , Padrões de Referência , Reprodutibilidade dos Testes , Terminologia como Assunto
4.
Plant Physiol ; 167(1): 25-39, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25384563

RESUMO

The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes.


Assuntos
Genes de Plantas/genética , Genoma de Planta/genética , Anotação de Sequência Molecular/métodos , Zea mays/genética , Bases de Dados Genéticas/normas , Éxons/genética , Íntrons/genética , Modelos Genéticos , Anotação de Sequência Molecular/normas , Pseudogenes/genética , Controle de Qualidade , RNA não Traduzido/genética
5.
J Genet Genomics ; 41(12): 627-47, 2014 Dec 20.
Artigo em Inglês | MEDLINE | ID: mdl-25527104

RESUMO

The G-quadruplex (G4) elements comprise a class of nucleic acid structures formed by stacking of guanine base quartets in a quadruple helix. This G4 DNA can form within or across single-stranded DNA molecules and is mutually exclusive with duplex B-form DNA. The reversibility and structural diversity of G4s make them highly versatile genetic structures, as demonstrated by their roles in various functions including telomere metabolism, genome maintenance, immunoglobulin gene diversification, transcription, and translation. Sequence motifs capable of forming G4 DNA are typically located in telomere repeat DNA and other non-telomeric genomic loci. To investigate their potential roles in a large-genome model plant species, we computationally identified 149,988 non-telomeric G4 motifs in maize (Zea mays L., B73 AGPv2), 29% of which were in non-repetitive genomic regions. G4 motif hotspots exhibited non-random enrichment in genes at two locations on the antisense strand, one in the 5' UTR and the other at the 5' end of the first intron. Several genic G4 motifs were shown to adopt sequence-specific and potassium-dependent G4 DNA structures in vitro. The G4 motifs were prevalent in key regulatory genes associated with hypoxia (group VII ERFs), oxidative stress (DJ-1/GATase1), and energy status (AMPK/SnRK) pathways. They also showed statistical enrichment for genes in metabolic pathways that function in glycolysis, sugar degradation, inositol metabolism, and base excision repair. Collectively, the maize G4 motifs may represent conditional regulatory elements that can aid in energy status gene responses. Such a network of elements could provide a mechanistic basis for linking energy status signals to gene regulation in maize, a model genetic system and major world crop species for feed, food, and fuel.


Assuntos
DNA de Plantas/genética , Quadruplex G , Genes de Plantas/genética , Genoma de Planta/genética , Zea mays/genética , Regiões 3' não Traduzidas/genética , Metabolismo dos Carboidratos/genética , Dicroísmo Circular , DNA de Plantas/química , Metabolismo Energético/genética , Regulação da Expressão Gênica de Plantas , Redes e Vias Metabólicas/genética , Modelos Genéticos , Consumo de Oxigênio/genética , Zea mays/metabolismo
6.
Plant Physiol ; 164(2): 513-24, 2014 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-24306534

RESUMO

We have optimized and extended the widely used annotation engine MAKER in order to better support plant genome annotation efforts. New features include better parallelization for large repeat-rich plant genomes, noncoding RNA annotation capabilities, and support for pseudogene identification. We have benchmarked the resulting software tool kit, MAKER-P, using the Arabidopsis (Arabidopsis thaliana) and maize (Zea mays) genomes. Here, we demonstrate the ability of the MAKER-P tool kit to automatically update, extend, and revise the Arabidopsis annotations in light of newly available data and to annotate pseudogenes and noncoding RNAs absent from The Arabidopsis Informatics Resource 10 build. Our results demonstrate that MAKER-P can be used to manage and improve the annotations of even Arabidopsis, perhaps the best-annotated plant genome. We have also installed and benchmarked MAKER-P on the Texas Advanced Computing Center. We show that this public resource can de novo annotate the entire Arabidopsis and maize genomes in less than 3 h and produce annotations of comparable quality to those of the current The Arabidopsis Information Resource 10 and maize V2 annotation builds.


Assuntos
Arabidopsis/genética , Biologia Computacional/métodos , Genoma de Planta/genética , Anotação de Sequência Molecular/métodos , Software , Zea mays/genética , Processamento Alternativo/genética , Éxons/genética , Genes de Plantas/genética , Pseudogenes/genética , Sequências Repetitivas de Ácido Nucleico/genética , Reprodutibilidade dos Testes
7.
Chromosoma ; 122(1-2): 67-75, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23223973

RESUMO

Knobs are conspicuous heterochromatic regions found on the chromosomes of maize and its relatives. The number, locations, and sizes of knobs vary dramatically, with most lines containing between four and eight knobs in mid-arm positions. Prior data suggest that some knobs may reduce recombination. However, comprehensive tests have not been carried out, primarily because most knobs have not been placed on the genetic map. We used fluorescent in situ hybridization and two recombinant inbred populations to map seven knobs and to accurately place three knobs from the B73 inbred on the genomic sequence assembly. The data show that knobs lie in gene-dense regions of the maize genome. Comparisons to 23 other recombinant inbred populations segregating for knobs at the same sites confirm that large knobs can locally reduce crossing over by as much as twofold on a cM/Mb scale. These effects do not extend beyond regions ~10 cM to either side of knobs and do not appear to affect linkage disequilibrium among genes within and near knob repeat regions of the B73 RefGen_v2 assembly.


Assuntos
Cromossomos de Plantas/genética , Heterocromatina/genética , Recombinação Genética , DNA de Plantas , Hibridização in Situ Fluorescente , Zea mays
8.
Database (Oxford) ; 2011: bar022, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21624896

RESUMO

First released in 1991 with the name MaizeDB, the Maize Genetics and Genomics Database, now MaizeGDB, celebrates its 20th anniversary this year. MaizeGDB has transitioned from a focus on comprehensive curation of the literature, genetic maps and stocks to a paradigm that accommodates the recent release of a reference maize genome sequence, multiple diverse maize genomes and sequence-based gene expression data sets. The MaizeGDB Team is relatively small, and relies heavily on the research community to provide data, nomenclature standards and most importantly, to recommend future directions, priorities and strategies. Key aspects of MaizeGDB's intimate interaction with the community are the co-location of curators with maize research groups in multiple locations across the USA as well as coordination with MaizeGDB's close partner, the Maize Genetics Cooperation--Stock Center. In this report, we describe how the MaizeGDB Team currently interacts with the maize research community and our plan for future interactions that will support updates to the functional and structural annotation of the B73 reference genome.


Assuntos
Bases de Dados Genéticas , Genômica , Anotação de Sequência Molecular , Zea mays/genética , Genoma de Planta/genética
9.
Database (Oxford) ; 2011: bar012, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21558151

RESUMO

Model Organism Databases, including the various plant genome databases, collect and enable access to massive amounts of heterogeneous information, including sequence data, gene product information, images of mutant phenotypes, etc, as well as textual descriptions of many of these entities. While a variety of basic browsing and search capabilities are available to allow researchers to query and peruse the names and attributes of phenotypic data, next-generation search mechanisms that allow querying and ranking of text descriptions are much less common. In addition, the plant community needs an innovative way to leverage the existing links in these databases to search groups of text descriptions simultaneously. Furthermore, though much time and effort have been afforded to the development of plant-related ontologies, the knowledge embedded in these ontologies remains largely unused in available plant search mechanisms. Addressing these issues, we have developed a unique search engine for mutant phenotypes from MaizeGDB. This advanced search mechanism integrates various text description sources in MaizeGDB to aid a user in retrieving desired mutant phenotype information. Currently, descriptions of mutant phenotypes, loci and gene products are utilized collectively for each search, though expansion of the search mechanism to include other sources is straightforward. The retrieval engine, to our knowledge, is the first engine to exploit the content and structure of available domain ontologies, currently the Plant and Gene Ontologies, to expand and enrich retrieval results in major plant genomic databases. Database URL: http:www.PhenomicsWorld.org/QBTA.php.


Assuntos
Biologia Computacional/métodos , Armazenamento e Recuperação da Informação , Mutação/genética , Ferramenta de Busca , Zea mays/genética , Bases de Dados Genéticas , Fenótipo , Interface Usuário-Computador
10.
Database (Oxford) ; 2011: bar016, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-21565781

RESUMO

Video tutorials are an effective way for researchers to quickly learn how to use online tools offered by biological databases. At MaizeGDB, we have developed a number of video tutorials that demonstrate how to use various tools and explicitly outline the caveats researchers should know to interpret the information available to them. One such popular video currently available is 'Using the MaizeGDB Genome Browser', which describes how the maize genome was sequenced and assembled as well as how the sequence can be visualized and interacted with via the MaizeGDB Genome Browser. Database


Assuntos
Biologia , Bases de Dados Genéticas , Tecnologia Educacional , Genoma de Planta/genética , Internet , Pesquisadores , Gravação de Videoteipe , Zea mays/genética , Relações Comunidade-Instituição
11.
Int J Plant Genomics ; 2011: 923035, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-22253616

RESUMO

The purpose of the online resource presented here, POPcorn (Project Portal for corn), is to enhance accessibility of maize genetic and genomic resources for plant biologists. Currently, many online locations are difficult to find, some are best searched independently, and individual project websites often degrade over time-sometimes disappearing entirely. The POPcorn site makes available (1) a centralized, web-accessible resource to search and browse descriptions of ongoing maize genomics projects, (2) a single, stand-alone tool that uses web Services and minimal data warehousing to search for sequence matches in online resources of diverse offsite projects, and (3) a set of tools that enables researchers to migrate their data to the long-term model organism database for maize genetic and genomic information: MaizeGDB. Examples demonstrating POPcorn's utility are provided herein.

12.
Database (Oxford) ; 2010: baq007, 2010 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-20627860

RESUMO

As the B73 maize genome sequencing project neared completion, MaizeGDB began to integrate a graphical genome browser with its existing web interface and database. To ensure that maize researchers would optimally benefit from the potential addition of a genome browser to the existing MaizeGDB resource, personnel at MaizeGDB surveyed researchers' needs. Collected data indicate that existing genome browsers for maize were inadequate and suggest implementation of a browser with quick interface and intuitive tools would meet most researchers' needs. Here, we document the survey's outcomes, review functionalities of available genome browser software platforms and offer our rationale for choosing the GBrowse software suite for MaizeGDB. Because the genome as represented within the MaizeGDB Genome Browser is tied to detailed phenotypic data, molecular marker information, available stocks, etc., the MaizeGDB Genome Browser represents a novel mechanism by which the researchers can leverage maize sequence information toward crop improvement directly. Database URL: http://gbrowse.maizegdb.org/


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Zea mays/genética , Marcadores Genéticos , Internet , Modelos Genéticos , Fenótipo , Software , Interface Usuário-Computador
13.
Bioinformatics ; 26(3): 434-6, 2010 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-20124413

RESUMO

SUMMARY: Methods to automatically integrate sequence information with physical and genetic maps are scarce. The Locus Lookup tool enables researchers to define windows of genomic sequence likely to contain loci of interest where only genetic or physical mapping associations are reported. Using the Locus Lookup tool, researchers will be able to locate specific genes more efficiently that will ultimately help them develop a better maize plant. With the availability of the well-documented source code, the tool can be easily adapted to other biological systems. AVAILABILITY: The Locus Lookup tool is available on the web at http://maizegdb.org/cgi-bin/locus_lookup.cgi. It is implemented in PHP, Oracle and Apache, with all major browsers supported. Source code is freely available for download at http://ftp.maizegdb.org/open_source/locus_lookup/.


Assuntos
Biologia Computacional/métodos , Genoma de Planta , Software , Zea mays/genética , Bases de Dados Genéticas , Internet , Análise de Sequência de DNA , Interface Usuário-Computador
14.
Artigo em Inglês | MEDLINE | ID: mdl-20150665

RESUMO

Many in silico investigations in bioinformatics require access to multiple, distributed data sources and analytic tools. The requisite data sources may include large public data repositories, community databases, and project databases for use in domain-specific research. Different data sources frequently utilize distinct query languages and return results in unique formats, and therefore researchers must either rely upon a small number of primary data sources or become familiar with multiple query languages and formats. Similarly, the associated analytic tools often require specific input formats and produce unique outputs which make it difficult to utilize the output from one tool as input to another. The BioExtract Server (http://bioextract.org) is a Web-based data integration application designed to consolidate, analyze, and serve data from heterogeneous biomolecular databases in the form of a mash-up. The basic operations of the BioExtract Server allow researchers, via their Web browsers, to specify data sources, flexibly query data sources, apply analytic tools, download result sets, and store query results for later reuse. As a researcher works with the system, their "steps" are saved in the background. At any time, these steps can be preserved long-term as a workflow simply by providing a workflow name and description.


Assuntos
Biopolímeros/química , Mineração de Dados/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Disseminação de Informação/métodos , Internet , Software , Biopolímeros/classificação , Biopolímeros/fisiologia , Biologia Computacional/métodos , Fluxo de Trabalho
15.
Plant J ; 58(5): 883-92, 2009 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-19207214

RESUMO

Insertional mutagenesis is a cornerstone of functional genomics. High-copy transposable element systems such as Mutator (Mu) in maize (Zea mays) afford the advantage of high forward mutation rates but pose a challenge for identifying the particular element responsible for a given mutation. Several large mutant collections have been generated in Mu-active genetic stocks, but current methods limit the ability to rapidly identify the causal Mu insertions. Here we present a method to rapidly assay Mu insertions that are genetically linked to a mutation of interest. The method combines elements of MuTAIL (thermal asymmetrically interlaced) and amplification of insertion mutagenized sites (AIMS) protocols and is applicable to the analysis of single mutants or to high-throughput analyses of mutant collections. Briefly, genomic DNA is digested with a restriction enzyme and adapters are ligated. Polymerase chain reaction is performed with TAIL cycling parameters, using a fluorescently labeled Mu primer, which results in the preferential amplification and labeling of Mu-containing genomic fragments. Products from a segregating line are analyzed on a capillary sequencer. To recover a fragment of interest, PCR products are cloned and sequenced. Sequences with lengths matching the size of a band that co-segregates with the mutant phenotype represent candidate linked insertion sites, which are then confirmed by PCR. We demonstrate the utility of the method by identifying Mu insertion sites linked to seed-lethal mutations with a preliminary success rate of nearly 50%.


Assuntos
Análise Mutacional de DNA/métodos , Elementos de DNA Transponíveis , Genoma de Planta , Zea mays/genética , DNA de Plantas/genética , Ligação Genética , Mutagênese Insercional , Fenótipo , Reação em Cadeia da Polimerase , Software
16.
Database (Oxford) ; 2009: bap020, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-21847242

RESUMO

MaizeGDB is the maize research community's central repository for genetic and genomic information about the crop plant and research model Zea mays ssp. mays. The MaizeGDB team endeavors to meet research needs as they evolve based on researcher feedback and guidance. Recent work has focused on better integrating existing data with sequence information as it becomes available for the B73, Mo17 and Palomero Toluqueño genomes. Major endeavors along these lines include the implementation of a genome browser to graphically represent genome sequences; implementation of POPcorn, a portal ancillary to MaizeGDB that offers access to independent maize projects and will allow BLAST similarity searches of participating projects' data sets from a single point; and a joint MaizeGDB/PlantGDB project to involve the maize community in genome annotation. In addition to summarizing recent achievements and future plans, this article also discusses specific examples of community involvement in setting priorities and design aspects of MaizeGDB, which should be of interest to other database and resource providers seeking to better engage their users. MaizeGDB is accessible online at http://www.maizegdb.org.Database URL:http://www.maizegdb.org.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Genoma de Planta , Zea mays/genética , Internet , Mutação , Fenótipo , Alinhamento de Sequência , Análise de Sequência de DNA , Interface Usuário-Computador
17.
Int J Plant Genomics ; 2008: 496957, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18769488

RESUMO

In 2001 maize became the number one production crop in the world with the Food and Agriculture Organization of the United Nations reporting over 614 million tonnes produced. Its success is due to the high productivity per acre in tandem with a wide variety of commercial uses. Not only is maize an excellent source of food, feed, and fuel, but also its by-products are used in the production of various commercial products. Maize's unparalleled success in agriculture stems from basic research, the outcomes of which drive breeding and product development. In order for basic, translational, and applied researchers to benefit from others' investigations, newly generated data must be made freely and easily accessible. MaizeGDB is the maize research community's central repository for genetics and genomics information. The overall goals of MaizeGDB are to facilitate access to the outcomes of maize research by integrating new maize data into the database and to support the maize research community by coordinating group activities.

18.
Nucleic Acids Res ; 36(Database issue): D959-65, 2008 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-18063570

RESUMO

PlantGDB (http://www.plantgdb.org/) is a genomics database encompassing sequence data for green plants (Viridiplantae). PlantGDB provides annotated transcript assemblies for >100 plant species, with transcripts mapped to their cognate genomic context where available, integrated with a variety of sequence analysis tools and web services. For 14 plant species with emerging or complete genome sequence, PlantGDB's genome browsers (xGDB) serve as a graphical interface for viewing, evaluating and annotating transcript and protein alignments to chromosome or bacterial artificial chromosome (BAC)-based genome assemblies. Annotation is facilitated by the integrated yrGATE module for community curation of gene models. Novel web services at PlantGDB include Tracembler, an iterative alignment tool that generates contigs from GenBank trace file data and BioExtract Server, a web-based server for executing custom sequence analysis workflows. PlantGDB also hosts a plant genomics research outreach portal (PGROP) that facilitates access to a large number of resources for research and training.


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Genes de Plantas , Genômica , Internet , Proteínas de Plantas/química , Proteínas de Plantas/genética , RNA Mensageiro/química , Alinhamento de Sequência , Software , Interface Usuário-Computador
19.
Int J Comput Biol Drug Des ; 1(3): 302-12, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-20054995

RESUMO

Computational workflows in bioinformatics are becoming increasingly important in the achievement of scientific advances. These workflows typically require the integrated use of multiple, distributed data sources and analytic tools. The BioExtract Server (http://bioextract.org) is a distributed service designed to provide researchers with the web ability to query multiple data sources, save results as searchable data sets, and execute analytic tools. As the researcher works with the system, their tasks are saved in the background. At any time these steps can be saved as a workflow that can then be executed again and/or modified later.


Assuntos
Biologia Computacional/métodos , Redes de Comunicação de Computadores , Biologia Computacional/estatística & dados numéricos , Sistemas Computacionais , Bases de Dados Genéticas , Internet , Design de Software
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...