Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
Bioinformatics ; 40(9)2024 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-39180716

RESUMO

SUMMARY: Identification of allelic or corresponding genes (pan-genes) within a species or genus is important for discovery of biologically significant genetic conservation and variation. Similarly, identification of orthologs (gene families) across wider evolutionary distances is important for understanding the genetic basis for similar or differing traits. Especially in plants, several complications make identification of pan-genes and gene families challenging, including whole-genome duplications, evolutionary rate differences among lineages, and varying qualities of assemblies and annotations. Here, we document and distribute a set of workflows that we have used to address these problems. RESULTS: Pandagma is a set of configurable workflows for identifying and comparing pan-gene sets and gene families for annotation sets from eukaryotic genomes, using a combination of homology, synteny, and expected rates of synonymous change in coding sequence. AVAILABILITY AND IMPLEMENTATION: The Pandagma workflows, example configurations, implementation details, and scripts for retrieving public datasets, are available at https://github.com/legumeinfo/pandagma.


Assuntos
Evolução Molecular , Duplicação Gênica , Software , Família Multigênica , Genômica/métodos , Anotação de Sequência Molecular/métodos
2.
Nucleic Acids Res ; 49(D1): D1496-D1501, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33264401

RESUMO

SoyBase, a USDA genetic and genomics database, holds professionally curated soybean genetic and genomic data, which is integrated and made accessible to researchers and breeders. The site holds several reference genome assemblies, as well as genetic maps, thousands of mapped traits, expression and epigenetic data, pedigree information, and extensive variant and genotyping data sets. SoyBase displays include genetic, genomic, and epigenetic maps of the soybean genome. Gene expression data is presented in the genome viewer as heat maps and pictorial and tabular displays in gene report pages. Millions of sequence variants have been added, representing variations across various collections of cultivars. This variant data is explorable using new interactive tools to visualize the distribution of those variants across the genome, between selected accessions. SoyBase holds several reference-quality soybean genome assemblies, accessible via various query tools and browsers, including a new visualization system for exploring the soybean pan-genome. SoyBase also serves as a nexus of announcements pertinent to the greater soybean research community. The database also includes a soybean-specific anatomic and biochemical trait ontology. The database can be accessed at https://soybase.org.


Assuntos
Bases de Dados Genéticas , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Genótipo , Glycine max/genética , Proteínas de Plantas/genética , Mapeamento Cromossômico , Produtos Agrícolas , Epigênese Genética , Estudos de Associação Genética , Internet , Anotação de Sequência Molecular , Filogenia , Melhoramento Vegetal/métodos , Proteínas de Plantas/metabolismo , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Característica Quantitativa Herdável , Padrões de Referência , Software , Glycine max/classificação , Glycine max/metabolismo , Estados Unidos , United States Department of Agriculture
3.
PLoS Comput Biol ; 14(12): e1006472, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30589835

RESUMO

As sequencing prices drop, genomic data accumulates-seemingly at a steadily increasing pace. Most genomic data potentially have value beyond the initial purpose-but only if shared with the scientific community. This, of course, is often easier said than done. Some of the challenges in sharing genomic data include data volume (raw file sizes and number of files), complexities, formats, nomenclatures, metadata descriptions, and the choice of a repository. In this paper, we describe 10 quick tips for sharing open genomic data.


Assuntos
Bases de Dados Genéticas/tendências , Disseminação de Informação/métodos , Armazenamento e Recuperação da Informação/métodos , Bases de Dados Factuais/estatística & dados numéricos , Bases de Dados Factuais/tendências , Bases de Dados Genéticas/estatística & dados numéricos , Genômica , Software , Interface Usuário-Computador
4.
Nucleic Acids Res ; 44(D1): D1181-8, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26546515

RESUMO

Legume Information System (LIS), at http://legumeinfo.org, is a genomic data portal (GDP) for the legume family. LIS provides access to genetic and genomic information for major crop and model legumes. With more than two-dozen domesticated legume species, there are numerous specialists working on particular species, and also numerous GDPs for these species. LIS has been redesigned in the last three years both to better integrate data sets across the crop and model legumes, and to better accommodate specialized GDPs that serve particular legume species. To integrate data sets, LIS provides genome and map viewers, holds synteny mappings among all sequenced legume species and provides a set of gene families to allow traversal among orthologous and paralogous sequences across the legumes. To better accommodate other specialized GDPs, LIS uses open-source GMOD components where possible, and advocates use of common data templates, formats, schemas and interfaces so that data collected by one legume research community are accessible across all legume GDPs, through similar interfaces and using common APIs. This federated model for the legumes is managed as part of the 'Legume Federation' project (accessible via http://legumefederation.org), which can be thought of as an umbrella project encompassing LIS and other legume GDPs.


Assuntos
Bases de Dados Genéticas , Fabaceae/genética , Fabaceae/classificação , Genoma de Planta , Genômica , Internet , Família Multigênica , Filogenia , Proteínas de Plantas/química , Proteínas de Plantas/genética , Estrutura Terciária de Proteína , Locos de Características Quantitativas , Sintenia
5.
BMC Genomics ; 15: 950, 2014 Nov 03.
Artigo em Inglês | MEDLINE | ID: mdl-25362847

RESUMO

BACKGROUND: The homeodomain leucine zipper (HD-Zip) transcription factor family is one of the largest plant specific superfamilies, and includes genes with roles in modulation of plant growth and response to environmental stresses. Many HD-Zip genes are characterized in Arabidopsis (Arabidopsis thaliana), and members of the family are being investigated for abiotic stress responses in rice (Oryza sativa), maize (Zea mays), poplar (Populus trichocarpa) and cucumber (Cucmis sativus). Findings in these species suggest HD-Zip genes as high priority candidates for crop improvement. RESULTS: In this study we have identified members of the HD-Zip gene family in soybean cv. 'Williams 82', and characterized their expression under dehydration and salt stress. Homology searches with BLASTP and Hidden Markov Model guided sequence alignments identified 101 HD-Zip genes in the soybean genome. Phylogeny reconstruction coupled with domain and gene structure analyses using soybean, Arabidopsis, rice, grape (Vitis vinifera), and Medicago truncatula homologues enabled placement of these sequences into four previously described subfamilies. Of the 101 HD-Zip genes identified in soybean, 88 exist as whole-genome duplication-derived gene pairs, indicating high retention of these genes following polyploidy in Glycine ~13 Mya. The HD-Zip genes exhibit ubiquitous expression patterns across 24 conditions that include 17 tissues of soybean. An RNA-Seq experiment performed to study differential gene expression at 0, 1, 6 and 12 hr soybean roots under dehydration and salt stress identified 20 differentially expressed (DE) genes. Several of these DE genes are orthologs of genes previously reported to play a role under abiotic stress, implying conservation of HD-Zip gene functions across species. Screening of HD-Zip promoters identified transcription factor binding sites that are overrepresented in the DE genes under both dehydration and salt stress, providing further support for the role of HD-Zip genes in abiotic stress responses. CONCLUSIONS: We provide a thorough description of soybean HD-Zip genes, and identify potential candidates with probable roles in dehydration and salt stress. Expression profiles generated for all soybean genes, under dehydration and salt stress, at four time points, will serve as an important resource for the soybean research community, and will aid in understanding plant responses to abiotic stress.


Assuntos
Desidratação/genética , Perfilação da Expressão Gênica , Glycine max/genética , Glycine max/metabolismo , Proteínas de Homeodomínio/genética , Zíper de Leucina/genética , Tolerância ao Sal/genética , Fatores de Transcrição/genética , Sítios de Ligação , Mapeamento Cromossômico , Análise por Conglomerados , Biologia Computacional/métodos , Sequência Conservada , Dosagem de Genes , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/classificação , Anotação de Sequência Molecular , Família Multigênica , Motivos de Nucleotídeos , Especificidade de Órgãos/genética , Filogenia , Regiões Promotoras Genéticas , Domínios e Motivos de Interação entre Proteínas , Estresse Fisiológico , Fatores de Transcrição/química , Fatores de Transcrição/classificação
6.
Methods Mol Biol ; 2443: 81-100, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35037201

RESUMO

In this chapter, we introduce the main components of the Legume Information System ( https://legumeinfo.org ) and several associated resources. Additionally, we provide an example of their use by exploring a biological question: is there a common molecular basis, across legume species, that underlies the photoperiod-mediated transition from vegetative to reproductive development, that is, days to flowering? The Legume Information System (LIS) holds genetic and genomic data for a large number of crop and model legumes and provides a set of online bioinformatic tools designed to help biologists address questions and tasks related to legume biology. Such tasks include identifying the molecular basis of agronomic traits; identifying orthologs/syntelogs for known genes; determining gene expression patterns; accessing genomic datasets; identifying markers for breeding work; and identifying genetic similarities and differences among selected accessions. LIS integrates with other legume-focused informatics resources such as SoyBase ( https://soybase.org ), PeanutBase ( https://peanutbase.org ), and projects of the Legume Federation ( https://legumefederation.org ).


Assuntos
Fabaceae , Bases de Dados Genéticas , Fabaceae/genética , Genoma de Planta , Genômica , Melhoramento Vegetal
7.
Nat Genet ; 51(5): 877-884, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-31043755

RESUMO

Like many other crops, the cultivated peanut (Arachis hypogaea L.) is of hybrid origin and has a polyploid genome that contains essentially complete sets of chromosomes from two ancestral species. Here we report the genome sequence of peanut and show that after its polyploid origin, the genome has evolved through mobile-element activity, deletions and by the flow of genetic information between corresponding ancestral chromosomes (that is, homeologous recombination). Uniformity of patterns of homeologous recombination at the ends of chromosomes favors a single origin for cultivated peanut and its wild counterpart A. monticola. However, through much of the genome, homeologous recombination has created diversity. Using new polyploid hybrids made from the ancestral species, we show how this can generate phenotypic changes such as spontaneous changes in the color of the flowers. We suggest that diversity generated by these genetic mechanisms helped to favor the domestication of the polyploid A. hypogaea over other diploid Arachis species cultivated by humans.


Assuntos
Arachis/genética , Arachis/classificação , Argentina , Cromossomos de Plantas/genética , Produtos Agrícolas/genética , Metilação de DNA , DNA de Plantas/genética , Domesticação , Evolução Molecular , Regulação da Expressão Gênica de Plantas , Variação Genética , Genoma de Planta , Hibridização Genética , Fenótipo , Poliploidia , Recombinação Genética , Especificidade da Espécie , Tetraploidia
8.
Sci Rep ; 6: 34908, 2016 10 10.
Artigo em Inglês | MEDLINE | ID: mdl-27721469

RESUMO

For species with potential as new crops, rapid improvement may be facilitated by new genomic methods. Apios (Apios americana Medik.), once a staple food source of Native American Indians, produces protein-rich tubers, tolerates a wide range of soils, and symbiotically fixes nitrogen. We report the first high-quality de novo transcriptome assembly, an expression atlas, and a set of 58,154 SNP and 39,609 gene expression markers (GEMs) for characterization of a breeding collection. Both SNPs and GEMs identify six genotypic clusters in the collection. Transcripts mapped to the Phaseolus vulgaris genome-another phaseoloid legume with the same chromosome number-provide provisional genetic locations for 46,852 SNPs. Linkage disequilibrium decays within 10 kb (based on the provisional genetic locations), consistent with outcrossing reproduction. SNPs and GEMs identify more than 21 marker-trait associations for at least 11 traits. This study demonstrates a holistic approach for mining plant collections to accelerate crop improvement.


Assuntos
Cruzamento , Fabaceae/crescimento & desenvolvimento , Fabaceae/genética , Genômica , Genótipo , Plantas Comestíveis/crescimento & desenvolvimento , Plantas Comestíveis/genética , Análise por Conglomerados , Perfilação da Expressão Gênica , Desequilíbrio de Ligação , Phaseolus/genética , Polimorfismo de Nucleotídeo Único
9.
Mol Plant ; 5(5): 1020-8, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22241453

RESUMO

A comprehensive transcriptome assembly for pigeonpea has been developed by analyzing 128.9 million short Illumina GA IIx single end reads, 2.19 million single end FLX/454 reads, and 18 353 Sanger expressed sequenced tags from more than 16 genotypes. The resultant transcriptome assembly, referred to as CcTA v2, comprised 21 434 transcript assembly contigs (TACs) with an N50 of 1510 bp, the largest one being ~8 kb. Of the 21 434 TACs, 16 622 (77.5%) could be mapped on to the soybean genome build 1.0.9 under fairly stringent alignment parameters. Based on knowledge of intron junctions, 10 009 primer pairs were designed from 5033 TACs for amplifying intron spanning regions (ISRs). By using in silico mapping of BAC-end-derived SSR loci of pigeonpea on the soybean genome as a reference, putative mapping positions at the chromosome level were predicted for 6284 ISR markers, covering all 11 pigeonpea chromosomes. A subset of 128 ISR markers were analyzed on a set of eight genotypes. While 116 markers were validated, 70 markers showed one to three alleles, with an average of 0.16 polymorphism information content (PIC) value. In summary, the CcTA v2 transcript assembly and ISR markers will serve as a useful resource to accelerate genetic research and breeding applications in pigeonpea.


Assuntos
Cajanus/genética , Análise de Sequência de DNA/métodos , Transcriptoma , Cajanus/classificação , Mapeamento Cromossômico , Fabaceae/classificação , Fabaceae/genética , Genótipo , Filogenia , Polimorfismo de Nucleotídeo Único
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA