Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Proc Natl Acad Sci U S A ; 111(26): 9645-50, 2014 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-24927599

RESUMO

The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed.


Assuntos
Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma de Planta/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Anotação de Sequência Molecular/métodos , Plantas/genética , Software , Redes e Vias Metabólicas/genética , Modelos Biológicos , Plantas/metabolismo , Biologia de Sistemas/métodos
2.
Nucleic Acids Res ; 42(Database issue): D1193-9, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24217918

RESUMO

Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.


Assuntos
Bases de Dados Genéticas , Genoma de Planta , Genômica , Produtos Agrícolas/genética , Variação Genética , Internet , Redes e Vias Metabólicas/genética , Anotação de Sequência Molecular , Plantas/genética , Plantas/metabolismo
3.
Proc Natl Acad Sci U S A ; 110(19): 7940-5, 2013 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-23610408

RESUMO

The current limitations in genome sequencing technology require the construction of physical maps for high-quality draft sequences of large plant genomes, such as that of Aegilops tauschii, the wheat D-genome progenitor. To construct a physical map of the Ae. tauschii genome, we fingerprinted 461,706 bacterial artificial chromosome clones, assembled contigs, designed a 10K Ae. tauschii Infinium SNP array, constructed a 7,185-marker genetic map, and anchored on the map contigs totaling 4.03 Gb. Using whole genome shotgun reads, we extended the SNP marker sequences and found 17,093 genes and gene fragments. We showed that collinearity of the Ae. tauschii genes with Brachypodium distachyon, rice, and sorghum decreased with phylogenetic distance and that structural genome evolution rates have been high across all investigated lineages in subfamily Pooideae, including that of Brachypodieae. We obtained additional information about the evolution of the seven Triticeae chromosomes from 12 ancestral chromosomes and uncovered a pattern of centromere inactivation accompanying nested chromosome insertions in grasses. We showed that the density of noncollinear genes along the Ae. tauschii chromosomes positively correlates with recombination rates, suggested a cause, and showed that new genes, exemplified by disease resistance genes, are preferentially located in high-recombination chromosome regions.


Assuntos
Mapeamento de Sequências Contíguas , Genoma de Planta , Poaceae/genética , Centrômero/ultraestrutura , Cromossomos Artificiais Bacterianos , Cromossomos de Plantas/ultraestrutura , Evolução Molecular , Genes de Plantas , Marcadores Genéticos , Polimorfismo de Nucleotídeo Único , Recombinação Genética , Análise de Sequência de DNA , Triticum/genética
4.
Nature ; 449(7164): 851-61, 2007 Oct 18.
Artigo em Inglês | MEDLINE | ID: mdl-17943122

RESUMO

We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.


Assuntos
Haplótipos/genética , Polimorfismo de Nucleotídeo Único/genética , Feminino , Homozigoto , Humanos , Desequilíbrio de Ligação/genética , Masculino , Grupos Raciais/genética , Recombinação Genética/genética , Seleção Genética
5.
Nature ; 449(7164): 913-8, 2007 Oct 18.
Artigo em Inglês | MEDLINE | ID: mdl-17943131

RESUMO

With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2). We used 'long-range haplotype' methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and we also developed new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population. The analysis reveals more than 300 strong candidate regions. Focusing on the strongest 22 regions, we develop a heuristic for scrutinizing these regions to identify candidate targets of selection. In a complementary analysis, we identify 26 non-synonymous, coding, single nucleotide polymorphisms showing regional evidence of positive selection. Examination of these candidates highlights three cases in which two genes in a common biological process have apparently undergone positive selection in the same population:LARGE and DMD, both related to infection by the Lassa virus, in West Africa;SLC24A5 and SLC45A2, both involved in skin pigmentation, in Europe; and EDAR and EDA2R, both involved in development of hair follicles, in Asia.


Assuntos
Genoma Humano/genética , Seleção Genética , Antiporters/genética , Receptor Edar/química , Receptor Edar/genética , Frequência do Gene , Genética Populacional , Geografia , Haplótipos/genética , Humanos , Modelos Moleculares , Polimorfismo de Nucleotídeo Único/genética , Estrutura Terciária de Proteína
6.
Dev Biol ; 354(1): 1-8, 2011 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-21458440

RESUMO

We present a genetic map for Xenopus tropicalis, consisting of 2886 Simple Sequence Length Polymorphism (SSLP) markers. Using a bioinformatics-based strategy, we identified unique SSLPs within the X. tropicalis genome. Scaffolds from X. tropicalis genome assembly 2.0 (JGI) were scanned for Simple Sequence Repeats (SSRs); unique SSRs were then tested for amplification and polymorphisms using DNA from inbred Nigerian and Ivory Coast individuals. Thus identified, the SSLPs were genotyped against a mapping cross panel of DNA samples from 190 F2 individuals. Nearly 4000 SSLPs were genotyped, yielding a 2886-marker genetic map consisting of 10 major linkage groups between 73 and 132cM in length, and 4 smaller linkage groups between 7 and 40cM. The total effective size of the map is 1658cM, and the average intermarker distance for each linkage group ranged from 0.27 to 0.75cM. Fluorescence In Situ Hybridization (FISH) was carried out using probes for genes located on mapped scaffolds to assign linkage groups to chromosomes. Comparisons of this map with the X. tropicalis genome Assembly 4.1 (JGI) indicate that the map provides representation of a minimum of 66% of the X. tropicalis genome, incorporating 758 of the approximately 1300 scaffolds over 100,000bp. The genetic map and SSLP marker database constitute an essential resource for genetic and genomic analyses in X. tropicalis.


Assuntos
Mapeamento Cromossômico/métodos , Marcadores Genéticos/genética , Cariotipagem Espectral/métodos , Xenopus/genética , Animais , Bandeamento Cromossômico , Genoma/genética , Genótipo , Internet , Repetições Minissatélites/genética , Polimorfismo Genético , Proteínas de Xenopus/genética
7.
PLoS Genet ; 5(11): e1000711, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19936062

RESUMO

About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/ approximately 23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/ approximately 2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars.


Assuntos
Genoma de Planta/genética , Zea mays/genética , Algoritmos , Sequência de Bases , Cromossomos Artificiais Bacterianos/genética , Mapeamento de Sequências Contíguas , Dados de Sequência Molecular , Fenômenos Ópticos , Mapeamento Físico do Cromossomo , Alinhamento de Sequência
8.
PLoS Genet ; 5(11): e1000728, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19936048

RESUMO

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on approximately 1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses.


Assuntos
Pareamento de Bases/genética , Genoma de Planta/genética , Zea mays/genética , Sequência de Bases , Cromossomos de Plantas/genética , Elementos de DNA Transponíveis/genética , Evolução Molecular , Duplicação Gênica , Rearranjo Gênico/genética , Genes de Plantas , Loci Gênicos/genética , Dados de Sequência Molecular , Mutação/genética , Fases de Leitura Aberta/genética , Oryza/genética , Mapeamento Físico do Cromossomo , RNA de Plantas/genética , Homologia de Sequência do Ácido Nucleico , Sorghum/genética , Sintenia/genética
9.
PLoS Genet ; 1(3): e41, 2005 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-16205789

RESUMO

A region of approximately one megabase of human Chromosome 12 shows extensive linkage disequilibrium in Utah residents with ancestry from northern and western Europe. This strikingly large linkage disequilibrium block was analyzed with statistical and experimental methods to determine whether natural selection could be implicated in shaping the current genome structure. Extended Haplotype Homozygosity and Relative Extended Haplotype Homozygosity analyses on this region mapped a core region of the strongest conserved haplotype to the exon 1 of the Spinocerebellar ataxia type 2 gene (SCA2). Direct DNA sequencing of this region of the SCA2 gene revealed a significant association between a pre-expanded allele [(CAG)8CAA(CAG)4CAA(CAG)8] of CAG repeats within exon 1 and the selected haplotype of the SCA2 gene. A significantly negative Tajima's D value (-2.20, p < 0.01) on this site consistently suggested selection on the CAG repeat. This region was also investigated in the three other populations, none of which showed signs of selection. These results suggest that a recent positive selection of the pre-expansion SCA2 CAG repeat has occurred in Utah residents with European ancestry.


Assuntos
Cromossomos Humanos Par 12 , Proteínas do Tecido Nervoso/genética , Degenerações Espinocerebelares/genética , Repetições de Trinucleotídeos , Ataxinas , Sequência de Bases , Mapeamento Cromossômico , Europa (Continente)/etnologia , Éxons , Frequência do Gene , Humanos , Dados de Sequência Molecular , Polimorfismo de Nucleotídeo Único , Seleção Genética , Utah , População Branca/genética
10.
Front Plant Sci ; 2: 34, 2011.
Artigo em Inglês | MEDLINE | ID: mdl-22645531

RESUMO

The iPlant Collaborative (iPlant) is a United States National Science Foundation (NSF) funded project that aims to create an innovative, comprehensive, and foundational cyberinfrastructure in support of plant biology research (PSCIC, 2006). iPlant is developing cyberinfrastructure that uniquely enables scientists throughout the diverse fields that comprise plant biology to address Grand Challenges in new ways, to stimulate and facilitate cross-disciplinary research, to promote biology and computer science research interactions, and to train the next generation of scientists on the use of cyberinfrastructure in research and education. Meeting humanity's projected demands for agricultural and forest products and the expectation that natural ecosystems be managed sustainably will require synergies from the application of information technologies. The iPlant cyberinfrastructure design is based on an unprecedented period of research community input, and leverages developments in high-performance computing, data storage, and cyberinfrastructure for the physical sciences. iPlant is an open-source project with application programming interfaces that allow the community to extend the infrastructure to meet its needs. iPlant is sponsoring community-driven workshops addressing specific scientific questions via analysis tool integration and hypothesis testing. These workshops teach researchers how to add bioinformatics tools and/or datasets into the iPlant cyberinfrastructure enabling plant scientists to perform complex analyses on large datasets without the need to master the command-line or high-performance computational services.

11.
Science ; 326(5956): 1112-5, 2009 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-19965430

RESUMO

We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.


Assuntos
Variação Genética , Genoma de Planta , Análise de Sequência de DNA , Zea mays/genética , Sequência de Bases , Centrômero/genética , Mapeamento Cromossômico , Cromossomos de Plantas/genética , Produtos Agrícolas/genética , Variações do Número de Cópias de DNA , Metilação de DNA , Elementos de DNA Transponíveis , DNA de Plantas/genética , Genes de Plantas , Endogamia , MicroRNAs/genética , Dados de Sequência Molecular , Ploidias , RNA de Plantas/genética , Recombinação Genética , Retroelementos
12.
Genome Res ; 15(2): 269-75, 2005 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-15687290

RESUMO

Large-scale genetic studies are highly dependent on efficient and scalable multiplex SNP assays. In this study, we report the development of Molecular Inversion Probe technology with four-color, single array detection, applied to large-scale genotyping of up to 12,000 SNPs per reaction. While generating 38,429 SNP assays using this technology in a population of 30 trios from the Centre d'Etude Polymorphisme Humain family panel as part of the International HapMap project, we established SNP conversion rates of approximately 90% with concordance rates >99.6% and completeness levels >98% for assays multiplexed up to 12,000plex levels. Furthermore, these individual metrics can be "traded off" and, by sacrificing a small fraction of the conversion rate, the accuracy can be increased to very high levels. No loss of performance is seen when scaling from 6,000plex to 12,000plex assays, strongly validating the ability of the technology to suppress cross-reactivity at high multiplex levels. The results of this study demonstrate the suitability of this technology for comprehensive association studies that use targeted SNPs in indirect linkage disequilibrium studies or that directly screen for causative mutations.


Assuntos
Inversão Cromossômica/genética , Técnicas de Sonda Molecular/tendências , Polimorfismo de Nucleotídeo Único/genética , Inversão Cromossômica/estatística & dados numéricos , Análise por Conglomerados , Sondas de DNA/genética , Perfilação da Expressão Gênica , Genoma Humano , Genótipo , Humanos , Técnicas de Sonda Molecular/normas , Técnicas de Sonda Molecular/estatística & dados numéricos , Sondas Moleculares/genética , Projetos de Pesquisa
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA