Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 12 de 12
Filtrar
1.
Proc Natl Acad Sci U S A ; 111(26): 9645-50, 2014 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-24927599

RESUMEN

The increasing number of sequenced plant genomes is placing new demands on the methods applied to analyze, annotate, and model these genomes. Today's annotation pipelines result in inconsistent gene assignments that complicate comparative analyses and prevent efficient construction of metabolic models. To overcome these problems, we have developed the PlantSEED, an integrated, metabolism-centric database to support subsystems-based annotation and metabolic model reconstruction for plant genomes. PlantSEED combines SEED subsystems technology, first developed for microbial genomes, with refined protein families and biochemical data to assign fully consistent functional annotations to orthologous genes, particularly those encoding primary metabolic pathways. Seamless integration with its parent, the prokaryotic SEED database, makes PlantSEED a unique environment for cross-kingdom comparative analysis of plant and bacterial genomes. The consistent annotations imposed by PlantSEED permit rapid reconstruction and modeling of primary metabolism for all plant genomes in the database. This feature opens the unique possibility of model-based assessment of the completeness and accuracy of gene annotation and thus allows computational identification of genes and pathways that are restricted to certain genomes or need better curation. We demonstrate the PlantSEED system by producing consistent annotations for 10 reference genomes. We also produce a functioning metabolic model for each genome, gapfilling to identify missing annotations and proposing gene candidates for missing annotations. Models are built around an extended biomass composition representing the most comprehensive published to date. To our knowledge, our models are the first to be published for seven of the genomes analyzed.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Genoma de Planta/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Anotación de Secuencia Molecular/métodos , Plantas/genética , Programas Informáticos , Redes y Vías Metabólicas/genética , Modelos Biológicos , Plantas/metabolismo , Biología de Sistemas/métodos
2.
Nucleic Acids Res ; 42(Database issue): D1193-9, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24217918

RESUMEN

Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.


Asunto(s)
Bases de Datos Genéticas , Genoma de Planta , Genómica , Productos Agrícolas/genética , Variación Genética , Internet , Redes y Vías Metabólicas/genética , Anotación de Secuencia Molecular , Plantas/genética , Plantas/metabolismo
3.
Proc Natl Acad Sci U S A ; 110(19): 7940-5, 2013 May 07.
Artículo en Inglés | MEDLINE | ID: mdl-23610408

RESUMEN

The current limitations in genome sequencing technology require the construction of physical maps for high-quality draft sequences of large plant genomes, such as that of Aegilops tauschii, the wheat D-genome progenitor. To construct a physical map of the Ae. tauschii genome, we fingerprinted 461,706 bacterial artificial chromosome clones, assembled contigs, designed a 10K Ae. tauschii Infinium SNP array, constructed a 7,185-marker genetic map, and anchored on the map contigs totaling 4.03 Gb. Using whole genome shotgun reads, we extended the SNP marker sequences and found 17,093 genes and gene fragments. We showed that collinearity of the Ae. tauschii genes with Brachypodium distachyon, rice, and sorghum decreased with phylogenetic distance and that structural genome evolution rates have been high across all investigated lineages in subfamily Pooideae, including that of Brachypodieae. We obtained additional information about the evolution of the seven Triticeae chromosomes from 12 ancestral chromosomes and uncovered a pattern of centromere inactivation accompanying nested chromosome insertions in grasses. We showed that the density of noncollinear genes along the Ae. tauschii chromosomes positively correlates with recombination rates, suggested a cause, and showed that new genes, exemplified by disease resistance genes, are preferentially located in high-recombination chromosome regions.


Asunto(s)
Mapeo Contig , Genoma de Planta , Poaceae/genética , Centrómero/ultraestructura , Cromosomas Artificiales Bacterianos , Cromosomas de las Plantas/ultraestructura , Evolución Molecular , Genes de Plantas , Marcadores Genéticos , Polimorfismo de Nucleótido Simple , Recombinación Genética , Análisis de Secuencia de ADN , Triticum/genética
4.
Nature ; 449(7164): 851-61, 2007 Oct 18.
Artículo en Inglés | MEDLINE | ID: mdl-17943122

RESUMEN

We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.


Asunto(s)
Haplotipos/genética , Polimorfismo de Nucleótido Simple/genética , Femenino , Homocigoto , Humanos , Desequilibrio de Ligamiento/genética , Masculino , Grupos Raciales/genética , Recombinación Genética/genética , Selección Genética
5.
Nature ; 449(7164): 913-8, 2007 Oct 18.
Artículo en Inglés | MEDLINE | ID: mdl-17943131

RESUMEN

With the advent of dense maps of human genetic variation, it is now possible to detect positive natural selection across the human genome. Here we report an analysis of over 3 million polymorphisms from the International HapMap Project Phase 2 (HapMap2). We used 'long-range haplotype' methods, which were developed to identify alleles segregating in a population that have undergone recent selection, and we also developed new methods that are based on cross-population comparisons to discover alleles that have swept to near-fixation within a population. The analysis reveals more than 300 strong candidate regions. Focusing on the strongest 22 regions, we develop a heuristic for scrutinizing these regions to identify candidate targets of selection. In a complementary analysis, we identify 26 non-synonymous, coding, single nucleotide polymorphisms showing regional evidence of positive selection. Examination of these candidates highlights three cases in which two genes in a common biological process have apparently undergone positive selection in the same population:LARGE and DMD, both related to infection by the Lassa virus, in West Africa;SLC24A5 and SLC45A2, both involved in skin pigmentation, in Europe; and EDAR and EDA2R, both involved in development of hair follicles, in Asia.


Asunto(s)
Genoma Humano/genética , Selección Genética , Antiportadores/genética , Receptor Edar/química , Receptor Edar/genética , Frecuencia de los Genes , Genética de Población , Geografía , Haplotipos/genética , Humanos , Modelos Moleculares , Polimorfismo de Nucleótido Simple/genética , Estructura Terciaria de Proteína
6.
Dev Biol ; 354(1): 1-8, 2011 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-21458440

RESUMEN

We present a genetic map for Xenopus tropicalis, consisting of 2886 Simple Sequence Length Polymorphism (SSLP) markers. Using a bioinformatics-based strategy, we identified unique SSLPs within the X. tropicalis genome. Scaffolds from X. tropicalis genome assembly 2.0 (JGI) were scanned for Simple Sequence Repeats (SSRs); unique SSRs were then tested for amplification and polymorphisms using DNA from inbred Nigerian and Ivory Coast individuals. Thus identified, the SSLPs were genotyped against a mapping cross panel of DNA samples from 190 F2 individuals. Nearly 4000 SSLPs were genotyped, yielding a 2886-marker genetic map consisting of 10 major linkage groups between 73 and 132cM in length, and 4 smaller linkage groups between 7 and 40cM. The total effective size of the map is 1658cM, and the average intermarker distance for each linkage group ranged from 0.27 to 0.75cM. Fluorescence In Situ Hybridization (FISH) was carried out using probes for genes located on mapped scaffolds to assign linkage groups to chromosomes. Comparisons of this map with the X. tropicalis genome Assembly 4.1 (JGI) indicate that the map provides representation of a minimum of 66% of the X. tropicalis genome, incorporating 758 of the approximately 1300 scaffolds over 100,000bp. The genetic map and SSLP marker database constitute an essential resource for genetic and genomic analyses in X. tropicalis.


Asunto(s)
Mapeo Cromosómico/métodos , Marcadores Genéticos/genética , Cariotipificación Espectral/métodos , Xenopus/genética , Animales , Bandeo Cromosómico , Genoma/genética , Genotipo , Internet , Repeticiones de Minisatélite/genética , Polimorfismo Genético , Proteínas de Xenopus/genética
7.
PLoS Genet ; 5(11): e1000711, 2009 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-19936062

RESUMEN

About 85% of the maize genome consists of highly repetitive sequences that are interspersed by low-copy, gene-coding sequences. The maize community has dealt with this genomic complexity by the construction of an integrated genetic and physical map (iMap), but this resource alone was not sufficient for ensuring the quality of the current sequence build. For this purpose, we constructed a genome-wide, high-resolution optical map of the maize inbred line B73 genome containing >91,000 restriction sites (averaging 1 site/ approximately 23 kb) accrued from mapping genomic DNA molecules. Our optical map comprises 66 contigs, averaging 31.88 Mb in size and spanning 91.5% (2,103.93 Mb/ approximately 2,300 Mb) of the maize genome. A new algorithm was created that considered both optical map and unfinished BAC sequence data for placing 60/66 (2,032.42 Mb) optical map contigs onto the maize iMap. The alignment of optical maps against numerous data sources yielded comprehensive results that proved revealing and productive. For example, gaps were uncovered and characterized within the iMap, the FPC (fingerprinted contigs) map, and the chromosome-wide pseudomolecules. Such alignments also suggested amended placements of FPC contigs on the maize genetic map and proactively guided the assembly of chromosome-wide pseudomolecules, especially within complex genomic regions. Lastly, we think that the full integration of B73 optical maps with the maize iMap would greatly facilitate maize sequence finishing efforts that would make it a valuable reference for comparative studies among cereals, or other maize inbred lines and cultivars.


Asunto(s)
Genoma de Planta/genética , Zea mays/genética , Algoritmos , Secuencia de Bases , Cromosomas Artificiales Bacterianos/genética , Mapeo Contig , Datos de Secuencia Molecular , Fenómenos Ópticos , Mapeo Físico de Cromosoma , Alineación de Secuencia
8.
PLoS Genet ; 5(11): e1000728, 2009 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-19936048

RESUMEN

Most of our understanding of plant genome structure and evolution has come from the careful annotation of small (e.g., 100 kb) sequenced genomic regions or from automated annotation of complete genome sequences. Here, we sequenced and carefully annotated a contiguous 22 Mb region of maize chromosome 4 using an improved pseudomolecule for annotation. The sequence segment was comprehensively ordered, oriented, and confirmed using the maize optical map. Nearly 84% of the sequence is composed of transposable elements (TEs) that are mostly nested within each other, of which most families are low-copy. We identified 544 gene models using multiple levels of evidence, as well as five miRNA genes. Gene fragments, many captured by TEs, are prevalent within this region. Elimination of gene redundancy from a tetraploid maize ancestor that originated a few million years ago is responsible in this region for most disruptions of synteny with sorghum and rice. Consistent with other sub-genomic analyses in maize, small RNA mapping showed that many small RNAs match TEs and that most TEs match small RNAs. These results, performed on approximately 1% of the maize genome, demonstrate the feasibility of refining the B73 RefGen_v1 genome assembly by incorporating optical map, high-resolution genetic map, and comparative genomic data sets. Such improvements, along with those of gene and repeat annotation, will serve to promote future functional genomic and phylogenomic research in maize and other grasses.


Asunto(s)
Emparejamiento Base/genética , Genoma de Planta/genética , Zea mays/genética , Secuencia de Bases , Cromosomas de las Plantas/genética , Elementos Transponibles de ADN/genética , Evolución Molecular , Duplicación de Gen , Reordenamiento Génico/genética , Genes de Plantas , Sitios Genéticos/genética , Datos de Secuencia Molecular , Mutación/genética , Sistemas de Lectura Abierta/genética , Oryza/genética , Mapeo Físico de Cromosoma , ARN de Planta/genética , Homología de Secuencia de Ácido Nucleico , Sorghum/genética , Sintenía/genética
9.
PLoS Genet ; 1(3): e41, 2005 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-16205789

RESUMEN

A region of approximately one megabase of human Chromosome 12 shows extensive linkage disequilibrium in Utah residents with ancestry from northern and western Europe. This strikingly large linkage disequilibrium block was analyzed with statistical and experimental methods to determine whether natural selection could be implicated in shaping the current genome structure. Extended Haplotype Homozygosity and Relative Extended Haplotype Homozygosity analyses on this region mapped a core region of the strongest conserved haplotype to the exon 1 of the Spinocerebellar ataxia type 2 gene (SCA2). Direct DNA sequencing of this region of the SCA2 gene revealed a significant association between a pre-expanded allele [(CAG)8CAA(CAG)4CAA(CAG)8] of CAG repeats within exon 1 and the selected haplotype of the SCA2 gene. A significantly negative Tajima's D value (-2.20, p < 0.01) on this site consistently suggested selection on the CAG repeat. This region was also investigated in the three other populations, none of which showed signs of selection. These results suggest that a recent positive selection of the pre-expansion SCA2 CAG repeat has occurred in Utah residents with European ancestry.


Asunto(s)
Cromosomas Humanos Par 12 , Proteínas del Tejido Nervioso/genética , Degeneraciones Espinocerebelosas/genética , Repeticiones de Trinucleótidos , Ataxinas , Secuencia de Bases , Mapeo Cromosómico , Europa (Continente)/etnología , Exones , Frecuencia de los Genes , Humanos , Datos de Secuencia Molecular , Polimorfismo de Nucleótido Simple , Selección Genética , Utah , Población Blanca/genética
10.
Front Plant Sci ; 2: 34, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-22645531

RESUMEN

The iPlant Collaborative (iPlant) is a United States National Science Foundation (NSF) funded project that aims to create an innovative, comprehensive, and foundational cyberinfrastructure in support of plant biology research (PSCIC, 2006). iPlant is developing cyberinfrastructure that uniquely enables scientists throughout the diverse fields that comprise plant biology to address Grand Challenges in new ways, to stimulate and facilitate cross-disciplinary research, to promote biology and computer science research interactions, and to train the next generation of scientists on the use of cyberinfrastructure in research and education. Meeting humanity's projected demands for agricultural and forest products and the expectation that natural ecosystems be managed sustainably will require synergies from the application of information technologies. The iPlant cyberinfrastructure design is based on an unprecedented period of research community input, and leverages developments in high-performance computing, data storage, and cyberinfrastructure for the physical sciences. iPlant is an open-source project with application programming interfaces that allow the community to extend the infrastructure to meet its needs. iPlant is sponsoring community-driven workshops addressing specific scientific questions via analysis tool integration and hypothesis testing. These workshops teach researchers how to add bioinformatics tools and/or datasets into the iPlant cyberinfrastructure enabling plant scientists to perform complex analyses on large datasets without the need to master the command-line or high-performance computational services.

11.
Science ; 326(5956): 1112-5, 2009 Nov 20.
Artículo en Inglés | MEDLINE | ID: mdl-19965430

RESUMEN

We report an improved draft nucleotide sequence of the 2.3-gigabase genome of maize, an important crop plant and model for biological research. Over 32,000 genes were predicted, of which 99.8% were placed on reference chromosomes. Nearly 85% of the genome is composed of hundreds of families of transposable elements, dispersed nonuniformly across the genome. These were responsible for the capture and amplification of numerous gene fragments and affect the composition, sizes, and positions of centromeres. We also report on the correlation of methylation-poor regions with Mu transposon insertions and recombination, and copy number variants with insertions and/or deletions, as well as how uneven gene losses between duplicated regions were involved in returning an ancient allotetraploid to a genetically diploid state. These analyses inform and set the stage for further investigations to improve our understanding of the domestication and agricultural improvements of maize.


Asunto(s)
Variación Genética , Genoma de Planta , Análisis de Secuencia de ADN , Zea mays/genética , Secuencia de Bases , Centrómero/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Productos Agrícolas/genética , Variaciones en el Número de Copia de ADN , Metilación de ADN , Elementos Transponibles de ADN , ADN de Plantas/genética , Genes de Plantas , Endogamia , MicroARNs/genética , Datos de Secuencia Molecular , Ploidias , ARN de Planta/genética , Recombinación Genética , Retroelementos
12.
Genome Res ; 15(2): 269-75, 2005 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-15687290

RESUMEN

Large-scale genetic studies are highly dependent on efficient and scalable multiplex SNP assays. In this study, we report the development of Molecular Inversion Probe technology with four-color, single array detection, applied to large-scale genotyping of up to 12,000 SNPs per reaction. While generating 38,429 SNP assays using this technology in a population of 30 trios from the Centre d'Etude Polymorphisme Humain family panel as part of the International HapMap project, we established SNP conversion rates of approximately 90% with concordance rates >99.6% and completeness levels >98% for assays multiplexed up to 12,000plex levels. Furthermore, these individual metrics can be "traded off" and, by sacrificing a small fraction of the conversion rate, the accuracy can be increased to very high levels. No loss of performance is seen when scaling from 6,000plex to 12,000plex assays, strongly validating the ability of the technology to suppress cross-reactivity at high multiplex levels. The results of this study demonstrate the suitability of this technology for comprehensive association studies that use targeted SNPs in indirect linkage disequilibrium studies or that directly screen for causative mutations.


Asunto(s)
Inversión Cromosómica/genética , Técnicas de Sonda Molecular/tendencias , Polimorfismo de Nucleótido Simple/genética , Inversión Cromosómica/estadística & datos numéricos , Análisis por Conglomerados , Sondas de ADN/genética , Perfilación de la Expresión Génica , Genoma Humano , Genotipo , Humanos , Técnicas de Sonda Molecular/normas , Técnicas de Sonda Molecular/estadística & datos numéricos , Sondas Moleculares/genética , Proyectos de Investigación
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA