Search | BVS - Ministry of Health

1.

A high-performance computational workflow to accelerate GATK SNP detection across a 25-genome dataset.

Zhou, Yong; Kathiresan, Nagarajan; Yu, Zhichao; Rivera, Luis F; Yang, Yujian; Thimma, Manjula; Manickam, Keerthana; Chebotarov, Dmytro; Mauleon, Ramil; Chougule, Kapeel; Wei, Sharon; Gao, Tingting; Green, Carl D; Zuccolo, Andrea; Xie, Weibo; Ware, Doreen; Zhang, Jianwei; McNally, Kenneth L; Wing, Rod A.

BMC Biol ; 22(1): 13, 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-38273258

ABSTRACT

BACKGROUND: Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable. RESULTS: Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy, obtaining results comparable to previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~3000 resequenced rice accessions. The entire process took ~16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq). CONCLUSIONS: This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.

Subjects

Genome, Plant; Polymorphism, Single Nucleotide; Workflow; Plant Breeding; Software; High-Throughput Nucleotide Sequencing/methods

2.

A large sequenced mutant library - valuable reverse genetic resource that covers 98% of sorghum genes.

Jiao, Yinping; Nigam, Deepti; Barry, Kerrie; Daum, Chris; Yoshinaga, Yuko; Lipzen, Anna; Khan, Adil; Parasa, Sai-Praneeth; Wei, Sharon; Lu, Zhenyuan; Tello-Ruiz, Marcela K; Dhiman, Pallavi; Burow, Gloria; Hayes, Chad; Chen, Junping; Brandizzi, Federica; Mortimer, Jenny; Ware, Doreen; Xin, Zhanguo.

Plant J ; 117(5): 1543-1557, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38100514

ABSTRACT

Mutant populations are crucial for functional genomics and discovering novel traits for crop breeding. Sorghum, a drought and heat-tolerant C4 species, requires a vast, large-scale, annotated, and sequenced mutant resource to enhance crop improvement through functional genomics research. Here, we report a sorghum large-scale sequenced mutant population with 9.5 million ethyl methane sulfonate (EMS)-induced mutations that covered 98% of sorghum's annotated genes using inbred line BTx623. Remarkably, a total of 610 320 mutations within the promoter and enhancer regions of 18 000 and 11 790 genes, respectively, can be leveraged for novel research of cis-regulatory elements. A comparison of the distribution of mutations in the large-scale mutant library and sorghum association panel (SAP) provides insights into the influence of selection. EMS-induced mutations appeared to be random across different regions of the genome without significant enrichment in different sections of a gene, including the 5' UTR, gene body, and 3' UTR. In contrast, there was low variation density in the coding and UTR regions in the SAP. Based on the Ka/Ks value, the mutant library (~1) experienced little selection, unlike the SAP (0.40), which has been strongly selected through breeding. All mutation data are publicly searchable through SorbMutDB (https://www.depts.ttu.edu/igcast/sorbmutdb.php) and SorghumBase (https://sorghumbase.org/). This current large-scale sequence-indexed sorghum mutant population is a crucial resource that enriched the sorghum gene pool with novel diversity and is a highly valuable tool for the Poaceae family that will advance plant biology research and crop breeding.

Subjects

Sorghum; Sorghum/genetics; Reverse Genetics; Plant Breeding; Mutation; Phenotype; Edible Grain/genetics; Ethyl Methanesulfonate/pharmacology; Genome, Plant/genetics

3.

SorghumBase: a web-based portal for sorghum genetic information and community advancement.

Gladman, Nicholas; Olson, Andrew; Wei, Sharon; Chougule, Kapeel; Lu, Zhenyuan; Tello-Ruiz, Marcela; Meijs, Ivar; Van Buren, Peter; Jiao, Yinping; Wang, Bo; Kumar, Vivek; Kumari, Sunita; Zhang, Lifang; Burke, John; Chen, Junping; Burow, Gloria; Hayes, Chad; Emendack, Yves; Xin, Zhanguo; Ware, Doreen.

Planta ; 255(2): 35, 2022 Jan 11.

Article in English | MEDLINE | ID: mdl-35015132

ABSTRACT

MAIN CONCLUSION: SorghumBase provides a community portal that integrates genetic, genomic, and breeding resources for sorghum germplasm improvement. Public research and development in agriculture rely on proper data and resource sharing within stakeholder communities. For plant breeders, agronomists, molecular biologists, geneticists, and bioinformaticians, centralizing desirable data into a user-friendly hub for crop systems is essential for successful collaborations and breakthroughs in germplasm development. Here, we present the SorghumBase web portal (https://www.sorghumbase.org), a resource for the sorghum research community. SorghumBase hosts a wide range of sorghum genomic information in a modular framework, built with open-source software, to provide a sustainable platform. This initial release of SorghumBase includes: (1) five sorghum reference genome assemblies in a pan-genome browser; (2) genetic variant information for natural diversity panels and ethyl methanesulfonate (EMS)-induced mutant populations; (3) search interface and integrated views of various data types; (4) links supporting interconnectivity with other repositories including genebank, QTL, and gene expression databases; and (5) a content management system to support access to community news and training materials. SorghumBase offers sorghum investigators improved data collation and access that will facilitate the growth of a robust research community to support genomics-assisted breeding.

Subjects

Sorghum; Databases, Genetic; Edible Grain; Genome, Plant/genetics; Genomics; Internet; Plant Breeding; Sorghum/genetics

4.

Comparison of seven heterophile antibody assays for laboratory diagnosis of infectious mononucleosis in pediatric patients.

Wei, Sharon; Huang, Wei; Farah, Mohamed; Khanolkar, Aaruni; Zheng, Xiaotian.

J Med Virol ; 93(11): 6404-6407, 2021 Nov.

Article in English | MEDLINE | ID: mdl-34347299

ABSTRACT

Heterophile antibody assays have been used to aid the diagnosis of infectious mononucleosis caused by the Epstein-Barr virus. Seven commercially available assays currently widely utilized in clinical laboratories were compared in this study. Variable performance characteristics and assay times are observed, and these pieces of data may assist clinical laboratories in assay selection and result interpretation.

Subjects

Antibodies, Heterophile/blood; Antibodies, Viral/blood; Clinical Laboratory Techniques/standards; Epstein-Barr Virus Infections/diagnosis; Infectious Mononucleosis/diagnosis; Infectious Mononucleosis/immunology; Reagent Kits, Diagnostic/standards; Adolescent; Antibodies, Heterophile/immunology; Child; Clinical Laboratory Techniques/methods; Epstein-Barr Virus Infections/blood; Humans; Immunoglobulin M/blood; Infectious Mononucleosis/blood; Young Adult

5.

De novo assembly, annotation, and comparative analysis of 26 diverse maize genomes.

Hufford, Matthew B; Seetharam, Arun S; Woodhouse, Margaret R; Chougule, Kapeel M; Ou, Shujun; Liu, Jianing; Ricci, William A; Guo, Tingting; Olson, Andrew; Qiu, Yinjie; Della Coletta, Rafael; Tittes, Silas; Hudson, Asher I; Marand, Alexandre P; Wei, Sharon; Lu, Zhenyuan; Wang, Bo; Tello-Ruiz, Marcela K; Piri, Rebecca D; Wang, Na; Kim, Dong Won; Zeng, Yibing; O'Connor, Christine H; Li, Xianran; Gilbert, Amanda M; Baggs, Erin; Krasileva, Ksenia V; Portwood, John L; Cannon, Ethalinda K S; Andorf, Carson M; Manchanda, Nancy; Snodgrass, Samantha J; Hufnagel, David E; Jiang, Qiuhan; Pedersen, Sarah; Syring, Michael L; Kudrna, David A; Llaca, Victor; Fengler, Kevin; Schmitz, Robert J; Ross-Ibarra, Jeffrey; Yu, Jianming; Gent, Jonathan I; Hirsch, Candice N; Ware, Doreen; Dawe, R Kelly.

Science ; 373(6555): 655-662, 2021 Aug 06.

Article in English | MEDLINE | ID: mdl-34353948

ABSTRACT

We report de novo genome assemblies, transcriptomes, annotations, and methylomes for the 26 inbreds that serve as the founders for the maize nested association mapping population. The number of pan-genes in these diverse genomes exceeds 103,000, with approximately a third found across all genotypes. The results demonstrate that the ancient tetraploid character of maize continues to degrade by fractionation to the present day. Excellent contiguity over repeat arrays and complete annotation of centromeres revealed additional variation in major cytological landmarks. We show that combining structural variation with single-nucleotide polymorphisms can improve the power of quantitative mapping studies. We also document variation at the level of DNA methylation and demonstrate that unmethylated regions are enriched for cis-regulatory elements that contribute to phenotypic variation.

Subjects

Genome, Plant; Molecular Sequence Annotation; Zea mays/genetics; Centromere/genetics; Chromosome Mapping; Chromosomes, Plant; DNA Methylation; Disease Resistance/genetics; Genes, Plant; Genetic Variation; Genotype; High-Throughput Nucleotide Sequencing; Multifactorial Inheritance/genetics; Phenotype; Plant Diseases; Polymorphism, Single Nucleotide; Regulatory Sequences, Nucleic Acid; Sequence Analysis, DNA; Tetraploidy; Transcriptome; Whole Genome Sequencing

6.

Gramene 2021: harnessing the power of comparative genomics and pathways for plant research.

Tello-Ruiz, Marcela K; Naithani, Sushma; Gupta, Parul; Olson, Andrew; Wei, Sharon; Preece, Justin; Jiao, Yinping; Wang, Bo; Chougule, Kapeel; Garg, Priyanka; Elser, Justin; Kumari, Sunita; Kumar, Vivek; Contreras-Moreira, Bruno; Naamati, Guy; George, Nancy; Cook, Justin; Bolser, Daniel; D'Eustachio, Peter; Stein, Lincoln D; Gupta, Amit; Xu, Weijia; Regala, Jennifer; Papatheodorou, Irene; Kersey, Paul J; Flicek, Paul; Taylor, Crispin; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 49(D1): D1452-D1463, 2021 Jan 08.

Article in English | MEDLINE | ID: mdl-33170273

ABSTRACT

Gramene (http://www.gramene.org), a knowledgebase founded on comparative functional analyses of genomic and pathway data for model plants and major crops, supports agricultural researchers worldwide. The resource is committed to open access and reproducible science based on the FAIR data principles. Since the last NAR update, we made nine releases; doubled the genome portal's content; expanded curated genes, pathways and expression sets; and implemented the Domain Informational Vocabulary Extraction (DIVE) algorithm for extracting gene function information from publications. The current release, #63 (October 2020), hosts 93 reference genomes (over 3.9 million genes in 122 947 families with orthologous and paralogous classifications). Plant Reactome portrays pathway networks using a combination of manual biocuration in rice (320 reference pathways) and orthology-based projections to 106 species. The Reactome platform facilitates comparison between reference and projected pathways, gene expression analyses and overlays of gene-gene interactions. Gramene integrates ontology-based protein structure-function annotation; information on genetic, epigenetic, expression, and phenotypic diversity; and gene functional annotations extracted from plant-focused journals using DIVE. We train plant researchers in biocuration of genes and pathways; host curated maize gene structures as tracks in the maize genome browser; and integrate curated rice genes and pathways in the Plant Reactome.

Subjects

Databases, Genetic; Gene Expression Regulation, Plant; Genome, Plant; Genomics/methods; Plant Proteins/genetics; Plants/genetics; Crops, Agricultural; DNA Transposable Elements; Gene Duplication; Gene Ontology; Gene Regulatory Networks; Internet; Knowledge Bases; Metabolic Networks and Pathways; Molecular Sequence Annotation; Oryza/genetics; Oryza/metabolism; Plant Proteins/metabolism; Plants/classification; Plants/metabolism; Polyploidy; Protein Interaction Mapping; Software; Zea mays/genetics; Zea mays/metabolism

7.

Effect of sequence depth and length in long-read assembly of the maize inbred NC358.

Ou, Shujun; Liu, Jianing; Chougule, Kapeel M; Fungtammasan, Arkarachai; Seetharam, Arun S; Stein, Joshua C; Llaca, Victor; Manchanda, Nancy; Gilbert, Amanda M; Wei, Sharon; Chin, Chen-Shan; Hufnagel, David E; Pedersen, Sarah; Snodgrass, Samantha J; Fengler, Kevin; Woodhouse, Margaret; Walenz, Brian P; Koren, Sergey; Phillippy, Adam M; Hannigan, Brett T; Dawe, R Kelly; Hirsch, Candice N; Hufford, Matthew B; Ware, Doreen.

Nat Commun ; 11(1): 2288, 2020 May 08.

Article in English | MEDLINE | ID: mdl-32385271

ABSTRACT

Improvements in long-read data and scaffolding technologies have enabled rapid generation of reference-quality assemblies for complex genomes. Still, an assessment of critical sequence depth and read length is important for allocating limited resources. To this end, we have generated eight assemblies for the complex genome of the maize inbred line NC358 using PacBio datasets ranging from 20× to 75× genomic depth and with N50 subread lengths of 11-21 kb. Assemblies with ≤30× depth and N50 subread length of 11 kb are highly fragmented, with even low-copy genic regions showing degradation at 20× depth. Distinct sequence-quality thresholds are observed for complete assembly of genes, transposable elements, and highly repetitive genomic features such as telomeres, heterochromatic knobs, and centromeres. In addition, we show high-quality optical maps can dramatically improve contiguity in even our most fragmented base assembly. This study provides a useful resource allocation reference to the community as long-read technologies continue to mature.

Subjects

High-Throughput Nucleotide Sequencing/methods; Inbreeding; Zea mays/genetics; Base Sequence; DNA Transposable Elements/genetics; Genome, Plant; Repetitive Sequences, Nucleic Acid/genetics

8.

Plant Reactome: a knowledgebase and resource for comparative pathway analysis.

Naithani, Sushma; Gupta, Parul; Preece, Justin; D'Eustachio, Peter; Elser, Justin L; Garg, Priyanka; Dikeman, Daemon A; Kiff, Jason; Cook, Justin; Olson, Andrew; Wei, Sharon; Tello-Ruiz, Marcela K; Mundo, Antonio Fabregat; Munoz-Pomer, Alfonso; Mohammed, Suhaib; Cheng, Tiejun; Bolton, Evan; Papatheodorou, Irene; Stein, Lincoln; Ware, Doreen; Jaiswal, Pankaj.

Nucleic Acids Res ; 48(D1): D1093-D1103, 2020 Jan 08.

Article in English | MEDLINE | ID: mdl-31680153

ABSTRACT

Plant Reactome (https://plantreactome.gramene.org) is an open-source, comparative plant pathway knowledgebase of the Gramene project. It uses Oryza sativa (rice) as a reference species for manual curation of pathways and extends pathway knowledge to another 82 plant species via gene-orthology projection using the Reactome data model and framework. It currently hosts 298 reference pathways, including metabolic and transport pathways, transcriptional networks, hormone signaling pathways, and plant developmental processes. In addition to browsing plant pathways, users can upload and analyze their omics data, such as the gene-expression data, and overlay curated or experimental gene-gene interaction data to extend pathway knowledge. The curation team actively engages researchers and students on gene and pathway curation by offering workshops and online tutorials. The Plant Reactome supports, implements and collaborates with the wider community to make data and tools related to genes, genomes, and pathways Findable, Accessible, Interoperable and Re-usable (FAIR).

Subjects

Computational Biology/methods; Databases, Genetic; Genomics; Metabolomics; Plants/genetics; Plants/metabolism; Proteomics; Gene Regulatory Networks; Genomics/methods; Humans; Metabolic Networks and Pathways; Metabolomics/methods; Proteomics/methods; Signal Transduction; Web Browser

9.

Publisher Correction: Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

Stein, Joshua C; Yu, Yeisoo; Copetti, Dario; Zwickl, Derrick J; Zhang, Li; Zhang, Chengjun; Chougule, Kapeel; Gao, Dongying; Iwata, Aiko; Goicoechea, Jose Luis; Wei, Sharon; Wang, Jun; Liao, Yi; Wang, Muhua; Jacquemin, Julie; Becker, Claude; Kudrna, Dave; Zhang, Jianwei; Londono, Carlos E M; Song, Xiang; Lee, Seunghee; Sanchez, Paul; Zuccolo, Andrea; Ammiraju, Jetty S S; Talag, Jayson; Danowitz, Ann; Rivera, Luis F; Gschwend, Andrea R; Noutsos, Christos; Wu, Cheng-Chieh; Kao, Shu-Min; Zeng, Jhih-Wun; Wei, Fu-Jin; Zhao, Qiang; Feng, Qi; El Baidouri, Moaine; Carpentier, Marie-Christine; Lasserre, Eric; Cooke, Richard; da Rosa Farias, Daniel; da Maia, Luciano Carlos; Dos Santos, Railson S; Nyberg, Kevin G; McNally, Kenneth L; Mauleon, Ramil; Alexandrov, Nickolai; Schmutz, Jeremy; Flowers, Dave; Fan, Chuanzhu; Weigel, Detlef.

Nat Genet ; 50(11): 1618, 2018 Nov.

Article in English | MEDLINE | ID: mdl-30291357

ABSTRACT

This article was not made open access when initially published online, which was corrected before print publication. In addition, ORCID links were missing for 12 authors and have been added to the HTML and PDF versions of the article.

10.

The maize W22 genome provides a foundation for functional genomics and transposon biology.

Springer, Nathan M; Anderson, Sarah N; Andorf, Carson M; Ahern, Kevin R; Bai, Fang; Barad, Omer; Barbazuk, W Brad; Bass, Hank W; Baruch, Kobi; Ben-Zvi, Gil; Buckler, Edward S; Bukowski, Robert; Campbell, Michael S; Cannon, Ethalinda K S; Chomet, Paul; Dawe, R Kelly; Davenport, Ruth; Dooner, Hugo K; Du, Limei He; Du, Chunguang; Easterling, Katherine A; Gault, Christine; Guan, Jiahn-Chou; Hunter, Charles T; Jander, Georg; Jiao, Yinping; Koch, Karen E; Kol, Guy; Köllner, Tobias G; Kudo, Toru; Li, Qing; Lu, Fei; Mayfield-Jones, Dustin; Mei, Wenbin; McCarty, Donald R; Noshay, Jaclyn M; Portwood, John L; Ronen, Gil; Settles, A Mark; Shem-Tov, Doron; Shi, Jinghua; Soifer, Ilya; Stein, Joshua C; Stitzer, Michelle C; Suzuki, Masaharu; Vera, Daniel L; Vollbrecht, Erik; Vrebalov, Julia T; Ware, Doreen; Wei, Sharon.

Nat Genet ; 50(9): 1282-1288, 2018 Sep.

Article in English | MEDLINE | ID: mdl-30061736

ABSTRACT

The maize W22 inbred has served as a platform for maize genetics since the mid-twentieth century. To streamline maize genome analyses, we have sequenced and de novo assembled a W22 reference genome using short-read sequencing technologies. We show that significant structural heterogeneity exists in comparison to the B73 reference genome at multiple scales, from transposon composition and copy number variation to single-nucleotide polymorphisms. The generation of this reference genome enables accurate placement of thousands of Mutator (Mu) and Dissociation (Ds) transposable element insertions for reverse and forward genetics studies. Annotation of the genome has been achieved using RNA-seq analysis, differential nuclease sensitivity profiling and bisulfite sequencing to map open reading frames, open chromatin sites and DNA methylation profiles, respectively. Collectively, the resources developed here integrate W22 as a community reference genome for functional genomics and provide a foundation for the maize pan-genome.

Subjects

DNA Transposable Elements/genetics; Genes, Plant/genetics; Genome, Plant/genetics; Zea mays/genetics; Chromatin/genetics; Chromosomes, Plant/genetics; DNA Copy Number Variations/genetics; DNA Methylation/genetics; DNA, Plant/genetics; Genomics/methods; Open Reading Frames/genetics; Sequence Analysis, DNA/methods

11.

Genomes of 13 domesticated and wild rice relatives highlight genetic conservation, turnover and innovation across the genus Oryza.

Stein, Joshua C; Yu, Yeisoo; Copetti, Dario; Zwickl, Derrick J; Zhang, Li; Zhang, Chengjun; Chougule, Kapeel; Gao, Dongying; Iwata, Aiko; Goicoechea, Jose Luis; Wei, Sharon; Wang, Jun; Liao, Yi; Wang, Muhua; Jacquemin, Julie; Becker, Claude; Kudrna, Dave; Zhang, Jianwei; Londono, Carlos E M; Song, Xiang; Lee, Seunghee; Sanchez, Paul; Zuccolo, Andrea; Ammiraju, Jetty S S; Talag, Jayson; Danowitz, Ann; Rivera, Luis F; Gschwend, Andrea R; Noutsos, Christos; Wu, Cheng-Chieh; Kao, Shu-Min; Zeng, Jhih-Wun; Wei, Fu-Jin; Zhao, Qiang; Feng, Qi; El Baidouri, Moaine; Carpentier, Marie-Christine; Lasserre, Eric; Cooke, Richard; Rosa Farias, Daniel da; da Maia, Luciano Carlos; Dos Santos, Railson S; Nyberg, Kevin G; McNally, Kenneth L; Mauleon, Ramil; Alexandrov, Nickolai; Schmutz, Jeremy; Flowers, Dave; Fan, Chuanzhu; Weigel, Detlef.

Nat Genet ; 50(2): 285-296, 2018 Feb.

Article in English | MEDLINE | ID: mdl-29358651

ABSTRACT

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that, despite few large-scale chromosomal rearrangements, rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young 'AA' subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 'Miracle Rice', which relieved famine and drove the Green Revolution in Asia 50 years ago.

Subjects

Crops, Agricultural/genetics; Evolution, Molecular; Genetic Variation; Oryza/classification; Oryza/genetics; Conserved Sequence; Domestication; Genetic Speciation; Genome, Plant; Phylogeny

12.

Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Kersey, Paul Julian; Allen, James E; Allot, Alexis; Barba, Matthieu; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Grabmueller, Christoph; Kumar, Navin; Liu, Zicheng; Maurel, Thomas; Moore, Ben; McDowall, Mark D; Maheswari, Uma; Naamati, Guy; Newman, Victoria; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Perry, Emily; Russell, Matthew; Sparrow, Helen; Tapanari, Electra; Taylor, Kieron; Vullo, Alessandro; Williams, Gareth; Zadissia, Amonida; Olson, Andrew; Stein, Joshua; Wei, Sharon; Tello-Ruiz, Marcela; Ware, Doreen; Luciani, Aurelien; Potter, Simon; Finn, Robert D; Urban, Martin; Hammond-Kosack, Kim E; Bolser, Dan M; De Silva, Nishadi; Howe, Kevin L; Langridge, Nicholas; Maslen, Gareth; Staines, Daniel Michael; Yates, Andrew.

Nucleic Acids Res ; 46(D1): D802-D808, 2018 Jan 04.

Article in English | MEDLINE | ID: mdl-29092050

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.

Subjects

Archaea/genetics; Bacteria/genetics; Databases, Genetic; Databases, Protein; Eukaryota/genetics; Genomics; Amino Acid Sequence; Animals; Base Sequence; Data Mining; Forecasting; Genome; Molecular Sequence Annotation; RNA/genetics; User-Computer Interface

13.

Gramene 2018: unifying comparative genomics and pathway resources for plant research.

Tello-Ruiz, Marcela K; Naithani, Sushma; Stein, Joshua C; Gupta, Parul; Campbell, Michael; Olson, Andrew; Wei, Sharon; Preece, Justin; Geniza, Matthew J; Jiao, Yinping; Lee, Young Koung; Wang, Bo; Mulvaney, Joseph; Chougule, Kapeel; Elser, Justin; Al-Bader, Noor; Kumari, Sunita; Thomason, James; Kumar, Vivek; Bolser, Daniel M; Naamati, Guy; Tapanari, Electra; Fonseca, Nuno; Huerta, Laura; Iqbal, Haider; Keays, Maria; Munoz-Pomer Fuentes, Alfonso; Tang, Amy; Fabregat, Antonio; D'Eustachio, Peter; Weiser, Joel; Stein, Lincoln D; Petryszak, Robert; Papatheodorou, Irene; Kersey, Paul J; Lockhart, Patti; Taylor, Crispin; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 46(D1): D1181-D1189, 2018 Jan 04.

Article in English | MEDLINE | ID: mdl-29165610

ABSTRACT

Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species. The current release, #54, includes over 1.7 million genes from 44 reference genomes, most of which were organized into 62,367 gene families through orthologous and paralogous gene classification, whole-genome alignments, and synteny. Additional gene annotations include ontology-based protein structure and function; genetic, epigenetic, and phenotypic diversity; and pathway associations. Gramene's Plant Reactome provides a knowledgebase of cellular-level plant pathway networks. Specifically, it uses curated rice reference pathways to derive pathway projections for an additional 66 species based on gene orthology, and facilitates display of gene expression, gene-gene interactions, and user-defined omics data in the context of these pathways. As a community portal, Gramene integrates best-of-class software and infrastructure components including the Ensembl genome browser, Reactome pathway browser, and Expression Atlas widgets, and undergoes periodic data and software upgrades. Via powerful, intuitive search interfaces, users can easily query across various portals and interactively analyze search results by clicking on diverse features such as genomic context, highly augmented gene trees, gene expression anatomograms, associated pathways, and external informatics resources. All data in Gramene are accessible through both visual and programmatic interfaces.

Subjects

Databases, Genetic; Gene Expression Regulation, Plant; Genomics/methods; Knowledge Bases; Plants/genetics; Epigenesis, Genetic; Gene Ontology; Genetic Research; Genetic Variation; Genome, Plant; Metabolic Networks and Pathways/genetics; Molecular Sequence Annotation; Plants/metabolism; Software; User-Computer Interface

14.

Ensembl Genomes 2016: more genomes, more complexity.

Kersey, Paul Julian; Allen, James E; Armean, Irina; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Falin, Lee J; Grabmueller, Christoph; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Aranganathan, Naveen K; Langridge, Nicholas; Lowy, Ernesto; McDowall, Mark D; Maheswari, Uma; Nuhn, Michael; Ong, Chuang Kee; Overduin, Bert; Paulini, Michael; Pedro, Helder; Perry, Emily; Spudich, Giulietta; Tapanari, Electra; Walts, Brandon; Williams, Gareth; Tello-Ruiz, Marcela; Stein, Joshua; Wei, Sharon; Ware, Doreen; Bolser, Daniel M; Howe, Kevin L; Kulesha, Eugene; Lawson, Daniel; Maslen, Gareth; Staines, Daniel M.

Nucleic Acids Res ; 44(D1): D574-80, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26578574

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.

Subjects

Databases, Genetic; Genome, Bacterial; Genome, Fungal; Genome, Plant; Invertebrates/genetics; Animals; Diploidy; Eukaryota/genetics; Genetic Variation; Genome; Polyploidy; Sequence Alignment

15.

Gramene Database: Navigating Plant Comparative Genomics Resources.

Gupta, Parul; Naithani, Sushma; Tello-Ruiz, Marcela Karey; Chougule, Kapeel; D'Eustachio, Peter; Fabregat, Antonio; Jiao, Yinping; Keays, Maria; Lee, Young Koung; Kumari, Sunita; Mulvaney, Joseph; Olson, Andrew; Preece, Justin; Stein, Joshua; Wei, Sharon; Weiser, Joel; Huerta, Laura; Petryszak, Robert; Kersey, Paul; Stein, Lincoln D; Ware, Doreen; Jaiswal, Pankaj.

Curr Plant Biol ; 7-8: 10-15, 2016 Nov.

Article in English | MEDLINE | ID: mdl-28713666

ABSTRACT

Gramene (http://www.gramene.org) is an online, open source, curated resource for plant comparative genomics and pathway analysis designed to support researchers working in plant genomics, breeding, evolutionary biology, system biology, and metabolic engineering. It exploits phylogenetic relationships to enrich the annotation of genomic data and provides tools to perform powerful comparative analyses across a wide spectrum of plant species. It consists of an integrated portal for querying, visualizing and analyzing data for 44 plant reference genomes, genetic variation data sets for 12 species, expression data for 16 species, curated rice pathways and orthology-based pathway projections for 66 plant species including various crops. Here we briefly describe the functions and uses of the Gramene database.

16.

Gramene: A Resource for Comparative Analysis of Plants Genomes and Pathways.

Tello-Ruiz, Marcela Karey; Stein, Joshua; Wei, Sharon; Youens-Clark, Ken; Jaiswal, Pankaj; Ware, Doreen.

Methods Mol Biol ; 1374: 141-63, 2016.

Article in English | MEDLINE | ID: mdl-26519404

ABSTRACT

Gramene is an integrated informatics resource for accessing, visualizing, and comparing plant genomes and biological pathways. Originally targeting grasses, Gramene has grown to host annotations for economically important and research model crops, including wheat, potato, tomato, banana, grape, poplar, and Chlamydomonas. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. This chapter outlines system requirements for end users and database hosting, data types and basic navigation within Gramene, and provides examples of how to (1) view a phylogenetic tree for a family of transcription factors, (2) explore genetic variation in the orthologues of a gene with a known trait association, and (3) upload, visualize, and privately share end user data into a new genome browser track. Moreover, this is the first publication describing Gramene's new web interface, intended to provide a simplified portal to the most complete and up-to-date set of plant genome and pathway annotations.

Subjects

Computational Biology/methods; Plants/genetics; Plants/metabolism; Software; Genome, Plant; Metabolic Networks and Pathways; Signal Transduction; Web Browser

17.

Gramene 2016: comparative plant genomics and pathway resources.

Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 44(D1): D1133-40, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26553803

ABSTRACT

Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ~200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.

Subjects

Databases, Genetic; Genome, Plant; Plants/metabolism; Gene Expression; Genetic Variation; Genomics; Internet; Metabolic Networks and Pathways; Molecular Sequence Annotation; Plants/genetics

18.

Gramene 2013: comparative plant genomics resources.

Monaco, Marcela K; Stein, Joshua; Naithani, Sushma; Wei, Sharon; Dharmawardhana, Palitha; Kumari, Sunita; Amarasinghe, Vindhya; Youens-Clark, Ken; Thomason, James; Preece, Justin; Pasternak, Shiran; Olson, Andrew; Jiao, Yinping; Lu, Zhenyuan; Bolser, Dan; Kerhornou, Arnaud; Staines, Dan; Walts, Brandon; Wu, Guanming; D'Eustachio, Peter; Haw, Robin; Croft, David; Kersey, Paul J; Stein, Lincoln; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 42(Database issue): D1193-9, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24217918

ABSTRACT

Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.

Subjects

Databases, Genetic; Genome, Plant; Genomics; Crops, Agricultural/genetics; Genetic Variation; Internet; Metabolic Networks and Pathways/genetics; Molecular Sequence Annotation; Plants/genetics; Plants/metabolism

19.

Gramene database in 2010: updates and extensions.

Youens-Clark, Ken; Buckler, Ed; Casstevens, Terry; Chen, Charles; Declerck, Genevieve; Derwent, Paul; Dharmawardhana, Palitha; Jaiswal, Pankaj; Kersey, Paul; Karthikeyan, A S; Lu, Jerry; McCouch, Susan R; Ren, Liya; Spooner, William; Stein, Joshua C; Thomason, Jim; Wei, Sharon; Ware, Doreen.

Nucleic Acids Res ; 39(Database issue): D1085-94, 2011 Jan.

Article in English | MEDLINE | ID: mdl-21076153

ABSTRACT

Now in its 10th year, the Gramene database (http://www.gramene.org) has grown from its primary focus on rice, the first fully-sequenced grass genome, to become a resource for major model and crop plants including Arabidopsis, Brachypodium, maize, sorghum, poplar and grape in addition to several species of rice. Gramene began with the addition of an Ensembl genome browser and has expanded in the last decade to become a robust resource for plant genomics hosting a wide array of data sets including quantitative trait loci (QTL), metabolic pathways, genetic diversity, genes, proteins, germplasm, literature, ontologies and a fully-structured markers and sequences database integrated with genome browsers and maps from various published studies (genetic, physical, bin, etc.). In addition, Gramene now hosts a variety of web services including a Distributed Annotation Server (DAS), BLAST and a public MySQL database. Twice a year, Gramene releases a major build of the database and makes interim releases to correct errors or to make important updates to software and/or data.

Subjects

Databases, Genetic; Genome, Plant; Plants/genetics; Chromosome Mapping; Genes, Plant; Genetic Variation; Genomics; Metabolic Networks and Pathways; Plants/metabolism; Quantitative Trait Loci; Synteny

20.

The DNA sequence, annotation and analysis of human chromosome 3.

Muzny, Donna M; Scherer, Steven E; Kaul, Rajinder; Wang, Jing; Yu, Jun; Sudbrak, Ralf; Buhay, Christian J; Chen, Rui; Cree, Andrew; Ding, Yan; Dugan-Rocha, Shannon; Gill, Rachel; Gunaratne, Preethi; Harris, R Alan; Hawes, Alicia C; Hernandez, Judith; Hodgson, Anne V; Hume, Jennifer; Jackson, Andrew; Khan, Ziad Mohid; Kovar-Smith, Christie; Lewis, Lora R; Lozado, Ryan J; Metzker, Michael L; Milosavljevic, Aleksandar; Miner, George R; Morgan, Margaret B; Nazareth, Lynne V; Scott, Graham; Sodergren, Erica; Song, Xing-Zhi; Steffen, David; Wei, Sharon; Wheeler, David A; Wright, Mathew W; Worley, Kim C; Yuan, Ye; Zhang, Zhengdong; Adams, Charles Q; Ansari-Lari, M Ali; Ayele, Mulu; Brown, Mary J; Chen, Guan; Chen, Zhijian; Clendenning, James; Clerc-Blankenburg, Kerstin P; Chen, Runsheng; Chen, Zhu; Davis, Clay; Delgado, Oliver.

Nature ; 440(7088): 1194-8, 2006 Apr 27.

Article in English | MEDLINE | ID: mdl-16641997

ABSTRACT

After the completion of a draft human genome sequence, the International Human Genome Sequencing Consortium has proceeded to finish and annotate each of the 24 chromosomes comprising the human genome. Here we describe the sequencing and analysis of human chromosome 3, one of the largest human chromosomes. Chromosome 3 comprises just four contigs, one of which currently represents the longest unbroken stretch of finished DNA sequence known so far. The chromosome is remarkable in having the lowest rate of segmental duplication in the genome. It also includes a chemokine receptor gene cluster as well as numerous loci involved in multiple human cancers such as the gene encoding FHIT, which contains the most common constitutive fragile site in the genome, FRA3B. Using genomic sequence from chimpanzee and rhesus macaque, we were able to characterize the breakpoints defining a large pericentric inversion that occurred some time after the split of Homininae from Ponginae, and propose an evolutionary history of the inversion.

Subjects

Chromosomes, Human, Pair 3/genetics; Animals; Base Sequence; Chromosome Breakage/genetics; Chromosome Inversion/genetics; Contig Mapping; CpG Islands/genetics; DNA, Complementary/genetics; Evolution, Molecular; Expressed Sequence Tags; Human Genome Project; Humans; Macaca mulatta/genetics; Molecular Sequence Data; Pan troglodytes/genetics; Sequence Analysis, DNA; Synteny/genetics
