Pesquisa | Portal de Pesquisa da BVS

1.

Green plant genomes: What we know in an era of rapidly expanding opportunities.

Kress, W John; Soltis, Douglas E; Kersey, Paul J; Wegrzyn, Jill L; Leebens-Mack, James H; Gostel, Morgan R; Liu, Xin; Soltis, Pamela S.

Proc Natl Acad Sci U S A ; 119(4)2022 01 25.

Artigo em Inglês | MEDLINE | ID: mdl-35042803

RESUMO

Green plants play a fundamental role in ecosystems, human health, and agriculture. As de novo genomes are being generated for all known eukaryotic species as advocated by the Earth BioGenome Project, increasing genomic information on green land plants is essential. However, setting standards for the generation and storage of the complex set of genomes that characterize the green lineage of life is a major challenge for plant scientists. Such standards will need to accommodate the immense variation in green plant genome size, transposable element content, and structural complexity while enabling research into the molecular and evolutionary processes that have resulted in this enormous genomic variation. Here we provide an overview and assessment of the current state of knowledge of green plant genomes. To date fewer than 300 complete chromosome-scale genome assemblies representing fewer than 900 species have been generated across the estimated 450,000 to 500,000 species in the green plant clade. These genomes range in size from 12 Mb to 27.6 Gb and are biased toward agricultural crops with large branches of the green tree of life untouched by genomic-scale sequencing. Locating suitable tissue samples of most species of plants, especially those taxa from extreme environments, remains one of the biggest hurdles to increasing our genomic inventory. Furthermore, the annotation of plant genomes is at present undergoing intensive improvement. It is our hope that this fresh overview will help in the development of genomic quality standards for a cohesive and meaningful synthesis of green plant genomes as we scale up for the future.

Assuntos

Sequência de Bases/genética , Genômica/tendências , Viridiplantae/genética , Biodiversidade , Evolução Biológica , Elementos de DNA Transponíveis/genética , Ecologia , Ecossistema , Embriófitas/genética , Evolução Molecular , Genoma , Genoma de Planta/genética , Genômica/métodos , Disseminação de Informação/métodos , Armazenamento e Recuperação da Informação/métodos , Filogenia , Plantas/genética

2.

Standards recommendations for the Earth BioGenome Project.

Lawniczak, Mara K N; Durbin, Richard; Flicek, Paul; Lindblad-Toh, Kerstin; Wei, Xiaofeng; Archibald, John M; Baker, William J; Belov, Katherine; Blaxter, Mark L; Marques Bonet, Tomas; Childers, Anna K; Coddington, Jonathan A; Crandall, Keith A; Crawford, Andrew J; Davey, Robert P; Di Palma, Federica; Fang, Qi; Haerty, Wilfried; Hall, Neil; Hoff, Katharina J; Howe, Kerstin; Jarvis, Erich D; Johnson, Warren E; Johnson, Rebecca N; Kersey, Paul J; Liu, Xin; Lopez, Jose Victor; Myers, Eugene W; Pettersson, Olga Vinnere; Phillippy, Adam M; Poelchau, Monica F; Pruitt, Kim D; Rhie, Arang; Castilla-Rubio, Juan Carlos; Sahu, Sunil Kumar; Salmon, Nicholas A; Soltis, Pamela S; Swarbreck, David; Thibaud-Nissen, Françoise; Wang, Sibo; Wegrzyn, Jill L; Zhang, Guojie; Zhang, He; Lewin, Harris A; Richards, Stephen.

Proc Natl Acad Sci U S A ; 119(4)2022 01 25.

Artigo em Inglês | MEDLINE | ID: mdl-35042802

RESUMO

A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met.

Assuntos

Sequência de Bases/genética , Eucariotos/genética , Genômica/normas , Animais , Biodiversidade , Genômica/métodos , Humanos , Padrões de Referência , Valores de Referência , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas

3.

Comparative evolutionary analyses of eight whitefly Bemisia tabaci sensu lato genomes: cryptic species, agricultural pests and plant-virus vectors.

Campbell, Lahcen I; Nwezeobi, Joachim; van Brunschot, Sharon L; Kaweesi, Tadeo; Seal, Susan E; Swamy, Rekha A R; Namuddu, Annet; Maslen, Gareth L; Mugerwa, Habibu; Armean, Irina M; Haggerty, Leanne; Martin, Fergal J; Malka, Osnat; Santos-Garcia, Diego; Juravel, Ksenia; Morin, Shai; Stephens, Michael E; Muhindira, Paul Visendi; Kersey, Paul J; Maruthi, M N; Omongo, Christopher A; Navas-Castillo, Jesús; Fiallo-Olivé, Elvira; Mohammed, Ibrahim Umar; Wang, Hua-Ling; Onyeka, Joseph; Alicai, Titus; Colvin, John.

BMC Genomics ; 24(1): 408, 2023 Jul 19.

Artigo em Inglês | MEDLINE | ID: mdl-37468834

RESUMO

BACKGROUND: The group of > 40 cryptic whitefly species called Bemisia tabaci sensu lato are amongst the world's worst agricultural pests and plant-virus vectors. Outbreaks of B. tabaci s.l. and the associated plant-virus diseases continue to contribute to global food insecurity and social instability, particularly in sub-Saharan Africa and Asia. Published B. tabaci s.l. genomes have limited use for studying African cassava B. tabaci SSA1 species, due to the high genetic divergences between them. Genomic annotations presented here were performed using the 'Ensembl gene annotation system', to ensure that comparative analyses and conclusions reflect biological differences, as opposed to arising from different methodologies underpinning transcript model identification. RESULTS: We present here six new B. tabaci s.l. genomes from Africa and Asia, and two re-annotated previously published genomes, to provide evolutionary insights into these globally distributed pests. Genome sizes ranged between 616-658 Mb and exhibited some of the highest coverage of transposable elements reported within Arthropoda. Many fewer total protein coding genes (PCG) were recovered compared to the previously published B. tabaci s.l. genomes and structural annotations generated via the uniform methodology strongly supported a repertoire of between 12.8-13.2 × 103 PCG. An integrative systematics approach incorporating phylogenomic analysis of nuclear and mitochondrial markers supported a monophyletic Aleyrodidae and the basal positioning of B. tabaci Uganda-1 to the sub-Saharan group of species. Reciprocal cross-mating data and the co-cladogenesis pattern of the primary obligate endosymbiont 'Candidatus Portiera aleyrodidarum' from 11 Bemisia genomes further supported the phylogenetic reconstruction to show that African cassava B. tabaci populations consist of just three biological species. We include comparative analyses of gene families related to detoxification, sugar metabolism, vector competency and evaluate the presence and function of horizontally transferred genes, essential for understanding the evolution and unique biology of constituent B. tabaci. s.l species. CONCLUSIONS: These genomic resources have provided new and critical insights into the genetics underlying B. tabaci s.l. biology. They also provide a rich foundation for post-genomic research, including the selection of candidate gene-targets for innovative whitefly and virus-control strategies.

Assuntos

Hemípteros , Vírus de Plantas , Animais , Filogenia , África , Ásia

4.

A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life.

Baker, William J; Bailey, Paul; Barber, Vanessa; Barker, Abigail; Bellot, Sidonie; Bishop, David; Botigué, Laura R; Brewer, Grace; Carruthers, Tom; Clarkson, James J; Cook, Jeffrey; Cowan, Robyn S; Dodsworth, Steven; Epitawalage, Niroshini; Françoso, Elaine; Gallego, Berta; Johnson, Matthew G; Kim, Jan T; Leempoel, Kevin; Maurin, Olivier; Mcginnie, Catherine; Pokorny, Lisa; Roy, Shyamali; Stone, Malcolm; Toledo, Eduardo; Wickett, Norman J; Zuntini, Alexandre R; Eiserhardt, Wolf L; Kersey, Paul J; Leitch, Ilia J; Forest, Félix.

Syst Biol ; 71(2): 301-319, 2022 02 10.

Artigo em Inglês | MEDLINE | ID: mdl-33983440

RESUMO

The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96$\%$) and 2333 genera (17$\%$). A "first pass" angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world's natural history collections. [Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.].

Assuntos

Magnoliopsida , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Magnoliopsida/genética , Filogenia

5.

Gramene 2021: harnessing the power of comparative genomics and pathways for plant research.

Tello-Ruiz, Marcela K; Naithani, Sushma; Gupta, Parul; Olson, Andrew; Wei, Sharon; Preece, Justin; Jiao, Yinping; Wang, Bo; Chougule, Kapeel; Garg, Priyanka; Elser, Justin; Kumari, Sunita; Kumar, Vivek; Contreras-Moreira, Bruno; Naamati, Guy; George, Nancy; Cook, Justin; Bolser, Daniel; D'Eustachio, Peter; Stein, Lincoln D; Gupta, Amit; Xu, Weijia; Regala, Jennifer; Papatheodorou, Irene; Kersey, Paul J; Flicek, Paul; Taylor, Crispin; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 49(D1): D1452-D1463, 2021 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-33170273

RESUMO

Gramene (http://www.gramene.org), a knowledgebase founded on comparative functional analyses of genomic and pathway data for model plants and major crops, supports agricultural researchers worldwide. The resource is committed to open access and reproducible science based on the FAIR data principles. Since the last NAR update, we made nine releases; doubled the genome portal's content; expanded curated genes, pathways and expression sets; and implemented the Domain Informational Vocabulary Extraction (DIVE) algorithm for extracting gene function information from publications. The current release, #63 (October 2020), hosts 93 reference genomes-over 3.9 million genes in 122 947 families with orthologous and paralogous classifications. Plant Reactome portrays pathway networks using a combination of manual biocuration in rice (320 reference pathways) and orthology-based projections to 106 species. The Reactome platform facilitates comparison between reference and projected pathways, gene expression analyses and overlays of gene-gene interactions. Gramene integrates ontology-based protein structure-function annotation; information on genetic, epigenetic, expression, and phenotypic diversity; and gene functional annotations extracted from plant-focused journals using DIVE. We train plant researchers in biocuration of genes and pathways; host curated maize gene structures as tracks in the maize genome browser; and integrate curated rice genes and pathways in the Plant Reactome.

Assuntos

Bases de Dados Genéticas , Regulação da Expressão Gênica de Plantas , Genoma de Planta , Genômica/métodos , Proteínas de Plantas/genética , Plantas/genética , Produtos Agrícolas , Elementos de DNA Transponíveis , Duplicação Gênica , Ontologia Genética , Redes Reguladoras de Genes , Internet , Bases de Conhecimento , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Oryza/genética , Oryza/metabolismo , Proteínas de Plantas/metabolismo , Plantas/classificação , Plantas/metabolismo , Poliploidia , Mapeamento de Interação de Proteínas , Software , Zea mays/genética , Zea mays/metabolismo

6.

Ensembl Genomes 2020-enabling non-vertebrate genomic research.

Howe, Kevin L; Contreras-Moreira, Bruno; De Silva, Nishadi; Maslen, Gareth; Akanni, Wasiu; Allen, James; Alvarez-Jarreta, Jorge; Barba, Matthieu; Bolser, Dan M; Cambell, Lahcen; Carbajo, Manuel; Chakiachvili, Marc; Christensen, Mikkel; Cummins, Carla; Cuzick, Alayne; Davis, Paul; Fexova, Silvie; Gall, Astrid; George, Nancy; Gil, Laurent; Gupta, Parul; Hammond-Kosack, Kim E; Haskell, Erin; Hunt, Sarah E; Jaiswal, Pankaj; Janacek, Sophie H; Kersey, Paul J; Langridge, Nick; Maheswari, Uma; Maurel, Thomas; McDowall, Mark D; Moore, Ben; Muffato, Matthieu; Naamati, Guy; Naithani, Sushma; Olson, Andrew; Papatheodorou, Irene; Patricio, Mateus; Paulini, Michael; Pedro, Helder; Perry, Emily; Preece, Justin; Rosello, Marc; Russell, Matthew; Sitnik, Vasily; Staines, Daniel M; Stein, Joshua; Tello-Ruiz, Marcela K; Trevanion, Stephen J; Urban, Martin.

Nucleic Acids Res ; 48(D1): D689-D695, 2020 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-31598706

RESUMO

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.

Assuntos

Biologia Computacional/métodos , Bases de Dados Genéticas , Variação Genética , Genoma Bacteriano , Genoma Fúngico , Genoma de Planta , Algoritmos , Animais , Caenorhabditis elegans/genética , Genômica , Internet , Anotação de Sequência Molecular , Fenótipo , Plantas/genética , Valores de Referência , Software , Interface Usuário-Computador

7.

The salmon louse genome: Copepod features and parasitic adaptations.

Skern-Mauritzen, Rasmus; Malde, Ketil; Eichner, Christiane; Dondrup, Michael; Furmanek, Tomasz; Besnier, Francois; Komisarczuk, Anna Zofia; Nuhn, Michael; Dalvin, Sussie; Edvardsen, Rolf B; Klages, Sven; Huettel, Bruno; Stueber, Kurt; Grotmol, Sindre; Karlsbakk, Egil; Kersey, Paul; Leong, Jong S; Glover, Kevin A; Reinhardt, Richard; Lien, Sigbjørn; Jonassen, Inge; Koop, Ben F; Nilsen, Frank.

Genomics ; 113(6): 3666-3680, 2021 11.

Artigo em Inglês | MEDLINE | ID: mdl-34403763

RESUMO

Copepods encompass numerous ecological roles including parasites, detrivores and phytoplankton grazers. Nonetheless, copepod genome assemblies remain scarce. Lepeophtheirus salmonis is an economically and ecologically important ectoparasitic copepod found on salmonid fish. We present the 695.4 Mbp L. salmonis genome assembly containing ≈60% repetitive regions and 13,081 annotated protein-coding genes. The genome comprises 14 autosomes and a ZZ-ZW sex chromosome system. Assembly assessment identified 92.4% of the expected arthropod genes. Transcriptomics supported annotation and indicated a marked shift in gene expression after host attachment, including apparent downregulation of genes related to circadian rhythm coinciding with abandoning diurnal migration. The genome shows evolutionary signatures including loss of genes needed for peroxisome biogenesis, presence of numerous FNII domains, and an incomplete heme homeostasis pathway suggesting heme proteins to be obtained from the host. Despite repeated development of resistance against chemical treatments L. salmonis exhibits low numbers of many genes involved in detoxification.

Assuntos

Copépodes , Doenças dos Peixes , Parasitos , Aclimatação , Animais , Copépodes/genética , Copépodes/parasitologia , Doenças dos Peixes/genética , Parasitos/genética , Transcriptoma

8.

An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations.

Clavijo, Bernardo J; Venturini, Luca; Schudoma, Christian; Accinelli, Gonzalo Garcia; Kaithakottil, Gemy; Wright, Jonathan; Borrill, Philippa; Kettleborough, George; Heavens, Darren; Chapman, Helen; Lipscombe, James; Barker, Tom; Lu, Fu-Hao; McKenzie, Neil; Raats, Dina; Ramirez-Gonzalez, Ricardo H; Coince, Aurore; Peel, Ned; Percival-Alwyn, Lawrence; Duncan, Owen; Trösch, Josua; Yu, Guotai; Bolser, Dan M; Namaati, Guy; Kerhornou, Arnaud; Spannagl, Manuel; Gundlach, Heidrun; Haberer, Georg; Davey, Robert P; Fosker, Christine; Palma, Federica Di; Phillips, Andrew L; Millar, A Harvey; Kersey, Paul J; Uauy, Cristobal; Krasileva, Ksenia V; Swarbreck, David; Bevan, Michael W; Clark, Matthew D.

Genome Res ; 27(5): 885-896, 2017 05.

Artigo em Inglês | MEDLINE | ID: mdl-28420692

RESUMO

Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.

Assuntos

Mapeamento de Sequências Contíguas/métodos , Genoma de Planta , Anotação de Sequência Molecular/métodos , Proteínas de Plantas/genética , Translocação Genética , Triticum/genética , Algoritmos , Mapeamento de Sequências Contíguas/normas , Anotação de Sequência Molecular/normas , Polimorfismo Genético , Poliploidia

9.

Enabling reusability of plant phenomic datasets with MIAPPE 1.1.

Papoutsoglou, Evangelia A; Faria, Daniel; Arend, Daniel; Arnaud, Elizabeth; Athanasiadis, Ioannis N; Chaves, Inês; Coppens, Frederik; Cornut, Guillaume; Costa, Bruno V; Cwiek-Kupczynska, Hanna; Droesbeke, Bert; Finkers, Richard; Gruden, Kristina; Junker, Astrid; King, Graham J; Krajewski, Pawel; Lange, Matthias; Laporte, Marie-Angélique; Michotey, Célia; Oppermann, Markus; Ostler, Richard; Poorter, Hendrik; Rami Rez-Gonzalez, Ricardo; Ramsak, Ziva; Reif, Jochen C; Rocca-Serra, Philippe; Sansone, Susanna-Assunta; Scholz, Uwe; Tardieu, François; Uauy, Cristobal; Usadel, Björn; Visser, Richard G F; Weise, Stephan; Kersey, Paul J; Miguel, Célia M; Adam-Blondon, Anne-Françoise; Pommier, Cyril.

New Phytol ; 227(1): 260-273, 2020 07.

Artigo em Inglês | MEDLINE | ID: mdl-32171029

RESUMO

Enabling data reuse and knowledge discovery is increasingly critical in modern science, and requires an effort towards standardising data publication practices. This is particularly challenging in the plant phenotyping domain, due to its complexity and heterogeneity. We have produced the MIAPPE 1.1 release, which enhances the existing MIAPPE standard in coverage, to support perennial plants, in structure, through an explicit data model, and in clarity, through definitions and examples. We evaluated MIAPPE 1.1 by using it to express several heterogeneous phenotyping experiments in a range of different formats, to demonstrate its applicability and the interoperability between the various implementations. Furthermore, the extended coverage is demonstrated by the fact that one of the datasets could not have been described under MIAPPE 1.0. MIAPPE 1.1 marks a major step towards enabling plant phenotyping data reusability, thanks to its extended coverage, and especially the formalisation of its data model, which facilitates its implementation in different formats. Community feedback has been critical to this development, and will be a key part of ensuring adoption of the standard.

Assuntos

Fenômica , Plantas , Plantas/genética

10.

WormBase 2017: molting into a new stage.

Lee, Raymond Y N; Howe, Kevin L; Harris, Todd W; Arnaboldi, Valerio; Cain, Scott; Chan, Juancarlos; Chen, Wen J; Davis, Paul; Gao, Sibyl; Grove, Christian; Kishore, Ranjana; Muller, Hans-Michael; Nakamura, Cecilia; Nuin, Paulo; Paulini, Michael; Raciti, Daniela; Rodgers, Faye; Russell, Matt; Schindelman, Gary; Tuli, Mary Ann; Van Auken, Kimberly; Wang, Qinghua; Williams, Gary; Wright, Adam; Yook, Karen; Berriman, Matthew; Kersey, Paul; Schedl, Tim; Stein, Lincoln; Sternberg, Paul W.

Nucleic Acids Res ; 46(D1): D869-D874, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29069413

RESUMO

WormBase (http://www.wormbase.org) is an important knowledge resource for biomedical researchers worldwide. To accommodate the ever increasing amount and complexity of research data, WormBase continues to advance its practices on data acquisition, curation and retrieval to most effectively deliver comprehensive knowledge about Caenorhabditis elegans, and genomic information about other nematodes and parasitic flatworms. Recent notable enhancements include user-directed submission of data, such as micropublication; genomic data curation and presentation, including additional genomes and JBrowse, respectively; new query tools, such as SimpleMine, Gene Enrichment Analysis; new data displays, such as the Person Lineage browser and the Summary of Ontology-based Annotations. Anticipating more rapid data growth ahead, WormBase continues the process of migrating to a cutting-edge database technology to achieve better stability, scalability, reproducibility and a faster response time. To better serve the broader research community, WormBase, with five other Model Organism Databases and The Gene Ontology project, have begun to collaborate formally as the Alliance of Genome Resources.

Assuntos

Bases de Dados Genéticas , Genoma , Nematoides/genética , Animais , Caenorhabditis/genética , Caenorhabditis elegans/genética , Curadoria de Dados , Mineração de Dados , Conjuntos de Dados como Assunto , Modelos Animais de Doenças , Previsões , Ontologia Genética , Humanos , Armazenamento e Recuperação da Informação , Platelmintos/genética , Editoração , Interferência de RNA , Alinhamento de Sequência , Interface Usuário-Computador , Navegador

11.

Gramene 2018: unifying comparative genomics and pathway resources for plant research.

Tello-Ruiz, Marcela K; Naithani, Sushma; Stein, Joshua C; Gupta, Parul; Campbell, Michael; Olson, Andrew; Wei, Sharon; Preece, Justin; Geniza, Matthew J; Jiao, Yinping; Lee, Young Koung; Wang, Bo; Mulvaney, Joseph; Chougule, Kapeel; Elser, Justin; Al-Bader, Noor; Kumari, Sunita; Thomason, James; Kumar, Vivek; Bolser, Daniel M; Naamati, Guy; Tapanari, Electra; Fonseca, Nuno; Huerta, Laura; Iqbal, Haider; Keays, Maria; Munoz-Pomer Fuentes, Alfonso; Tang, Amy; Fabregat, Antonio; D'Eustachio, Peter; Weiser, Joel; Stein, Lincoln D; Petryszak, Robert; Papatheodorou, Irene; Kersey, Paul J; Lockhart, Patti; Taylor, Crispin; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 46(D1): D1181-D1189, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29165610

RESUMO

Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species. The current release, #54, includes over 1.7 million genes from 44 reference genomes, most of which were organized into 62,367 gene families through orthologous and paralogous gene classification, whole-genome alignments, and synteny. Additional gene annotations include ontology-based protein structure and function; genetic, epigenetic, and phenotypic diversity; and pathway associations. Gramene's Plant Reactome provides a knowledgebase of cellular-level plant pathway networks. Specifically, it uses curated rice reference pathways to derive pathway projections for an additional 66 species based on gene orthology, and facilitates display of gene expression, gene-gene interactions, and user-defined omics data in the context of these pathways. As a community portal, Gramene integrates best-of-class software and infrastructure components including the Ensembl genome browser, Reactome pathway browser, and Expression Atlas widgets, and undergoes periodic data and software upgrades. Via powerful, intuitive search interfaces, users can easily query across various portals and interactively analyze search results by clicking on diverse features such as genomic context, highly augmented gene trees, gene expression anatomograms, associated pathways, and external informatics resources. All data in Gramene are accessible through both visual and programmatic interfaces.

Assuntos

Bases de Dados Genéticas , Regulação da Expressão Gênica de Plantas , Genômica/métodos , Bases de Conhecimento , Plantas/genética , Epigênese Genética , Ontologia Genética , Pesquisa em Genética , Variação Genética , Genoma de Planta , Redes e Vias Metabólicas/genética , Anotação de Sequência Molecular , Plantas/metabolismo , Software , Interface Usuário-Computador

12.

Ensembl Genomes 2018: an integrated omics infrastructure for non-vertebrate species.

Kersey, Paul Julian; Allen, James E; Allot, Alexis; Barba, Matthieu; Boddu, Sanjay; Bolt, Bruce J; Carvalho-Silva, Denise; Christensen, Mikkel; Davis, Paul; Grabmueller, Christoph; Kumar, Navin; Liu, Zicheng; Maurel, Thomas; Moore, Ben; McDowall, Mark D; Maheswari, Uma; Naamati, Guy; Newman, Victoria; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Perry, Emily; Russell, Matthew; Sparrow, Helen; Tapanari, Electra; Taylor, Kieron; Vullo, Alessandro; Williams, Gareth; Zadissia, Amonida; Olson, Andrew; Stein, Joshua; Wei, Sharon; Tello-Ruiz, Marcela; Ware, Doreen; Luciani, Aurelien; Potter, Simon; Finn, Robert D; Urban, Martin; Hammond-Kosack, Kim E; Bolser, Dan M; De Silva, Nishadi; Howe, Kevin L; Langridge, Nicholas; Maslen, Gareth; Staines, Daniel Michael; Yates, Andrew.

Nucleic Acids Res ; 46(D1): D802-D808, 2018 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-29092050

RESUMO

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.

Assuntos

Archaea/genética , Bactérias/genética , Bases de Dados Genéticas , Bases de Dados de Proteínas , Eucariotos/genética , Genômica , Sequência de Aminoácidos , Animais , Sequência de Bases , Mineração de Dados , Previsões , Genoma , Anotação de Sequência Molecular , RNA/genética , Interface Usuário-Computador

13.

The Earth BioGenome Project 2020: Starting the clock.

Lewin, Harris A; Richards, Stephen; Lieberman Aiden, Erez; Allende, Miguel L; Archibald, John M; Bálint, Miklós; Barker, Katharine B; Baumgartner, Bridget; Belov, Katherine; Bertorelle, Giorgio; Blaxter, Mark L; Cai, Jing; Caperello, Nicolette D; Carlson, Keith; Castilla-Rubio, Juan Carlos; Chaw, Shu-Miaw; Chen, Lei; Childers, Anna K; Coddington, Jonathan A; Conde, Dalia A; Corominas, Montserrat; Crandall, Keith A; Crawford, Andrew J; DiPalma, Federica; Durbin, Richard; Ebenezer, ThankGod E; Edwards, Scott V; Fedrigo, Olivier; Flicek, Paul; Formenti, Giulio; Gibbs, Richard A; Gilbert, M Thomas P; Goldstein, Melissa M; Graves, Jennifer Marshall; Greely, Henry T; Grigoriev, Igor V; Hackett, Kevin J; Hall, Neil; Haussler, David; Helgen, Kristofer M; Hogg, Carolyn J; Isobe, Sachiko; Jakobsen, Kjetill Sigurd; Janke, Axel; Jarvis, Erich D; Johnson, Warren E; Jones, Steven J M; Karlsson, Elinor K; Kersey, Paul J; Kim, Jin-Hyoung.

Proc Natl Acad Sci U S A ; 119(4)2022 01 25.

Artigo em Inglês | MEDLINE | ID: mdl-35042800

Assuntos

Sequência de Bases/genética , Eucariotos/genética , Animais , Biodiversidade , Genômica , Humanos

14.

RNAcentral: a comprehensive database of non-coding RNA sequences.

Petrov, Anton I; Kay, Simon J E; Kalvari, Ioanna; Howe, Kevin L; Gray, Kristian A; Bruford, Elspeth A; Kersey, Paul J; Cochrane, Guy; Finn, Robert D; Bateman, Alex; Kozomara, Ana; Griffiths-Jones, Sam; Frankish, Adam; Zwieb, Christian W; Lau, Britney Y; Williams, Kelly P; Chan, Patricia P; Lowe, Todd M; Cannone, Jamie J; Gutell, Robin; Machnicka, Magdalena A; Bujnicki, Janusz M; Yoshihama, Maki; Kenmochi, Naoya; Chai, Benli; Cole, James R; Szymanski, Maciej; Karlowski, Wojciech M; Wood, Valerie; Huala, Eva; Berardini, Tanya Z; Zhao, Yi; Chen, Runsheng; Zhu, Weimin; Paraskevopoulou, Maria D; Vlachos, Ioannis S; Hatzigeorgiou, Artemis G; Ma, Lina; Zhang, Zhang; Puetz, Joern; Stadler, Peter F; McDonald, Daniel; Basu, Siddhartha; Fey, Petra; Engel, Stacia R; Cherry, J Michael; Volders, Pieter-Jan; Mestdagh, Pieter; Wower, Jacek; Clark, Michael B.

Nucleic Acids Res ; 45(D1): D128-D134, 2017 01 04.

Artigo em Inglês | MEDLINE | ID: mdl-27794554

RESUMO

RNAcentral is a database of non-coding RNA (ncRNA) sequences that aggregates data from specialised ncRNA resources and provides a single entry point for accessing ncRNA sequences of all ncRNA types from all organisms. Since its launch in 2014, RNAcentral has integrated twelve new resources, taking the total number of collaborating database to 22, and began importing new types of data, such as modified nucleotides from MODOMICS and PDB. We created new species-specific identifiers that refer to unique RNA sequences within a context of single species. The website has been subject to continuous improvements focusing on text and sequence similarity searches as well as genome browsing functionality. All RNAcentral data is provided for free and is available for browsing, bulk downloads, and programmatic access at http://rnacentral.org/.

Assuntos

Bases de Dados de Ácidos Nucleicos , RNA não Traduzido/química , Animais , Genômica , Humanos , Nucleotídeos/química , Análise de Sequência de RNA , Especificidade da Espécie

15.

The genome of the biting midge Culicoides sonorensis and gene expression analyses of vector competence for bluetongue virus.

Morales-Hojas, Ramiro; Hinsley, Malcolm; Armean, Irina M; Silk, Rhiannon; Harrup, Lara E; Gonzalez-Uriarte, Asier; Veronesi, Eva; Campbell, Lahcen; Nayduch, Dana; Saski, Christopher; Tabachnick, Walter J; Kersey, Paul; Carpenter, Simon; Fife, Mark.

BMC Genomics ; 19(1): 624, 2018 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-30134833

RESUMO

BACKGROUND: The new genomic technologies have provided novel insights into the genetics of interactions between vectors, viruses and hosts, which are leading to advances in the control of arboviruses of medical importance. However, the development of tools and resources available for vectors of non-zoonotic arboviruses remains neglected. Biting midges of the genus Culicoides transmit some of the most important arboviruses of wildlife and livestock worldwide, with a global impact on economic productivity, health and welfare. The absence of a suitable reference genome has hindered genomic analyses to date in this important genus of vectors. In the present study, the genome of Culicoides sonorensis, a vector of bluetongue virus (BTV) in the USA, has been sequenced to provide the first reference genome for these vectors. In this study, we also report the use of the reference genome to perform initial transcriptomic analyses of vector competence for BTV. RESULTS: Our analyses reveal that the genome is 189 Mb, assembled in 7974 scaffolds. Its annotation using the transcriptomic data generated in this study and in a previous study has identified 15,612 genes. Gene expression analyses of C. sonorensis females infected with BTV performed in this study revealed 165 genes that were differentially expressed between vector competent and refractory females. Two candidate genes, glutathione S-transferase (gst) and the antiviral helicase ski2, previously recognized as involved in vector competence for BTV in C. sonorensis (gst) and repressing dsRNA virus propagation (ski2), were confirmed in this study. CONCLUSIONS: The reference genome of C. sonorensis has enabled preliminary analyses of the gene expression profiles of vector competent and refractory individuals. The genome and transcriptomes generated in this study provide suitable tools for future research on arbovirus transmission. These provide a valuable resource for these vector lineage, which diverged from other major Dipteran vector families over 200 million years ago. The genome will be a valuable source of comparative data for other important Dipteran vector families including mosquitoes (Culicidae) and sandflies (Psychodidae), and together with the transcriptomic data can yield potential targets for transgenic modification in vector control and functional studies.

Assuntos

Vírus Bluetongue/fisiologia , Bluetongue/transmissão , Ceratopogonidae/genética , Ceratopogonidae/virologia , Genoma de Inseto , Insetos Vetores , Animais , Bluetongue/imunologia , Bluetongue/virologia , Vírus Bluetongue/imunologia , Ceratopogonidae/imunologia , Evolução Molecular , Perfilação da Expressão Gênica , Interações Hospedeiro-Patógeno/genética , Interações Hospedeiro-Patógeno/imunologia , Imunidade Inata/genética , Insetos Vetores/genética , Insetos Vetores/fisiologia , Anotação de Sequência Molecular , Análise de Sequência de DNA , Transcriptoma/genética

16.

Plant genetic resources for food and agriculture: opportunities and challenges emerging from the science and information technology revolution.

Halewood, Michael; Chiurugwi, Tinashe; Sackville Hamilton, Ruaraidh; Kurtz, Brad; Marden, Emily; Welch, Eric; Michiels, Frank; Mozafari, Javad; Sabran, Muhamad; Patron, Nicola; Kersey, Paul; Bastow, Ruth; Dorius, Shawn; Dias, Sonia; McCouch, Susan; Powell, Wayne.

New Phytol ; 217(4): 1407-1419, 2018 03.

Artigo em Inglês | MEDLINE | ID: mdl-29359808

RESUMO

Contents Summary 1407 I. Introduction 1408 II. Technological advances and their utility for gene banks and breeding, and longer-term contributions to SDGs 1408 III. The challenges that must be overcome to realise emerging R&D opportunities 1410 IV. Renewed governance structures for PGR (and related big data) 1413 V. Access and benefit sharing and big data 1416 VI. Conclusion 1417 Acknowledgements 1417 ORCID 1417 References 1417 SUMMARY: Over the last decade, there has been an ongoing revolution in the exploration, manipulation and synthesis of biological systems, through the development of new technologies that generate, analyse and exploit big data. Users of Plant Genetic Resources (PGR) can potentially leverage these capacities to significantly increase the efficiency and effectiveness of their efforts to conserve, discover and utilise novel qualities in PGR, and help achieve the Sustainable Development Goals (SDGs). This review advances the discussion on these emerging opportunities and discusses how taking advantage of them will require data integration and synthesis across disciplinary, organisational and international boundaries, and the formation of multi-disciplinary, international partnerships. We explore some of the institutional and policy challenges that these efforts will face, particularly how these new technologies may influence the structure and role of research for sustainable development, ownership of resources, and access and benefit sharing. We discuss potential responses to political and institutional challenges, ranging from options for enhanced structure and governance of research discovery platforms to internationally brokered benefit-sharing agreements, and identify a set of broad principles that could guide the global community as it seeks or considers solutions.

Assuntos

Agricultura , Alimentos , Tecnologia da Informação , Plantas/genética , Ciência , Cruzamento

17.

Analysis of the bread wheat genome using whole-genome shotgun sequencing.

Brenchley, Rachel; Spannagl, Manuel; Pfeifer, Matthias; Barker, Gary L A; D'Amore, Rosalinda; Allen, Alexandra M; McKenzie, Neil; Kramer, Melissa; Kerhornou, Arnaud; Bolser, Dan; Kay, Suzanne; Waite, Darren; Trick, Martin; Bancroft, Ian; Gu, Yong; Huo, Naxin; Luo, Ming-Cheng; Sehgal, Sunish; Gill, Bikram; Kianian, Sharyar; Anderson, Olin; Kersey, Paul; Dvorak, Jan; McCombie, W Richard; Hall, Anthony; Mayer, Klaus F X; Edwards, Keith J; Bevan, Michael W; Hall, Neil.

Nature ; 491(7426): 705-10, 2012 Nov 29.

Artigo em Inglês | MEDLINE | ID: mdl-23192148

RESUMO

Bread wheat (Triticum aestivum) is a globally important crop, accounting for 20 per cent of the calories consumed by humans. Major efforts are underway worldwide to increase wheat production by extending genetic diversity and analysing key traits, and genomic resources can accelerate progress. But so far the very large size and polyploid complexity of the bread wheat genome have been substantial barriers to genome analysis. Here we report the sequencing of its large, 17-gigabase-pair, hexaploid genome using 454 pyrosequencing, and comparison of this with the sequences of diploid ancestral and progenitor genomes. We identified between 94,000 and 96,000 genes, and assigned two-thirds to the three component genomes (A, B and D) of hexaploid wheat. High-resolution synteny maps identified many small disruptions to conserved gene order. We show that the hexaploid genome is highly dynamic, with significant loss of gene family members on polyploidization and domestication, and an abundance of gene fragments. Several classes of genes involved in energy harvesting, metabolism and growth are among expanded gene families that could be associated with crop productivity. Our analyses, coupled with the identification of extensive genetic variation, provide a resource for accelerating gene discovery and improving this major crop.

Assuntos

Pão , Genoma de Planta/genética , Triticum/genética , Brachypodium/genética , Cromossomos de Plantas/genética , Produtos Agrícolas/genética , DNA Complementar/genética , DNA de Plantas/genética , Evolução Molecular , Genes de Plantas/genética , Genômica , Família Multigênica/genética , Oryza/genética , Polimorfismo de Nucleotídeo Único/genética , Poliploidia , Pseudogenes/genética , Alinhamento de Sequência , Análise de Sequência de DNA , Triticum/classificação , Zea mays/genética

18.

PhytoPath: an integrative resource for plant pathogen genomics.

Pedro, Helder; Maheswari, Uma; Urban, Martin; Irvine, Alistair George; Cuzick, Alayne; McDowall, Mark D; Staines, Daniel M; Kulesha, Eugene; Hammond-Kosack, Kim Elizabeth; Kersey, Paul Julian.

Nucleic Acids Res ; 44(D1): D688-93, 2016 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-26476449

RESUMO

PhytoPath (www.phytopathdb.org) is a resource for genomic and phenotypic data from plant pathogen species, that integrates phenotypic data for genes from PHI-base, an expertly curated catalog of genes with experimentally verified pathogenicity, with the Ensembl tools for data visualization and analysis. The resource is focused on fungi, protists (oomycetes) and bacterial plant pathogens that have genomes that have been sequenced and annotated. Genes with associated PHI-base data can be easily identified across all plant pathogen species using a BioMart-based query tool and visualized in their genomic context on the Ensembl genome browser. The PhytoPath resource contains data for 135 genomic sequences from 87 plant pathogen species, and 1364 genes curated for their role in pathogenicity and as targets for chemical intervention. Support for community annotation of gene models is provided using the WebApollo online gene editor, and we are working with interested communities to improve reference annotation for selected species.

Assuntos

Bases de Dados Genéticas , Genômica , Interações Hospedeiro-Patógeno/genética , Doenças das Plantas/microbiologia , Genes Bacterianos , Genes Fúngicos , Genoma Bacteriano , Genoma Fúngico , Oomicetos/genética , Fenótipo , Alinhamento de Sequência

19.

WormBase 2016: expanding to enable helminth genomic research.

Howe, Kevin L; Bolt, Bruce J; Cain, Scott; Chan, Juancarlos; Chen, Wen J; Davis, Paul; Done, James; Down, Thomas; Gao, Sibyl; Grove, Christian; Harris, Todd W; Kishore, Ranjana; Lee, Raymond; Lomax, Jane; Li, Yuling; Muller, Hans-Michael; Nakamura, Cecilia; Nuin, Paulo; Paulini, Michael; Raciti, Daniela; Schindelman, Gary; Stanley, Eleanor; Tuli, Mary Ann; Van Auken, Kimberly; Wang, Daniel; Wang, Xiaodong; Williams, Gary; Wright, Adam; Yook, Karen; Berriman, Matthew; Kersey, Paul; Schedl, Tim; Stein, Lincoln; Sternberg, Paul W.

Nucleic Acids Res ; 44(D1): D774-80, 2016 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-26578572

RESUMO

WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research.

Assuntos

Caenorhabditis elegans/genética , Bases de Dados Genéticas , Genoma Helmíntico , Genômica , Nematoides/genética , Animais , Genes de Helmintos , Anotação de Sequência Molecular , Platelmintos/genética , Software

20.

Gramene 2016: comparative plant genomics and pathway resources.

Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 44(D1): D1133-40, 2016 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-26553803

RESUMO

Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to â¼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.

Assuntos

Bases de Dados Genéticas , Genoma de Planta , Plantas/metabolismo , Expressão Gênica , Variação Genética , Genômica , Internet , Redes e Vias Metabólicas , Anotação de Sequência Molecular , Plantas/genética

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA