Search | BVS CLAP/SMR-OPAS/OMS

1.

A cloud-based resource for genome coordinate-based exploration and large-scale analysis of chromosome aberrations and gene fusions in cancer.

Wang, Janet; Zheng, Jeanne; Lee, Elaine E; Aguilar, Boris; Phan, John; Abdilleh, Kawther; Taylor, Ronald C; Longabaugh, William; Johansson, Bertil; Mertens, Fredrik; Mitelman, Felix; Pot, David; LaFramboise, Thomas.

Genes Chromosomes Cancer ; 62(8): 441-448, 2023 Aug.

Article in English | MEDLINE | ID: mdl-36695636

ABSTRACT

Cytogenetic analysis provides important information on the genetic mechanisms of cancer. The Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer (Mitelman DB) is the largest catalog of acquired chromosome aberrations, presently comprising >70 000 cases across multiple cancer types. Although this resource has enabled the identification of chromosome abnormalities leading to specific cancers and cancer mechanisms, a large-scale, systematic analysis of these aberrations and their downstream implications has been difficult due to the lack of a standard, automated mapping from aberrations to genomic coordinates. We previously introduced CytoConverter as a tool that automates such conversions. CytoConverter has now been updated with improved interpretation of karyotypes and has been integrated with the Mitelman DB, providing a comprehensive mapping of the 70 000+ cases to genomic coordinates, as well as visualization of the frequencies of chromosomal gains and losses. Importantly, all CytoConverter-generated genomic coordinates are publicly available in Google BigQuery, a cloud-based data warehouse, facilitating data exploration and integration with other datasets hosted by the Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC) Resource. We demonstrate the use of BigQuery for integrative analysis of Mitelman DB with other cancer datasets, including a comparison of the frequency of imbalances identified in Mitelman DB cases with those found in The Cancer Genome Atlas (TCGA) copy number datasets. This solution provides opportunities to leverage the power of cloud computing for low-cost, scalable, and integrated analysis of chromosome aberrations and gene fusions in cancer.

Subjects

Cloud Computing , Neoplasms , Humans , Chromosome Aberrations , Karyotyping , Neoplasms/genetics , Gene Fusion

2.

National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence.

Fedorov, Andrey; Longabaugh, William J R; Pot, David; Clunie, David A; Pieper, Steven D; Gibbs, David L; Bridge, Christopher; Herrmann, Markus D; Homeyer, André; Lewis, Rob; Aerts, Hugo J W L; Krishnaswamy, Deepa; Thiriveedhi, Vamsi Krishna; Ciausu, Cosmin; Schacherer, Daniela P; Bontempi, Dennis; Pihl, Todd; Wagner, Ulrike; Farahani, Keyvan; Kim, Erika; Kikinis, Ron.

Radiographics ; 43(12): e230180, 2023 Dec.

Article in English | MEDLINE | ID: mdl-37999984

ABSTRACT

The remarkable advances of artificial intelligence (AI) technology are revolutionizing established approaches to the acquisition, interpretation, and analysis of biomedical imaging data. Development, validation, and continuous refinement of AI tools requires easy access to large high-quality annotated datasets, which are both representative and diverse. The National Cancer Institute (NCI) Imaging Data Commons (IDC) hosts large and diverse publicly available cancer image data collections. By harmonizing all data based on industry standards and colocalizing it with analysis and exploration resources, the IDC aims to facilitate the development, validation, and clinical translation of AI tools and address the well-documented challenges of establishing reproducible and transparent AI processing pipelines. Balanced use of established commercial products with open-source solutions, interconnected by standard interfaces, provides value and performance, while preserving sufficient agility to address the evolving needs of the research community. Emphasis on the development of tools, use cases to demonstrate the utility of uniform data representation, and cloud-based analysis aim to ease adoption and help define best practices. Integration with other data in the broader NCI Cancer Research Data Commons infrastructure opens opportunities for multiomics studies incorporating imaging data to further empower the research community to accelerate breakthroughs in cancer detection, diagnosis, and treatment. Published under a CC BY 4.0 license.

Subjects

Artificial Intelligence , Neoplasms , United States , Humans , National Cancer Institute (U.S.) , Reproducibility of Results , Diagnostic Imaging , Multiomics , Neoplasms/diagnostic imaging

3.

Evolutionary forces affecting synonymous variations in plant genomes.

Clément, Yves; Sarah, Gautier; Holtz, Yan; Homa, Felix; Pointet, Stéphanie; Contreras, Sandy; Nabholz, Benoit; Sabot, François; Sauné, Laure; Ardisson, Morgane; Bacilieri, Roberto; Besnard, Guillaume; Berger, Angélique; Cardi, Céline; De Bellis, Fabien; Fouet, Olivier; Jourda, Cyril; Khadari, Bouchaib; Lanaud, Claire; Leroy, Thierry; Pot, David; Sauvage, Christopher; Scarcelli, Nora; Tregear, James; Vigouroux, Yves; Yahiaoui, Nabila; Ruiz, Manuel; Santoni, Sylvain; Labouisse, Jean-Pierre; Pham, Jean-Louis; David, Jacques; Glémin, Sylvain.

PLoS Genet ; 13(5): e1006799, 2017 May.

Article in English | MEDLINE | ID: mdl-28531201

ABSTRACT

Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow the joint estimation of the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones.

Subjects

Molecular Evolution , Plant Genome , Magnoliopsida/genetics , Genetic Polymorphism , GC-Rich Sequence , Gene Conversion , Genetic Selection

4.

Population structure and genetic relationships between Ethiopian and Brazilian Coffea arabica genotypes revealed by SSR markers.

da Silva, Bruna Silvestre Rodrigues; Sant'Ana, Gustavo César; Chaves, Camila Lucas; Godoy Androcioli, Leonardo; Ferreira, Rafaelle Vecchia; Sera, Gustavo Hiroshi; Charmetant, Pierre; Leroy, Thierry; Pot, David; Domingues, Douglas Silva; Pereira, Luiz Filipe Protasio.

Genetica ; 147(2): 205-216, 2019 Apr.

Article in English | MEDLINE | ID: mdl-31054007

ABSTRACT

Information about population structure and genetic relationships within and among wild and Brazilian Coffea arabica L. genotypes is highly relevant to optimize the use of genetic resources for breeding purposes. In this study, we evaluated genetic diversity, clustering based on Jaccard's coefficient, and population structure in 33 genotypes of C. arabica and of three diploid Coffea species (C. canephora, C. eugenioides and C. racemosa) using 30 SSR markers. A total of 206 alleles were identified, with a mean of 6.9 over all loci. The set of SSR markers was able to discriminate all genotypes and revealed that Ethiopian accessions presented higher genetic diversity than commercial varieties. Population structure analysis indicated two genetic groups, one corresponding to Ethiopian accessions and another corresponding predominantly to commercial cultivars. Thirty-four private alleles were detected in the group of accessions collected from the west side of the Great Rift Valley. We observed a lower average genetic distance of the C. arabica genotypes in relation to C. eugenioides than to C. canephora. Interestingly, commercial cultivars were genetically closer to C. eugenioides than to C. canephora and C. racemosa. The great allelic richness observed in Ethiopian Arabica coffee, especially in the Western group, shows that these accessions can be a potential source of new alleles to be explored by coffee breeding programs.
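As a reading aid for the Jaccard's coefficient named in the abstract above, the following is a minimal sketch of how a pairwise similarity between two genotypes could be computed from presence/absence allele data. The allele labels and genotype sets are hypothetical and are not taken from the study; the authors' actual clustering pipeline is not described in this record.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard similarity: alleles shared by both genotypes
    divided by alleles present in at least one of them."""
    union = a | b
    if not union:
        # Two empty allele sets are treated as identical by convention.
        return 1.0
    return len(a & b) / len(union)


# Hypothetical SSR allele calls (marker_size) for two genotypes.
genotype_1 = {"SSR1_150", "SSR1_154", "SSR2_210"}
genotype_2 = {"SSR1_150", "SSR2_210", "SSR2_214"}

similarity = jaccard(genotype_1, genotype_2)  # 2 shared alleles out of 4 observed
distance = 1.0 - similarity                   # dissimilarity usable for clustering

# Mean alleles per locus as reported in the abstract: 206 alleles over 30 markers.
mean_alleles_per_locus = round(206 / 30, 1)
```

A matrix of such pairwise distances is the usual input to hierarchical clustering; which clustering method was used in the study is not stated here.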

Subjects

Coffea/genetics , Microsatellite Repeats , Genetic Polymorphism , Coffea/classification , Genotype , Genotyping Techniques/methods , Genotyping Techniques/standards , Phylogeny , Plant Breeding/methods

5.

Identification of candidate genes for drought tolerance in coffee by high-throughput sequencing in the shoot apex of different Coffea arabica cultivars.

Mofatto, Luciana Souto; Carneiro, Fernanda de Araújo; Vieira, Natalia Gomes; Duarte, Karoline Estefani; Vidal, Ramon Oliveira; Alekcevetch, Jean Carlos; Cotta, Michelle Guitton; Verdeil, Jean-Luc; Lapeyre-Montes, Fabienne; Lartaud, Marc; Leroy, Thierry; De Bellis, Fabien; Pot, David; Rodrigues, Gustavo Costa; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães; Andrade, Alan Carvalho; Marraccini, Pierre.

BMC Plant Biol ; 16: 94, 2016 Apr 19.

Article in English | MEDLINE | ID: mdl-27095276

ABSTRACT

BACKGROUND: Drought is a widespread limiting factor in coffee plants. It affects plant development, fruit production, bean development and consequently beverage quality. Genetic diversity for drought tolerance exists within the coffee genus. However, the molecular mechanisms underlying the adaptation of coffee plants to drought are largely unknown. In this study, we compared the molecular responses to drought in two commercial cultivars (IAPAR59, drought-tolerant and Rubi, drought-susceptible) of Coffea arabica grown in the field under control (irrigation) and drought conditions using the pyrosequencing of RNA extracted from shoot apices and analysing the expression of 38 candidate genes. RESULTS: Pyrosequencing from shoot apices generated a total of 34.7 Mbp and 535,544 reads enabling the identification of 43,087 clusters (41,512 contigs and 1,575 singletons). These data included 17,719 clusters (16,238 contigs and 1,575 singletons) exclusively from 454 sequencing reads, along with 25,368 hybrid clusters assembled with 454 sequences. The comparison of DNA libraries identified new candidate genes (n = 20) presenting differential expression between IAPAR59 and Rubi and/or drought conditions. Their expression was monitored in plagiotropic buds, together with those of other (n = 18) candidate genes. Under drought conditions, up-regulated expression was observed in IAPAR59 but not in Rubi for CaSTK1 (protein kinase), CaSAMT1 (SAM-dependent methyltransferase), CaSLP1 (plant development) and CaMAS1 (ABA biosynthesis). Interestingly, the expression of lipid-transfer protein (nsLTP) genes was also highly up-regulated under drought conditions in IAPAR59. This may have been related to the thicker cuticle observed on the abaxial leaf surface in IAPAR59 compared to Rubi. CONCLUSIONS: The full transcriptome assembly of C. arabica, followed by functional annotation, enabled us to identify differentially expressed genes related to drought conditions. Using these data, candidate genes were selected and their differential expression profiles were confirmed by qPCR experiments in plagiotropic buds of IAPAR59 and Rubi under drought conditions. As regards the genes up-regulated under drought conditions, specifically in the drought-tolerant IAPAR59, several corresponded to orphan genes but also to genes coding proteins involved in signal transduction pathways, as well as ABA and lipid metabolism, for example. The identification of these genes should help advance our understanding of the genetic determinism of drought tolerance in coffee.

Subjects

Physiological Adaptation/genetics , Coffea/genetics , Droughts , Plant Genes/genetics , High-Throughput Nucleotide Sequencing/methods , Plant Shoots/genetics , Coffea/classification , Coffea/physiology , Coffee/genetics , Coffee/physiology , Gene Expression Profiling/methods , Plant Gene Expression Regulation , Gene Library , Gene Ontology , Plant Leaves/genetics , Plant Leaves/physiology , Plant Shoots/physiology , Reverse Transcriptase Polymerase Chain Reaction , Species Specificity

6.

NCI Cancer Research Data Commons: Cloud-Based Analytic Resources.

Pot, David; Worman, Zelia; Baumann, Alexander; Pathak, Shirish; Beck, Rowan; Beck, Erin; Thayer, Katherine; Davidsen, Tanja M; Kim, Erika; Davis-Dusenbery, Brandi; Otridge, John; Pihl, Todd; Barnholtz-Sloan, Jill S; Kerlavage, Anthony R.

Cancer Res ; 84(9): 1396-1403, 2024 May 02.

Article in English | MEDLINE | ID: mdl-38488504

ABSTRACT

The NCI's Cloud Resources (CR) are the analytical components of the Cancer Research Data Commons (CRDC) ecosystem. This review describes how the three CRs (Broad Institute FireCloud, Institute for Systems Biology Cancer Gateway in the Cloud, and Seven Bridges Cancer Genomics Cloud) provide access and availability to large, cloud-hosted, multimodal cancer datasets, as well as offer tools and workspaces for performing data analysis where the data resides, without download or storage. In addition, users can upload their own data and tools into their workspaces, allowing researchers to create custom analysis workflows and integrate CRDC-hosted data with their own. See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Kim et al., p. 1404.

Subjects

Cloud Computing , National Cancer Institute (U.S.) , Neoplasms , Humans , Neoplasms/genetics , United States , Biomedical Research , Genomics/methods , Computational Biology/methods

7.

NCI Cancer Research Data Commons: Lessons Learned and Future State.

Kim, Erika; Davidsen, Tanja; Davis-Dusenbery, Brandi N; Baumann, Alexander; Maggio, Angela; Chen, Zhaoyi; Meerzaman, Daoud; Casas-Silva, Esmeralda; Pot, David; Pihl, Todd; Otridge, John; Shalley, Eve; Barnholtz-Sloan, Jill S; Kerlavage, Anthony R.

Cancer Res ; 84(9): 1404-1409, 2024 May 02.

Article in English | MEDLINE | ID: mdl-38488510

ABSTRACT

More than ever, scientific progress in cancer research hinges on our ability to combine datasets and extract meaningful interpretations to better understand diseases and ultimately inform the development of better treatments and diagnostic tools. To enable the successful sharing and use of big data, the NCI developed the Cancer Research Data Commons (CRDC), providing access to a large, comprehensive, and expanding collection of cancer data. The CRDC is a cloud-based data science infrastructure that eliminates the need for researchers to download and store large-scale datasets by allowing them to perform analysis where data reside. Over the past 10 years, the CRDC has made significant progress in providing access to data and tools along with training and outreach to support the cancer research community. In this review, we provide an overview of the history and the impact of the CRDC to date, lessons learned, and future plans to further promote data sharing, accessibility, interoperability, and reuse. See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Pot et al., p. 1396.

Subjects

Information Dissemination , National Cancer Institute (U.S.) , Neoplasms , Humans , United States , Neoplasms/therapy , Information Dissemination/methods , Biomedical Research/trends , Factual Databases , Big Data

8.

Characterization of adaptation mechanisms in sorghum using a multireference back-cross nested association mapping design and envirotyping.

Garin, Vincent; Diallo, Chiaka; Tékété, Mohamed Lamine; Théra, Korotimi; Guitton, Baptiste; Dagno, Karim; Diallo, Abdoulaye G; Kouressy, Mamoutou; Leiser, Willmar; Rattunde, Fred; Sissoko, Ibrahima; Touré, Aboubacar; Nébié, Baloua; Samaké, Moussa; Kholovà, Jana; Frouin, Julien; Pot, David; Vaksmann, Michel; Weltzien, Eva; Témé, Niaba; Rami, Jean-François.

Genetics ; 226(4), 2024 Apr 03.

Article in English | MEDLINE | ID: mdl-38381593

ABSTRACT

Identifying the genetic factors impacting the adaptation of crops to environmental conditions is of key interest for conservation and selection purposes. It can be achieved using population genomics, and evolutionary or quantitative genetics. Here we present a sorghum multireference back-cross nested association mapping population composed of 3,901 lines produced by crossing 24 diverse parents to 3 elite parents from West and Central Africa-back-cross nested association mapping. The population was phenotyped in environments characterized by differences in photoperiod, rainfall pattern, temperature levels, and soil fertility. To integrate the multiparental and multi-environmental dimension of our data we proposed a new approach for quantitative trait loci (QTL) detection and parental effect estimation. We extended our model to estimate QTL effect sensitivity to environmental covariates, which facilitated the integration of envirotyping data. Our models allowed spatial projections of the QTL effects in agro-ecologies of interest. We utilized this strategy to analyze the genetic architecture of flowering time and plant height, which represent key adaptation mechanisms in environments like West Africa. Our results allowed a better characterization of well-known genomic regions influencing flowering time concerning their response to photoperiod, with Ma6 and Ma1 being photoperiod-sensitive and the region of possible candidate gene Elf3 being photoperiod-insensitive. We also gained a better understanding of plant height genetic determinism with the combined effects of phenology-dependent (Ma6) and independent (qHT7.1 and Dw3) genomic regions. Therefore, we argue that the West and Central Africa-back-cross nested association mapping and the presented analytical approach constitute unique resources to better understand adaptation in sorghum with direct application to develop climate-smart varieties.

Subjects

Sorghum , Sorghum/genetics , Chromosome Mapping , Quantitative Trait Loci , Phenotype , Edible Grain/genetics

9.

An initial assessment of linkage disequilibrium (LD) in coffee trees: LD patterns in groups of Coffea canephora Pierre using microsatellite analysis.

Cubry, Philippe; de Bellis, Fabien; Avia, Komlan; Bouchet, Sophie; Pot, David; Dufour, Magali; Legnate, Hyacinthe; Leroy, Thierry.

BMC Genomics ; 14: 10, 2013 Jan 16.

Article in English | MEDLINE | ID: mdl-23324026

ABSTRACT

BACKGROUND: A reciprocal recurrent selection program has been under way for the Coffea canephora coffee tree for approximately thirty years in the Ivory Coast. Association genetics would help to speed up this program by more rapidly selecting zones of interest in the genome. However, prior to any such studies, the linkage disequilibrium (LD) needs to be assessed between the markers on the genome. These data are essential for guiding association studies. RESULTS: This article describes the first results of an LD assessment in a coffee tree species. Guinean and Congolese breeding populations of C. canephora have been used for this work, with the goal of identifying ways of using these populations in association genetics. We identified changes in the LD along the genome within the different C. canephora diversity groups. In the different diversity groups studied, the LD was variable. Some diversity groups displayed disequilibria over long distances (up to 25 cM), whereas others had disequilibria not exceeding 1 cM. We also discovered a fine structure within the Guinean group. CONCLUSIONS: Given these results, association studies can be used within the species C. canephora. The coffee recurrent selection scheme being implemented in the Ivory Coast can thus be optimized. Lastly, our results could be used to improve C. arabica because one of its parents is closely related to C. canephora.

Subjects

Coffea/genetics , Genomics , Linkage Disequilibrium/genetics , Microsatellite Repeats/genetics , Genetic Markers/genetics , Genetic Variation/genetics , Genotype

10.

Differentially expressed genes and proteins upon drought acclimation in tolerant and sensitive genotypes of Coffea canephora.

Marraccini, Pierre; Vinecky, Felipe; Alves, Gabriel S C; Ramos, Humberto J O; Elbelt, Sonia; Vieira, Natalia G; Carneiro, Fernanda A; Sujii, Patricia S; Alekcevetch, Jean C; Silva, Vânia A; DaMatta, Fábio M; Ferrão, Maria A G; Leroy, Thierry; Pot, David; Vieira, Luiz G E; da Silva, Felipe R; Andrade, Alan C.

J Exp Bot ; 63(11): 4191-212, 2012 Jun.

Article in English | MEDLINE | ID: mdl-22511801

ABSTRACT

The aim of this study was to investigate the molecular mechanisms underlying drought acclimation in coffee plants by the identification of candidate genes (CGs) using different approaches. The first approach used the data generated during the Brazilian Coffee expressed sequence tag (EST) project to select 13 CGs by an in silico analysis (electronic northern). The second approach was based on screening macroarrays spotted with plasmid DNA (coffee ESTs) with separate hybridizations using leaf cDNA probes from drought-tolerant and susceptible clones of Coffea canephora var. Conilon, grown under different water regimes. This allowed the isolation of seven additional CGs. The third approach used two-dimensional gel electrophoresis to identify proteins displaying differential accumulation in leaves of drought-tolerant and susceptible clones of C. canephora. Six of them were characterized by MALDI-TOF-MS/MS (matrix-assisted laser desorption-time of flight-tandem mass spectrometry) and the corresponding proteins were identified. Finally, additional CGs were selected from the literature, and quantitative real-time polymerase chain reaction (qPCR) was performed to analyse the expression of all identified CGs. Altogether, >40 genes presenting differential gene expression during drought acclimation were identified, some of them showing different expression profiles between drought-tolerant and susceptible clones. Based on the obtained results, it can be concluded that factors involved in a complex network of responses, probably involving the abscisic acid signalling pathway and nitric oxide, are major molecular determinants that might explain the better efficiency in controlling stomata closure and transpiration displayed by drought-tolerant clones of C. canephora.

Subjects

Coffea/physiology , Plant Gene Expression Regulation , Plant Proteins/genetics , Acclimatization , Coffea/genetics , Droughts , Expressed Sequence Tags , Genotype , Molecular Sequence Data , Plant Proteins/metabolism

11.

Whole Genome Variant Dataset for Enriching Studies across 18 Different Cancers.

Torcivia, John; Abdilleh, Kawther; Seidl, Fabian; Shahzada, Owais; Rodriguez, Rebecca; Pot, David; Mazumder, Raja.

Onco (Basel) ; 2(2): 129-144, 2022 Jun.

Article in English | MEDLINE | ID: mdl-37841494

ABSTRACT

Whole genome sequencing (WGS) has helped to revolutionize biology, but the computational challenge remains for extracting valuable inferences from this information. Here, we present the cancer-associated variants from the Cancer Genome Atlas (TCGA) WGS dataset. This set of data will allow cancer researchers to further expand their analysis beyond the exomic regions of the genome to the entire genome. A total of 1342 WGS alignments available from the consortium were processed with VarScan2 and deposited to the NCI Cancer Cloud. The sample set covers 18 different cancers and reveals 157,313,519 pooled (non-unique) cancer-associated single-nucleotide variations (SNVs) across all samples. There was an average of 117,223 SNVs per sample, with a range from 1111 to 775,470 and a standard deviation of 163,273. The dataset was incorporated into BigQuery, which allows for fast access and cross-mapping, which will allow researchers to enrich their current studies with a plethora of newly available genomic data.
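The summary figures quoted in the abstract above can be cross-checked with a few lines of arithmetic. This is only a sketch that uses the numbers stated in the abstract; the variable names are illustrative, and the coefficient of variation is a derived quantity that the abstract itself does not report.

```python
# Figures as stated in the abstract.
total_snvs = 157_313_519   # pooled (non-unique) cancer-associated SNVs
n_samples = 1342           # WGS alignments processed
reported_mean = 117_223    # average SNVs per sample
reported_sd = 163_273      # standard deviation of SNVs per sample

# The pooled total divided by the sample count should reproduce the mean.
computed_mean = round(total_snvs / n_samples)

# Derived here, not in the abstract: a standard deviation larger than the
# mean points to a strongly right-skewed distribution of SNV counts,
# consistent with the reported range of 1111 to 775,470.
coefficient_of_variation = round(reported_sd / reported_mean, 2)
```

The computed mean agrees with the reported average, so the pooled total and the per-sample mean are mutually consistent.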

12.

SL-Cloud: A Cloud-based resource to support synthetic lethal interaction discovery.

Tercan, Bahar; Qin, Guangrong; Kim, Taek-Kyun; Aguilar, Boris; Phan, John; Longabaugh, William; Pot, David; Kemp, Christopher J; Chambwe, Nyasha; Shmulevich, Ilya.

F1000Res ; 11: 493, 2022.

Article in English | MEDLINE | ID: mdl-36761837

ABSTRACT

Synthetic lethal interactions (SLIs), genetic interactions in which the simultaneous inactivation of two genes leads to a lethal phenotype, are promising targets for therapeutic intervention in cancer, as exemplified by the recent success of PARP inhibitors in treating BRCA1/2-deficient tumors. We present SL-Cloud, a new component of the Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC), that provides an integrated framework of cloud-hosted data resources and curated workflows to enable facile prediction of SLIs. This resource addresses two main challenges related to SLI inference: the need to wrangle and preprocess large multi-omic datasets and the availability of multiple comparable prediction approaches. SL-Cloud enables customizable computational inference of SLIs and testing of prediction approaches across multiple datasets. We anticipate that cancer researchers will find utility in this tool for discovery of SLIs to support further investigation into potential drug targets for anticancer therapies.

Subjects

Cloud Computing , Neoplasms , Humans , Neoplasms/genetics , Systems Biology , Multiomics

13.

The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits.

Privat, Isabelle; Bardil, Amélie; Gomez, Aureliano Bombarely; Severac, Dany; Dantec, Christelle; Fuentes, Ivanna; Mueller, Lukas; Joët, Thierry; Pot, David; Foucrier, Séverine; Dussert, Stéphane; Leroy, Thierry; Journot, Laurent; de Kochko, Alexandre; Campa, Claudine; Combes, Marie-Christine; Lashermes, Philippe; Bertrand, Benoit.

BMC Genomics ; 12: 5, 2011 Jan 05.

Article in English | MEDLINE | ID: mdl-21208403

ABSTRACT

BACKGROUND: Understanding the genetic elements that contribute to key aspects of coffee biology will have an impact on future agronomical improvements for this economically important tree. During the past years, EST collections were generated in coffee, opening the possibility to create new tools for functional genomics. RESULTS: The "PUCE CAFE" Project, organized by the scientific consortium NESTLE/IRD/CIRAD, has developed an oligo-based microarray using 15,721 unigenes derived from published coffee EST sequences mostly obtained from different stages of fruit development and leaves in Coffea canephora (Robusta). Hybridizations for two independent experiments served to compare global gene expression profiles in three types of tissue matter (mature beans, leaves and flowers) in C. canephora as well as in the leaves of three different coffee species (C. canephora, C. eugenioides and C. arabica). Microarray construction, statistical analyses and validation by Q-PCR analysis are presented in this study. CONCLUSION: We have generated the first 15 K coffee array during this PUCE CAFE project, granted by Génoplante (the French consortium for plant genomics). This new tool will help study functional genomics in a wide range of experiments on various plant tissues, such as analyzing bean maturation or resistance to pathogens or drought. Furthermore, the use of this array has proven to be valid in different coffee species (diploid or tetraploid), drastically enlarging its impact for high-throughput gene expression in the community of coffee research.

Subjects

Agriculture/methods , Coffee/genetics , Genomics/methods , Expressed Sequence Tags , Gene Expression Profiling , Oligonucleotide Array Sequence Analysis , Polymerase Chain Reaction

14.

RBCS1 expression in coffee: Coffea orthologs, Coffea arabica homeologs, and expression variability between genotypes and under drought stress.

Marraccini, Pierre; Freire, Luciana P; Alves, Gabriel S C; Vieira, Natalia G; Vinecky, Felipe; Elbelt, Sonia; Ramos, Humberto J O; Montagnon, Christophe; Vieira, Luiz G E; Leroy, Thierry; Pot, David; Silva, Vânia A; Rodrigues, Gustavo C; Andrade, Alan C.

BMC Plant Biol ; 11: 85, 2011 May 16.

Artigo em Inglês | MEDLINE | ID: mdl-21575242

RESUMO

BACKGROUND: In higher plants, the inhibition of photosynthetic capacity under drought is attributable to stomatal and non-stomatal (i.e., photochemical and biochemical) effects. In particular, a disruption of photosynthetic metabolism and Rubisco regulation can be observed. Several studies reported reduced expression of the RBCS genes, which encode the Rubisco small subunit, under water stress. RESULTS: Expression of the RBCS1 gene was analysed in the allopolyploid context of C. arabica, which originates from a natural cross between the C. canephora and C. eugenioides species. Our study revealed the existence of two homeologous RBCS1 genes in C. arabica: one carried by the C. canephora sub-genome (called CaCc) and the other carried by the C. eugenioides sub-genome (called CaCe). Using specific primer pairs for each homeolog, expression studies revealed that CaCe was expressed in C. eugenioides and C. arabica but was undetectable in C. canephora. On the other hand, CaCc was expressed in C. canephora but almost completely silenced in non-introgressed ("pure") genotypes of C. arabica. However, enhanced CaCc expression was observed in most C. arabica cultivars with introgressed C. canephora genome. In addition, total RBCS1 expression was higher for C. arabica cultivars that had recently introgressed C. canephora genome than for "pure" cultivars. For both species, water stress led to an important decrease in the abundance of RBCS1 transcripts. This was observed for plants grown in either greenhouse or field conditions under severe or moderate drought. However, this reduction of RBCS1 gene expression was not accompanied by a decrease in the corresponding protein in the leaves of C. canephora subjected to water withdrawal. In that case, the amount of RBCS1 was even higher under drought than under unstressed (irrigated) conditions, which suggests great stability of RBCS1 under adverse water conditions. On the other hand, for C. 
arabica, high nocturnal expression of RBCS1 could also explain the accumulation of the RBCS1 protein under water stress. Altogether, the results presented here suggest that the content of RBCS was not responsible for the loss of photosynthetic capacity that is commonly observed in water-stressed coffee plants. CONCLUSION: We showed that the CaCe homeolog was expressed in C. eugenioides and non-introgressed ("pure") genotypes of C. arabica but that it was undetectable in C. canephora. On the other hand, the CaCc homeolog was expressed in C. canephora but highly repressed in C. arabica. Expression of the CaCc homeolog was enhanced in C. arabica cultivars that experienced recent introgression with C. canephora. For both C. canephora and C. arabica species, total RBCS1 gene expression was highly reduced with WS. Unexpectedly, the accumulation of RBCS1 protein was observed in the leaves of C. canephora under WS, possibly coming from nocturnal RBCS1 expression. These results suggest that the increase in the amount of RBCS1 protein could contribute to the antioxidative function of photorespiration in water-stressed coffee plants.

Subjects

Coffea/genetics , Droughts , Plant Leaves/genetics , Ribulose-Bisphosphate Carboxylase/metabolism , Base Sequence , Molecular Cloning , Coffea/enzymology , Coffea/physiology , Gene Expression Profiling , Plant Gene Expression Regulation , Gene Library , Plant Genes , Genotype , Mass Spectrometry , Molecular Sequence Data , Molecular Weight , Photoperiod , Plant Leaves/enzymology , Single Nucleotide Polymorphism , Protein Isoforms , Ribulose-Bisphosphate Carboxylase/chemistry , Ribulose-Bisphosphate Carboxylase/genetics , Sequence Alignment , Protein Sequence Analysis , Physiological Stress , Water/metabolism

15.

A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica.

Vidal, Ramon Oliveira; Mondego, Jorge Maurício Costa; Pot, David; Ambrósio, Alinne Batista; Andrade, Alan Carvalho; Pereira, Luiz Filipe Protasio; Colombo, Carlos Augusto; Vieira, Luiz Gonzaga Esteves; Carazzolle, Marcelo Falsarella; Pereira, Gonçalo Amarante Guimarães.

Plant Physiol ; 154(3): 1053-66, 2010 Nov.

Article in English | MEDLINE | ID: mdl-20864545

ABSTRACT

Polyploidization constitutes a common mode of evolution in flowering plants. This event provides the raw material for the divergence of function in homeologous genes, leading to phenotypic novelty that can contribute to the success of polyploids in nature or their selection for use in agriculture. Mounting evidence underlined the existence of homeologous expression biases in polyploid genomes; however, strategies to analyze such transcriptome regulation remained scarce. Important factors regarding homeologous expression biases remain to be explored, such as whether this phenomenon influences specific genes, how paralogs are affected by genome doubling, and what is the importance of the variability of homeologous expression bias to genotype differences. This study reports the expressed sequence tag assembly of the allopolyploid Coffea arabica and one of its direct ancestors, Coffea canephora. The assembly was used for the discovery of single nucleotide polymorphisms through the identification of high-quality discrepancies in overlapped expressed sequence tags and for gene expression information indirectly estimated by the transcript redundancy. Sequence diversity profiles were evaluated within C. arabica (Ca) and C. canephora (Cc) and used to deduce the transcript contribution of the Coffea eugenioides (Ce) ancestor. The assignment of the C. arabica haplotypes to the C. canephora (CaCc) or C. eugenioides (CaCe) ancestral genomes allowed us to analyze gene expression contributions of each subgenome in C. arabica. In silico data were validated by the quantitative polymerase chain reaction and allele-specific combination TaqMAMA-based method. The presence of differential expression of C. arabica homeologous genes and its implications in coffee gene expression, ontology, and physiology are discussed.

Subjects

Coffea/genetics , Expressed Sequence Tags , Plant Genome , Single Nucleotide Polymorphism , Plant DNA/genetics , Data Mining , Plant Gene Expression Regulation , Gene Frequency , Haplotypes , DNA Sequence Analysis , Tetraploidy

16.

The Road to Sorghum Domestication: Evidence From Nucleotide Diversity and Gene Expression Patterns.

Burgarella, Concetta; Berger, Angélique; Glémin, Sylvain; David, Jacques; Terrier, Nancy; Deu, Monique; Pot, David.

Front Plant Sci ; 12: 666075, 2021.

Article in English | MEDLINE | ID: mdl-34527004

ABSTRACT

Native African cereals (sorghum, millets) ensure food security to millions of low-income people from low fertility and drought-prone regions of Africa and Asia. In spite of their agronomic importance, the genetic bases of their phenotype and adaptations are still not well understood. Here we focus on Sorghum bicolor, which is the fifth cereal worldwide for grain production and constitutes the staple food for around 500 million people. We leverage transcriptomic resources to address the adaptive consequences of the domestication process. Gene expression and nucleotide variability were analyzed in 11 domesticated and nine wild accessions. We documented a downregulation of expression and a reduction of diversity both in nucleotide polymorphism (30%) and gene expression levels (18%) in domesticated sorghum. These findings at the genome-wide level support the occurrence of a global reduction of diversity during the domestication process, although several genes also showed patterns consistent with the action of selection. Nine hundred and forty-nine genes were significantly differentially expressed between wild and domesticated gene pools. Their functional annotation points to metabolic pathways most likely contributing to the sorghum domestication syndrome, such as photosynthesis and auxin metabolism. Coexpression network analyses revealed 21 clusters of genes sharing similar expression patterns. Four clusters (totaling 2,449 genes) were significantly enriched in differentially expressed genes between the wild and domesticated pools and two were also enriched in domestication and improvement genes previously identified in sorghum. These findings reinforce the evidence that the combined and intricate effects of the domestication and improvement processes not only affected the behavior of a few genes but led to a large rewiring of the transcriptome. Overall, these analyses pave the way toward the identification of key domestication genes valuable for genetic resources characterization and breeding purposes.

17.

NCI Imaging Data Commons.

Fedorov, Andrey; Longabaugh, William J R; Pot, David; Clunie, David A; Pieper, Steve; Aerts, Hugo J W L; Homeyer, André; Lewis, Rob; Akbarzadeh, Afshin; Bontempi, Dennis; Clifford, William; Herrmann, Markus D; Höfener, Henning; Octaviano, Igor; Osborne, Chad; Paquette, Suzanne; Petts, James; Punzo, Davide; Reyes, Madelyn; Schacherer, Daniela P; Tian, Mi; White, George; Ziegler, Erik; Shmulevich, Ilya; Pihl, Todd; Wagner, Ulrike; Farahani, Keyvan; Kikinis, Ron.

Cancer Res ; 81(16): 4188-4193, 2021 Aug 15.

Article in English | MEDLINE | ID: mdl-34185678

ABSTRACT

The National Cancer Institute (NCI) Cancer Research Data Commons (CRDC) aims to establish a national cloud-based data science infrastructure. Imaging Data Commons (IDC) is a new component of CRDC supported by the Cancer Moonshot. The goal of IDC is to enable a broad spectrum of cancer researchers, with and without imaging expertise, to easily access and explore the value of deidentified imaging data and to support integrated analyses with nonimaging data. We achieve this goal by colocating versatile imaging collections with cloud-based computing resources and data exploration, visualization, and analysis tools. The IDC pilot was released in October 2020 and is being continuously populated with radiology and histopathology collections. IDC provides access to curated imaging collections, accompanied by documentation, a user forum, and a growing number of analysis use cases that aim to demonstrate the value of a data commons framework applied to cancer imaging research. SIGNIFICANCE: This study introduces NCI Imaging Data Commons, a new repository of the NCI Cancer Research Data Commons, which will support cancer imaging research on the cloud.

Subjects

Diagnostic Imaging/methods , National Cancer Institute (U.S.) , Neoplasms/diagnostic imaging , Neoplasms/genetics , Biomedical Research/trends , Cloud Computing , Computational Biology/methods , Computer Graphics , Computer Security , Statistical Data Interpretation , Factual Databases , Diagnostic Imaging/standards , Humans , Computer-Assisted Image Processing , Pilot Projects , Programming Languages , Radiology/methods , Radiology/standards , Reproducibility of Results , Software , United States , User-Computer Interface

18.

Enteropathogen Resource Integration Center (ERIC): bioinformatics support for research on biodefense-relevant enterobacteria.

Glasner, Jeremy D; Plunkett, Guy; Anderson, Bradley D; Baumler, David J; Biehl, Bryan S; Burland, Valerie; Cabot, Eric L; Darling, Aaron E; Mau, Bob; Neeno-Eckwall, Eric C; Pot, David; Qiu, Yu; Rissman, Anna I; Worzella, Sara; Zaremba, Sam; Fedorko, Joel; Hampton, Tom; Liss, Paul; Rusch, Michael; Shaker, Matthew; Shaull, Lorie; Shetty, Panna; Thotakura, Silpa; Whitmore, Jon; Blattner, Frederick R; Greene, John M; Perna, Nicole T.

Nucleic Acids Res ; 36(Database issue): D519-23, 2008 Jan.

Article in English | MEDLINE | ID: mdl-17999997

ABSTRACT

ERIC, the Enteropathogen Resource Integration Center (www.ericbrc.org), is a new web portal serving as a rich source of information about enterobacteria on the NIAID-established list of Select Agents related to biodefense: diarrheagenic Escherichia coli, Shigella spp., Salmonella spp., Yersinia enterocolitica and Yersinia pestis. More than 30 genomes have been completely sequenced, many more exist in draft form and additional projects are underway. These organisms are increasingly the focus of studies using high-throughput experimental technologies and computational approaches. This wealth of data provides unprecedented opportunities for understanding the workings of basic biological systems and for discovering novel targets for the development of vaccines, diagnostics and therapeutics. ERIC brings information together from disparate sources and supports data comparison across different organisms, analysis of varying data types and visualization of analyses in human- and computer-readable formats.

Subjects

Genetic Databases , Enterobacteriaceae/genetics , Bacterial Genome , Bacterial Proteins/chemistry , Bacterial Proteins/classification , Bacterial Proteins/genetics , Biomedical Research , Bioterrorism , Computational Biology , DNA Transposable Elements , Enterobacteriaceae Infections/diagnosis , Enterobacteriaceae Infections/prevention & control , Enterobacteriaceae Infections/therapy , Genomics , Internet , Oligonucleotide Array Sequence Analysis , Proteomics , Sequence Alignment , Software , Systems Integration

19.

Transcriptional Regulation of Sorghum Stem Composition: Key Players Identified Through Co-expression Gene Network and Comparative Genomics Analyses.

Hennet, Lauriane; Berger, Angélique; Trabanco, Noemi; Ricciuti, Emeline; Dufayard, Jean-François; Bocs, Stéphanie; Bastianelli, Denis; Bonnal, Laurent; Roques, Sandrine; Rossini, Laura; Luquet, Delphine; Terrier, Nancy; Pot, David.

Front Plant Sci ; 11: 224, 2020.

Article in English | MEDLINE | ID: mdl-32194601

ABSTRACT

Most sorghum biomass accumulates in stem secondary cell walls (SCW). As sorghum stems are used as raw materials for various purposes such as feed, energy and fiber-reinforced polymers, identifying the genes responsible for SCW establishment is highly important. Taking advantage of studies performed in model species, most of the structural genes contributing at the molecular level to SCW biosynthesis in sorghum have been proposed, while their regulatory factors have mostly not been determined. Validation of the role of several MYB and NAC transcription factors in SCW regulation in Arabidopsis and a few other species has been provided. In this study, we contributed to the recent efforts made in grasses to uncover the mechanisms underlying SCW establishment. We reported updated phylogenies of NAC and MYB in 9 different species and exploited findings from other species to highlight candidate regulators of SCW in sorghum. We acquired expression data during sorghum internode development and used co-expression analyses to determine groups of co-expressed genes that are likely to be involved in SCW establishment. We were able to identify two groups of co-expressed genes presenting multiple lines of evidence of involvement in SCW building. Gene enrichment analysis of MYB and NAC genes provided evidence that while NAC SECONDARY WALL THICKENING PROMOTING FACTOR (NST) genes and SECONDARY WALL-ASSOCIATED NAC DOMAIN PROTEIN gene functions appear to be conserved in sorghum, NAC master regulators of SCW in sorghum may not be as tissue compartmentalized as in Arabidopsis. We showed that for every homolog of the key SCW MYB in Arabidopsis, a similar role is expected for sorghum. In addition, we unveiled sorghum MYB and NAC that have not been identified to date as being involved in cell wall regulation. Although specific validation of the MYB and NAC genes uncovered in this study is needed, we provide a network of sorghum genes involved in SCW both at the structural and regulatory levels.

20.

Text-mining of PubMed abstracts by natural language processing to create a public knowledge base on molecular mechanisms of bacterial enteropathogens.

Zaremba, Sam; Ramos-Santacruz, Mila; Hampton, Thomas; Shetty, Panna; Fedorko, Joel; Whitmore, Jon; Greene, John M; Perna, Nicole T; Glasner, Jeremy D; Plunkett, Guy; Shaker, Matthew; Pot, David.

BMC Bioinformatics ; 10: 177, 2009 Jun 10.

Article in English | MEDLINE | ID: mdl-19515247

ABSTRACT

BACKGROUND: The Enteropathogen Resource Integration Center (ERIC; http://www.ericbrc.org) has a goal of providing bioinformatics support for the scientific community researching enteropathogenic bacteria such as Escherichia coli and Salmonella spp. Rapid and accurate identification of experimental conclusions from the scientific literature is critical to support research in this field. Natural Language Processing (NLP), and in particular Information Extraction (IE) technology, can be a significant aid to this process. DESCRIPTION: We have trained a powerful, state-of-the-art IE technology on a corpus of abstracts from the microbial literature in PubMed to automatically identify and categorize biologically relevant entities and predicative relations. These relations include: Genes/Gene Products and their Roles; Gene Mutations and the resulting Phenotypes; and Organisms and their associated Pathogenicity. Evaluations on blind datasets show an F-measure average of greater than 90% for entities (genes, operons, etc.) and over 70% for relations (gene/gene product to role, etc.). This IE capability, combined with text indexing and relational database technologies, constitutes the core of our recently deployed text mining application. CONCLUSION: Our Text Mining application is available online on the ERIC website (http://www.ericbrc.org/portal/eric/articles). The information retrieval interface displays a list of recently published enteropathogen literature abstracts, and also provides a search interface to execute custom queries by keyword, date range, etc. Upon selection, processed abstracts and the entities and relations extracted from them are retrieved from a relational database and marked up to highlight the entities and relations. The abstract also provides links from extracted genes and gene products to the ERIC Annotations database, thus providing access to comprehensive genomic annotations and adding value to both the text-mining and annotations systems.
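
The F-measure quoted in the abstract above is conventionally the harmonic mean of precision and recall. The abstract does not say which variant or averaging scheme was used, so the sketch below assumes the balanced (beta = 1) form; the function name `f_measure` and the counts in the usage lines are hypothetical and are not taken from the ERIC evaluation.

```python
def f_measure(true_positives, false_positives, false_negatives, beta=1.0):
    """Return (precision, recall, F-beta) for one extraction category.

    Assumption: the standard definitions are used; the abstract does not
    state how its reported averages were computed.
    """
    predicted = true_positives + false_positives   # items the system extracted
    actual = true_positives + false_negatives      # items in the gold standard
    precision = true_positives / predicted if predicted else 0.0
    recall = true_positives / actual if actual else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    b2 = beta * beta
    f_score = (1 + b2) * precision * recall / (b2 * precision + recall)
    return precision, recall, f_score


# Hypothetical counts for illustration only (not data from the paper).
entity_scores = f_measure(true_positives=90, false_positives=10, false_negatives=10)
relation_scores = f_measure(true_positives=70, false_positives=30, false_negatives=30)
print("entities  P=%.2f R=%.2f F=%.2f" % entity_scores)
print("relations P=%.2f R=%.2f F=%.2f" % relation_scores)
```

Because the harmonic mean is dominated by the smaller of the two values, a system cannot reach a high F-measure by trading recall away for precision or the reverse.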

Subjects

Abstracting and Indexing , Computational Biology/methods , Enterobacteriaceae , Information Storage and Retrieval , Natural Language Processing , PubMed , Bacterial Physiological Phenomena , Database Management Systems , Factual Databases , Enterobacteriaceae/genetics , Enterobacteriaceae/pathogenicity , Enterobacteriaceae/physiology , Escherichia coli/genetics , Escherichia coli/pathogenicity , Escherichia coli/physiology , Internet , Salmonella/genetics , Salmonella/pathogenicity , Salmonella/physiology , User-Computer Interface
