Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Cancer Res ; 84(9): 1396-1403, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38488504

RESUMO

The NCI's Cloud Resources (CR) are the analytical components of the Cancer Research Data Commons (CRDC) ecosystem. This review describes how the three CRs (Broad Institute FireCloud, Institute for Systems Biology Cancer Gateway in the Cloud, and Seven Bridges Cancer Genomics Cloud) provide access and availability to large, cloud-hosted, multimodal cancer datasets, as well as offer tools and workspaces for performing data analysis where the data resides, without download or storage. In addition, users can upload their own data and tools into their workspaces, allowing researchers to create custom analysis workflows and integrate CRDC-hosted data with their own. See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Kim et al., p. 1404.


Assuntos
Computação em Nuvem , National Cancer Institute (U.S.) , Neoplasias , Humanos , Neoplasias/genética , Estados Unidos , Pesquisa Biomédica , Genômica/métodos , Biologia Computacional/métodos
2.
Cancer Res ; 84(9): 1404-1409, 2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38488510

RESUMO

More than ever, scientific progress in cancer research hinges on our ability to combine datasets and extract meaningful interpretations to better understand diseases and ultimately inform the development of better treatments and diagnostic tools. To enable the successful sharing and use of big data, the NCI developed the Cancer Research Data Commons (CRDC), providing access to a large, comprehensive, and expanding collection of cancer data. The CRDC is a cloud-based data science infrastructure that eliminates the need for researchers to download and store large-scale datasets by allowing them to perform analysis where data reside. Over the past 10 years, the CRDC has made significant progress in providing access to data and tools along with training and outreach to support the cancer research community. In this review, we provide an overview of the history and the impact of the CRDC to date, lessons learned, and future plans to further promote data sharing, accessibility, interoperability, and reuse. See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Pot et al., p. 1396.


Assuntos
Disseminação de Informação , National Cancer Institute (U.S.) , Neoplasias , Humanos , Estados Unidos , Neoplasias/terapia , Disseminação de Informação/métodos , Pesquisa Biomédica/tendências , Bases de Dados Factuais , Big Data
3.
Genetics ; 226(4)2024 Apr 03.
Artigo em Inglês | MEDLINE | ID: mdl-38381593

RESUMO

Identifying the genetic factors impacting the adaptation of crops to environmental conditions is of key interest for conservation and selection purposes. It can be achieved using population genomics, and evolutionary or quantitative genetics. Here we present a sorghum multireference back-cross nested association mapping population composed of 3,901 lines produced by crossing 24 diverse parents to 3 elite parents from West and Central Africa-back-cross nested association mapping. The population was phenotyped in environments characterized by differences in photoperiod, rainfall pattern, temperature levels, and soil fertility. To integrate the multiparental and multi-environmental dimension of our data we proposed a new approach for quantitative trait loci (QTL) detection and parental effect estimation. We extended our model to estimate QTL effect sensitivity to environmental covariates, which facilitated the integration of envirotyping data. Our models allowed spatial projections of the QTL effects in agro-ecologies of interest. We utilized this strategy to analyze the genetic architecture of flowering time and plant height, which represents key adaptation mechanisms in environments like West Africa. Our results allowed a better characterization of well-known genomic regions influencing flowering time concerning their response to photoperiod with Ma6 and Ma1 being photoperiod-sensitive and the region of possible candidate gene Elf3 being photoperiod-insensitive. We also accessed a better understanding of plant height genetic determinism with the combined effects of phenology-dependent (Ma6) and independent (qHT7.1 and Dw3) genomic regions. Therefore, we argue that the West and Central Africa-back-cross nested association mapping and the presented analytical approach constitute unique resources to better understand adaptation in sorghum with direct application to develop climate-smart varieties.


Assuntos
Sorghum , Sorghum/genética , Mapeamento Cromossômico , Locos de Características Quantitativas , Fenótipo , Grão Comestível/genética
4.
Radiographics ; 43(12): e230180, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37999984

RESUMO

The remarkable advances of artificial intelligence (AI) technology are revolutionizing established approaches to the acquisition, interpretation, and analysis of biomedical imaging data. Development, validation, and continuous refinement of AI tools requires easy access to large high-quality annotated datasets, which are both representative and diverse. The National Cancer Institute (NCI) Imaging Data Commons (IDC) hosts large and diverse publicly available cancer image data collections. By harmonizing all data based on industry standards and colocalizing it with analysis and exploration resources, the IDC aims to facilitate the development, validation, and clinical translation of AI tools and address the well-documented challenges of establishing reproducible and transparent AI processing pipelines. Balanced use of established commercial products with open-source solutions, interconnected by standard interfaces, provides value and performance, while preserving sufficient agility to address the evolving needs of the research community. Emphasis on the development of tools, use cases to demonstrate the utility of uniform data representation, and cloud-based analysis aim to ease adoption and help define best practices. Integration with other data in the broader NCI Cancer Research Data Commons infrastructure opens opportunities for multiomics studies incorporating imaging data to further empower the research community to accelerate breakthroughs in cancer detection, diagnosis, and treatment. Published under a CC BY 4.0 license.


Assuntos
Inteligência Artificial , Neoplasias , Estados Unidos , Humanos , National Cancer Institute (U.S.) , Reprodutibilidade dos Testes , Diagnóstico por Imagem , Multiômica , Neoplasias/diagnóstico por imagem
5.
Genes Chromosomes Cancer ; 62(8): 441-448, 2023 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-36695636

RESUMO

Cytogenetic analysis provides important information on the genetic mechanisms of cancer. The Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer (Mitelman DB) is the largest catalog of acquired chromosome aberrations, presently comprising >70 000 cases across multiple cancer types. Although this resource has enabled the identification of chromosome abnormalities leading to specific cancers and cancer mechanisms, a large-scale, systematic analysis of these aberrations and their downstream implications has been difficult due to the lack of a standard, automated mapping from aberrations to genomic coordinates. We previously introduced CytoConverter as a tool that automates such conversions. CytoConverter has now been updated with improved interpretation of karyotypes and has been integrated with the Mitelman DB, providing a comprehensive mapping of the 70 000+ cases to genomic coordinates, as well as visualization of the frequencies of chromosomal gains and losses. Importantly, all CytoConverter-generated genomic coordinates are publicly available in Google BigQuery, a cloud-based data warehouse, facilitating data exploration and integration with other datasets hosted by the Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC) Resource. We demonstrate the use of BigQuery for integrative analysis of Mitelman DB with other cancer datasets, including a comparison of the frequency of imbalances identified in Mitelman DB cases with those found in The Cancer Genome Atlas (TCGA) copy number datasets. This solution provides opportunities to leverage the power of cloud computing for low-cost, scalable, and integrated analysis of chromosome aberrations and gene fusions in cancer.


Assuntos
Computação em Nuvem , Neoplasias , Humanos , Aberrações Cromossômicas , Cariotipagem , Neoplasias/genética , Fusão Gênica
6.
Onco (Basel) ; 2(2): 129-144, 2022 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37841494

RESUMO

Whole genome sequencing (WGS) has helped to revolutionize biology, but the computational challenge remains for extracting valuable inferences from this information. Here, we present the cancer-associated variants from the Cancer Genome Atlas (TCGA) WGS dataset. This set of data will allow cancer researchers to further expand their analysis beyond the exomic regions of the genome to the entire genome. A total of 1342 WGS alignments available from the consortium were processed with VarScan2 and deposited to the NCI Cancer Cloud. The sample set covers 18 different cancers and reveals 157,313,519 pooled (non-unique) cancer-associated single-nucleotide variations (SNVs) across all samples. There was an average of 117,223 SNVs per sample, with a range from 1111 to 775,470 and a standard deviation of 163,273. The dataset was incorporated into BigQuery, which allows for fast access and cross-mapping, which will allow researchers to enrich their current studies with a plethora of newly available genomic data.

7.
F1000Res ; 11: 493, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36761837

RESUMO

Synthetic lethal interactions (SLIs), genetic interactions in which the simultaneous inactivation of two genes leads to a lethal phenotype, are promising targets for therapeutic intervention in cancer, as exemplified by the recent success of PARP inhibitors in treating BRCA1/2-deficient tumors. We present SL-Cloud, a new component of the Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC), that provides an integrated framework of cloud-hosted data resources and curated workflows to enable facile prediction of SLIs. This resource addresses two main challenges related to SLI inference: the need to wrangle and preprocess large multi-omic datasets and the availability of multiple comparable prediction approaches. SL-Cloud enables customizable computational inference of SLIs and testing of prediction approaches across multiple datasets. We anticipate that cancer researchers will find utility in this tool for discovery of SLIs to support further investigation into potential drug targets for anticancer therapies.


Assuntos
Computação em Nuvem , Neoplasias , Humanos , Neoplasias/genética , Biologia de Sistemas , Multiômica
8.
Front Plant Sci ; 12: 666075, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34527004

RESUMO

Native African cereals (sorghum, millets) ensure food security to millions of low-income people from low fertility and drought-prone regions of Africa and Asia. In spite of their agronomic importance, the genetic bases of their phenotype and adaptations are still not well-understood. Here we focus on Sorghum bicolor, which is the fifth cereal worldwide for grain production and constitutes the staple food for around 500 million people. We leverage transcriptomic resources to address the adaptive consequences of the domestication process. Gene expression and nucleotide variability were analyzed in 11 domesticated and nine wild accessions. We documented a downregulation of expression and a reduction of diversity both in nucleotide polymorphism (30%) and gene expression levels (18%) in domesticated sorghum. These findings at the genome-wide level support the occurrence of a global reduction of diversity during the domestication process, although several genes also showed patterns consistent with the action of selection. Nine hundred and forty-nine genes were significantly differentially expressed between wild and domesticated gene pools. Their functional annotation points to metabolic pathways most likely contributing to the sorghum domestication syndrome, such as photosynthesis and auxin metabolism. Coexpression network analyzes revealed 21 clusters of genes sharing similar expression patterns. Four clusters (totaling 2,449 genes) were significantly enriched in differentially expressed genes between the wild and domesticated pools and two were also enriched in domestication and improvement genes previously identified in sorghum. These findings reinforce the evidence that the combined and intricated effects of the domestication and improvement processes do not only affect the behaviors of a few genes but led to a large rewiring of the transcriptome. Overall, these analyzes pave the way toward the identification of key domestication genes valuable for genetic resources characterization and breeding purposes.

9.
Cancer Res ; 81(16): 4188-4193, 2021 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-34185678

RESUMO

The National Cancer Institute (NCI) Cancer Research Data Commons (CRDC) aims to establish a national cloud-based data science infrastructure. Imaging Data Commons (IDC) is a new component of CRDC supported by the Cancer Moonshot. The goal of IDC is to enable a broad spectrum of cancer researchers, with and without imaging expertise, to easily access and explore the value of deidentified imaging data and to support integrated analyses with nonimaging data. We achieve this goal by colocating versatile imaging collections with cloud-based computing resources and data exploration, visualization, and analysis tools. The IDC pilot was released in October 2020 and is being continuously populated with radiology and histopathology collections. IDC provides access to curated imaging collections, accompanied by documentation, a user forum, and a growing number of analysis use cases that aim to demonstrate the value of a data commons framework applied to cancer imaging research. SIGNIFICANCE: This study introduces NCI Imaging Data Commons, a new repository of the NCI Cancer Research Data Commons, which will support cancer imaging research on the cloud.


Assuntos
Diagnóstico por Imagem/métodos , National Cancer Institute (U.S.) , Neoplasias/diagnóstico por imagem , Neoplasias/genética , Pesquisa Biomédica/tendências , Computação em Nuvem , Biologia Computacional/métodos , Gráficos por Computador , Segurança Computacional , Interpretação Estatística de Dados , Bases de Dados Factuais , Diagnóstico por Imagem/normas , Humanos , Processamento de Imagem Assistida por Computador , Projetos Piloto , Linguagens de Programação , Radiologia/métodos , Radiologia/normas , Reprodutibilidade dos Testes , Software , Estados Unidos , Interface Usuário-Computador
10.
Front Plant Sci ; 11: 224, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32194601

RESUMO

Most sorghum biomass accumulates in stem secondary cell walls (SCW). As sorghum stems are used as raw materials for various purposes such as feed, energy and fiber reinforced polymers, identifying the genes responsible for SCW establishment is highly important. Taking advantage of studies performed in model species, most of the structural genes contributing at the molecular level to the SCW biosynthesis in sorghum have been proposed while their regulatory factors have mostly not been determined. Validation of the role of several MYB and NAC transcription factors in SCW regulation in Arabidopsis and a few other species has been provided. In this study, we contributed to the recent efforts made in grasses to uncover the mechanisms underlying SCW establishment. We reported updated phylogenies of NAC and MYB in 9 different species and exploited findings from other species to highlight candidate regulators of SCW in sorghum. We acquired expression data during sorghum internode development and used co-expression analyses to determine groups of co-expressed genes that are likely to be involved in SCW establishment. We were able to identify two groups of co-expressed genes presenting multiple evidences of involvement in SCW building. Gene enrichment analysis of MYB and NAC genes provided evidence that while NAC SECONDARY WALL THICKENING PROMOTING FACTOR NST genes and SECONDARY WALL-ASSOCIATED NAC DOMAIN PROTEIN gene functions appear to be conserved in sorghum, NAC master regulators of SCW in sorghum may not be as tissue compartmentalized as in Arabidopsis. We showed that for every homolog of the key SCW MYB in Arabidopsis, a similar role is expected for sorghum. In addition, we unveiled sorghum MYB and NAC that have not been identified to date as being involved in cell wall regulation. Although specific validation of the MYB and NAC genes uncovered in this study is needed, we provide a network of sorghum genes involved in SCW both at the structural and regulatory levels.

11.
Genetica ; 147(2): 205-216, 2019 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-31054007

RESUMO

Information about population structure and genetic relationships within and among wild and brazilian Coffea arabica L. genotypes is highly relevant to optimize the use of genetic resources for breeding purposes. In this study, we evaluated genetic diversity, clustering analysis based on Jaccard's coefficient and population structure in 33 genotypes of C. arabica and of three diploid Coffea species (C. canephora, C. eugenioides and C. racemosa) using 30 SSR markers. A total of 206 alleles were identified, with a mean of 6.9 over all loci. The set of SSR markers was able to discriminate all genotypes and revealed that Ethiopian accessions presented higher genetic diversity than commercial varieties. Population structure analysis indicated two genetic groups, one corresponding to Ethiopian accessions and another corresponding predominantly to commercial cultivars. Thirty-four private alleles were detected in the group of accessions collected from West side of Great Rift Valley. We observed a lower average genetic distance of the C. arabica genotypes in relation to C. eugenioides than C. canephora. Interestingly, commercial cultivars were genetically closer to C. eugenioides than C. canephora and C. racemosa. The great allelic richness observed in Ethiopian Arabica coffee, especially in Western group showed that these accessions can be potential source of new alleles to be explored by coffee breeding programs.


Assuntos
Coffea/genética , Repetições de Microssatélites , Polimorfismo Genético , Coffea/classificação , Genótipo , Técnicas de Genotipagem/métodos , Técnicas de Genotipagem/normas , Filogenia , Melhoramento Vegetal/métodos
12.
Sci Rep ; 8(1): 465, 2018 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-29323254

RESUMO

Lipids, including the diterpenes cafestol and kahweol, are key compounds that contribute to the quality of coffee beverages. We determined total lipid content and cafestol and kahweol concentrations in green beans and genotyped 107 Coffea arabica accessions, including wild genotypes from the historical FAO collection from Ethiopia. A genome-wide association study was performed to identify genomic regions associated with lipid, cafestol and kahweol contents and cafestol/kahweol ratio. Using the diploid Coffea canephora genome as a reference, we identified 6,696 SNPs. Population structure analyses suggested the presence of two to three groups (K = 2 and K = 3) corresponding to the east and west sides of the Great Rift Valley and an additional group formed by wild accessions collected in western forests. We identified 5 SNPs associated with lipid content, 4 with cafestol, 3 with kahweol and 9 with cafestol/kahweol ratio. Most of these SNPs are located inside or near candidate genes related to metabolic pathways of these chemical compounds in coffee beans. In addition, three trait-associated SNPs showed evidence of directional selection among cultivated and wild coffee accessions. Our results also confirm a great allelic richness in wild accessions from Ethiopia, especially in accessions originating from forests in the west side of the Great Rift Valley.


Assuntos
Coffea/química , Diterpenos/análise , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único , Vias Biossintéticas , Coffea/genética , Diterpenos/metabolismo , Lipídeos/análise , Lipídeos/biossíntese , Proteínas de Plantas/genética , Locos de Características Quantitativas , Análise de Sequência de DNA/métodos
13.
Cancer Res ; 77(21): e7-e10, 2017 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-29092928

RESUMO

The ISB Cancer Genomics Cloud (ISB-CGC) is one of three pilot projects funded by the National Cancer Institute to explore new approaches to computing on large cancer datasets in a cloud environment. With a focus on Data as a Service, the ISB-CGC offers multiple avenues for accessing and analyzing The Cancer Genome Atlas, TARGET, and other important references such as GENCODE and COSMIC using the Google Cloud Platform. The open approach allows researchers to choose approaches best suited to the task at hand: from analyzing terabytes of data using complex workflows to developing new analysis methods in common languages such as Python, R, and SQL; to using an interactive web application to create synthetic patient cohorts and to explore the wealth of available genomic data. Links to resources and documentation can be found at www.isb-cgc.org Cancer Res; 77(21); e7-10. ©2017 AACR.


Assuntos
Computação em Nuvem , Biologia Computacional , Genômica , Neoplasias/genética , Conjuntos de Dados como Assunto , Genoma Humano , Humanos , Internet , National Cancer Institute (U.S.) , Pesquisa/tendências , Software , Estados Unidos
14.
Front Plant Sci ; 8: 1516, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28919904

RESUMO

Sorghum is increasingly used as a biomass crop worldwide. Its genetic diversity provides a large range of stem biochemical composition suitable for various end-uses as bioenergy or forage. Its drought tolerance enables it to reasonably sustain biomass production under water limited conditions. However, drought effect on the accumulation of sorghum stem biomass remains poorly understood which limits progress in crop improvement and management. This study aimed at identifying the morphological, biochemical and histological traits underlying biomass accumulation in the sorghum stem and its plasticity in response to water deficit. Two hybrids (G1, G4) different in stem biochemical composition (G4, more lignified, less sweet) were evaluated during 2 years in the field in Southern France, under two water treatments differentiated during stem elongation (irrigated; 1 month dry-down until an average soil water deficit of -8.85 bars). Plant phenology was observed weekly. At the end of the water treatment and at final harvest, plant height, stem and leaf dry-weight and the size, biochemical composition and tissue histology of internodes at 2-4 positions along the stem were measured. Stem biomass accumulation was significantly reduced by drought (in average 42% at the end of the dry-down). This was due to the reduction of the length, but not diameter, of the internodes expanded during water deficit. These internodes had more soluble sugar but lower lignin and cellulose contents. This was associated with a decrease of the areal proportion of lignified cell wall in internode outer zone whereas the areal proportion of this zone was not affected. All internodes for a given genotype and environment followed a common histochemical dynamics. Hemicellulose content and the areal proportion of inner vs. outer internode tissues were set up early during internode growth and were not drought responsive. G4 exhibited a higher drought sensitivity than G1 for plant height only. At final harvest, the stem dry weight was only 18% lower in water deficit (re-watered) compared to well-watered treatment and internodes growing during re-watering were similar to those on the well-watered plants. These results are being valorized to refine the phenotyping of sorghum diversity panels and breeding populations.

15.
PLoS One ; 12(9): e0183454, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28886042

RESUMO

Domestication is known to strongly reduce genomic diversity through population bottlenecks. The resulting loss of polymorphism has been thoroughly documented in numerous cultivated species. Here we investigate the impact of domestication on the diversity of alternative transcript expressions using RNAseq data obtained on cultivated and wild sorghum accessions (ten accessions for each pool). In that aim, we focus on genes expressing two isoforms in sorghum and estimate the ratio between expression levels of those isoforms in each accession. Noticeably, for a given gene, one isoform can either be overexpressed or underexpressed in some wild accessions, whereas in the cultivated accessions, the balance between the two isoforms of the same gene appears to be much more homogenous. Indeed, we observe in sorghum significantly more variation in isoform expression balance among wild accessions than among domesticated accessions. The possibility exists that the loss of nucleotide diversity due to domestication could affect regulatory elements, controlling transcription or degradation of these isoforms. Impact on the isoform expression balance is discussed. As far as we know, this is the first time that the impact of domestication on transcript isoform balance has been studied at the genomic scale. This could pave the way towards the identification of key domestication genes with finely tuned isoform expressions in domesticated accessions while being highly variable in their wild relatives.


Assuntos
Processamento Alternativo/genética , Sorghum/genética , Sorghum/metabolismo , Processamento Alternativo/fisiologia , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo
16.
PLoS Genet ; 13(5): e1006799, 2017 May.
Artigo em Inglês | MEDLINE | ID: mdl-28531201

RESUMO

Base composition is highly variable among and within plant genomes, especially at third codon positions, ranging from GC-poor and homogeneous species to GC-rich and highly heterogeneous ones (particularly Monocots). Consequently, synonymous codon usage is biased in most species, even when base composition is relatively homogeneous. The causes of these variations are still under debate, with three main forces being possibly involved: mutational bias, selection and GC-biased gene conversion (gBGC). So far, both selection and gBGC have been detected in some species but how their relative strength varies among and within species remains unclear. Population genetics approaches allow to jointly estimating the intensity of selection, gBGC and mutational bias. We extended a recently developed method and applied it to a large population genomic dataset based on transcriptome sequencing of 11 angiosperm species spread across the phylogeny. We found that at synonymous positions, base composition is far from mutation-drift equilibrium in most genomes and that gBGC is a widespread and stronger process than selection. gBGC could strongly contribute to base composition variation among plant species, implying that it should be taken into account in plant genome analyses, especially for GC-rich ones.


Assuntos
Evolução Molecular , Genoma de Planta , Magnoliopsida/genética , Polimorfismo Genético , Sequência Rica em GC , Conversão Gênica , Seleção Genética
17.
Carbohydr Polym ; 167: 12-19, 2017 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-28433145

RESUMO

The hypothesis made is that thermal resistance of sorghum and miscanthus stem pieces taken at well-defined positions of the stem is simply related to their biochemical composition. For miscanthus, two different genotypes and two internode levels were selected. For each region, the stem was divided into three radial layers. For sorghum, two different genotypes were selected and the stem was divided into the same three radial layers. The results show that the thermal analysis is only sensitive to very large variations of compositions. But aside of such large composition differences, it is impossible to correlate thermal effects to biochemical composition even on very small size, well-identified pieces of plant materials. The interplay between sugar-based components, lignin and minerals is totally blurring the thermal response. Extreme care must be exercised when willing to explain why a given plant material has a thermal behaviour different of another plant material.


Assuntos
Lignina/química , Caules de Planta/química , Poaceae/química , Sorghum/química , Genótipo , Poaceae/genética , Sorghum/genética , Temperatura
18.
PLoS One ; 12(1): e0169595, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28068432

RESUMO

Coffea arabica L. is an important crop in several developing countries. Despite its economic importance, minimal transcriptome data are available for fruit tissues, especially during fruit development where several compounds related to coffee quality are produced. To understand the molecular aspects related to coffee fruit and grain development, we report a large-scale transcriptome analysis of leaf, flower and perisperm fruit tissue development. Illumina sequencing yielded 41,881,572 high-quality filtered reads. De novo assembly generated 65,364 unigenes with an average length of 1,264 bp. A total of 24,548 unigenes were annotated as protein coding genes, including 12,560 full-length sequences. In the annotation process, we identified nine candidate genes related to the biosynthesis of raffinose family oligossacarides (RFOs). These sugars confer osmoprotection and are accumulated during initial fruit development. Four genes from this pathway had their transcriptional pattern validated by quantitative reverse transcription polymerase chain reaction (qRT-PCR). Furthermore, we identified ~24,000 putative target sites for microRNAs (miRNAs) and 134 putative transcriptionally active transposable elements (TE) sequences in our dataset. This C. arabica transcriptomic atlas provides an important step for identifying candidate genes related to several coffee metabolic pathways, especially those related to fruit chemical composition and therefore beverage quality. Our results are the starting point for enhancing our knowledge about the coffee genes that are transcribed during the flowering and initial fruit development stages.


Assuntos
Coffea/genética , Coffea/metabolismo , Flores/genética , Frutas/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Folhas de Planta/genética , Rafinose/biossíntese , Biologia Computacional/métodos , Elementos de DNA Transponíveis , Anotação de Sequência Molecular , Fases de Leitura Aberta , Especificidade de Órgãos/genética , Transcriptoma
19.
Mol Ecol Resour ; 17(3): 565-580, 2017 May.
Artigo em Inglês | MEDLINE | ID: mdl-27487989

RESUMO

We produced a unique large data set of reference transcriptomes to obtain new knowledge about the evolution of plant genomes and crop domestication. For this purpose, we validated a RNA-Seq data assembly protocol to perform comparative population genomics. For the validation, we assessed and compared the quality of de novo Illumina short-read assemblies using data from two crops for which an annotated reference genome was available, namely grapevine and sorghum. We used the same protocol for the release of 26 new transcriptomes of crop plants and wild relatives, including still understudied crops such as yam, pearl millet and fonio. The species list has a wide taxonomic representation with the inclusion of 15 monocots and 11 eudicots. All contigs were annotated using BLAST, prot4EST and Blast2GO. A strong originality of the data set is that each crop is associated with close relative species, which will permit whole-genome comparative evolutionary studies between crops and their wild-related species. This large resource will thus serve research communities working on both crops and model organisms. All the data are available at http://arcad-bioinformatics.southgreen.fr/.


Assuntos
Produtos Agrícolas/genética , Genoma de Planta , Metagenômica , Transcriptoma , Evolução Biológica , Mapeamento de Sequências Contíguas
20.
Plant Physiol Biochem ; 111: 340-347, 2017 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28002787

RESUMO

Lipids are among the major chemical compounds present in coffee beans, and they affect the flavor and aroma of the coffee beverage. Coffee oil is rich in kaurene diterpene compounds, mainly cafestol (CAF) and kahweol (KAH), which are related to plant defense mechanisms and to nutraceutical and sensorial beverage characteristics. Despite their importance, the final steps of coffee diterpenes biosynthesis remain unknown. To understand the molecular basis of coffee diterpenes biosynthesis, we report the content dynamics of CAF and KAH in several Coffea arabica tissues and the transcriptional analysis of cytochrome P450 genes (P450). We measured CAF and KAH concentrations in leaves, roots, flower buds, flowers and fruit tissues at seven developmental stages (30-240 days after flowering - DAF) using HPLC. Higher CAF levels were detected in flower buds and flowers when compared to fruits. In contrast, KAH concentration increased along fruit development, peaking at 120 DAF. We did not detect CAF or KAH in leaves, and higher amounts of KAH than CAF were detected in roots. Using P450 candidate genes from a coffee EST database, we performed RT-qPCR transcriptional analysis of leaves, flowers and fruits at three developmental stages (90, 120 and 150 DAF). Three P450 genes (CaCYP76C4, CaCYP82C2 and CaCYP74A1) had transcriptional patterns similar to CAF concentration and two P450 genes (CaCYP71A25 and CaCYP701A3) have transcript accumulation similar to KAH concentration. These data warrant further investigation of these P450s as potential candidate genes involved in the final stages of the CAF and KAH biosynthetic pathways.


Assuntos
Coffea/genética , Sistema Enzimático do Citocromo P-450/genética , Diterpenos/metabolismo , Flores/enzimologia , Frutas/crescimento & desenvolvimento , Folhas de Planta/enzimologia , Raízes de Plantas/enzimologia , Transcrição Gênica , Cromatografia Líquida de Alta Pressão , Coffea/crescimento & desenvolvimento , Diterpenos/análise , Flores/genética , Frutas/genética , Regulação da Expressão Gênica de Plantas , Genes de Plantas , Estudos de Associação Genética , Folhas de Planta/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Raízes de Plantas/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...