1.
Bioinformatics ; 38(5): 1470-1472, 2022 02 07.
Article in English | MEDLINE | ID: mdl-34904638

ABSTRACT

SUMMARY: We have implemented the pypgatk package and the pgdb workflow to create proteogenomics databases based on ENSEMBL resources. The tools allow the generation of protein sequences from novel protein-coding transcripts by performing a three-frame translation of pseudogenes, lncRNAs and other non-canonical transcripts, such as those produced by alternative splicing events. It also includes exonic out-of-frame translation from otherwise canonical protein-coding mRNAs. Moreover, the tool enables the generation of variant protein sequences from multiple sources of genomic variants including COSMIC, cBioportal, gnomAD and mutations detected from sequencing of patient samples. pypgatk and pgdb provide multiple functionalities for database handling including optimized target/decoy generation by the algorithm DecoyPyrat. Finally, we have reanalyzed six public datasets in PRIDE by generating cell-type specific databases for 65 cell lines using the pypgatk and pgdb workflow, revealing a wealth of non-canonical or cryptic peptides amounting to >5% of the total number of peptides identified. AVAILABILITY AND IMPLEMENTATION: The software is freely available. pypgatk: https://github.com/bigbio/py-pgatk/ and pgdb: https://nf-co.re/pgdb. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Proteogenomics , Humans , Peptides/genetics , Software , Algorithms , Proteins
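
The three-frame translation mentioned in the abstract of entry 1 can be illustrated with a minimal sketch. This is not the pypgatk API: the function names `translate` and `three_frame_translation` and the `min_len` parameter are hypothetical, the standard genetic code is assumed, and only the forward strand is translated.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame):
    """Translate a nucleotide sequence starting at offset `frame` (0, 1 or 2).

    Stop codons become '*'; codons with ambiguous bases become 'X'.
    A trailing partial codon is ignored.
    """
    seq = seq.upper().replace("U", "T")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def three_frame_translation(seq, min_len=1):
    """Return, per forward frame, the stop-free segments of at least `min_len` residues."""
    segments = {}
    for frame in range(3):
        protein = translate(seq, frame)
        segments[frame] = [p for p in protein.split("*") if len(p) >= min_len]
    return segments


if __name__ == "__main__":
    # Hypothetical toy transcript, not taken from any real annotation.
    print(three_frame_translation("ATGGCCTAAGGT"))
```

In practice a larger `min_len` would typically be chosen so that very short stop-to-stop fragments do not inflate the search database.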
2.
Nucleic Acids Res ; 44(19): 9110-9120, 2016 Nov 02.
Article in English | MEDLINE | ID: mdl-27625394

ABSTRACT

Gene transcription is regulated mainly by transcription factors (TFs). ENCODE and Roadmap Epigenomics provide global binding profiles of TFs, which can be used to identify regulatory regions. To this end we implemented a method to systematically construct cell-type and species-specific maps of regulatory regions and TF-TF interactions. We illustrated the approach by developing maps for five human cell-lines and two other species. We detected ∼144k putative regulatory regions among the human cell-lines, with the majority of them being ∼300 bp. We found ∼20k putative regulatory elements in the ENCODE heterochromatic domains suggesting a large regulatory potential in the regions presumed transcriptionally silent. Among the most significant TF interactions identified in the heterochromatic regions were CTCF and the cohesin complex, which is in agreement with previous reports. Finally, we investigated the enrichment of the obtained putative regulatory regions in the 3D chromatin domains. More than 90% of the regions were discovered in the 3D contacting domains. We found a significant enrichment of GWAS SNPs in the putative regulatory regions. These significant enrichments provide evidence that the regulatory regions play a crucial role in the genomic structural stability. Additionally, we generated maps of putative regulatory regions for prostate and colorectal cancer human cell-lines.


Subjects
Genomics , Nucleic Acid Regulatory Sequences , Binding Sites , Cell Line , Chromatin/genetics , Chromatin/metabolism , Chromatin Immunoprecipitation , Chromosome Mapping , Computational Biology/methods , Human Genome , Genome-Wide Association Study , Genomics/methods , High-Throughput Nucleotide Sequencing , Humans , Molecular Sequence Annotation , Single Nucleotide Polymorphism , Protein Binding , Protein Interaction Mapping , Protein Interaction Maps , Transcription Factors/metabolism
3.
Hum Mutat ; 37(9): 904-13, 2016 09.
Article in English | MEDLINE | ID: mdl-27174533

ABSTRACT

Somatic mutations drive cancer and there are established ways to study those in coding sequences. It has been shown that some regulatory mutations are over-represented in cancer. We develop a new strategy to find putative regulatory mutations based on experimentally established motifs for transcription factors (TFs). In total, we find 1,552 candidate regulatory mutations predicted to significantly reduce binding affinity of many TFs in hepatocellular carcinoma and affecting binding of CTCF also in esophagus, gastric, and pancreatic cancers. Near mutated motifs, there is a significant enrichment of (1) genes mutated in cancer, (2) tumor-suppressor genes, (3) genes in KEGG cancer pathways, and (4) sets of genes previously associated with cancer. Experimental and functional validations support the findings. The strategy can be applied to identify regulatory mutations in any cell type with established TF motifs and will aid the identification of genes contributing to cancer.


Subjects
Hepatocellular Carcinoma/genetics , Liver Neoplasms/genetics , Mutation , Transcription Factors/genetics , Binding Sites , Genetic Databases , Neoplastic Gene Expression Regulation , Gene Regulatory Networks , Genetic Predisposition to Disease , Hep G2 Cells , Humans , Protein Binding , DNA Sequence Analysis , Transcription Factors/metabolism
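
To make the idea in entry 3 of a mutation that "reduces binding affinity" at a TF motif concrete, the sketch below scores a reference and a mutated site against a position frequency matrix and reports the change in log-odds score. It is only an illustration under stated assumptions: the abstract does not describe the scoring model, so the log-odds formula, the uniform background, the pseudocount, the function names and the toy matrix `TOY_PFM` are all hypothetical.

```python
import math

# Hypothetical 4-position frequency matrix (counts per base); not a real TF motif.
TOY_PFM = [
    {"A": 8, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 8, "G": 0, "T": 0},
    {"A": 0, "C": 0, "G": 8, "T": 0},
    {"A": 2, "C": 2, "G": 2, "T": 2},
]


def log_odds_score(site, pfm, background=0.25, pseudocount=0.5):
    """Sum of per-position log2(probability / background) for `site`."""
    score = 0.0
    for position, base in enumerate(site):
        counts = pfm[position]
        total = sum(counts.values()) + 4 * pseudocount
        probability = (counts[base] + pseudocount) / total
        score += math.log2(probability / background)
    return score


def mutation_effect(site, offset, alt, pfm):
    """Score change caused by substituting `alt` at `offset`; negative means weaker match."""
    mutated = site[:offset] + alt + site[offset + 1:]
    return log_odds_score(mutated, pfm) - log_odds_score(site, pfm)


if __name__ == "__main__":
    print(mutation_effect("ACGT", 1, "T", TOY_PFM))  # substitution at a conserved position
    print(mutation_effect("ACGT", 3, "A", TOY_PFM))  # substitution at an uninformative position
```

A substitution at a strongly conserved position lowers the score, whereas one at an uninformative position leaves it unchanged; deciding which score changes are significant would need a null model that this sketch does not include.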
4.
BMC Bioinformatics ; 14: 280, 2013 Sep 23.
Article in English | MEDLINE | ID: mdl-24059758

ABSTRACT

BACKGROUND: Finding peaks in ChIP-seq is an important process in biological inference. In some cases, such as positioning nucleosomes with specific histone modifications or finding transcription factor binding specificities, the precision of the detected peak plays a significant role. There are several applications for finding peaks (called peak finders) based on different algorithms (e.g. MACS, Erange and HPeak). Benchmark studies have shown that the existing peak finders identify different peaks for the same dataset and it is not known which one is the most accurate. We present the first meta-server called Peak Finder MetaServer (PFMS) that collects results from several peak finders and produces consensus peaks. Our application accepts three standard ChIP-seq data formats: BED, BAM, and SAM. RESULTS: Sensitivity and specificity of seven widely used peak finders were examined. For the experiments we used three previously studied Transcription Factors (TF) ChIP-seq datasets and identified three of the selected peak finders that returned results with high specificity and very good sensitivity compared to the remaining four. We also ran PFMS using the three selected peak finders on the same TF datasets and achieved higher specificity and sensitivity than the peak finders individually. CONCLUSIONS: We show that combining outputs from up to seven peak finders yields better results than individual peak finders. In addition, three of the seven peak finders outperform the remaining four, and running PFMS with these three returns even more accurate results. Another added value of PFMS is a separate report of the peaks returned by each of the included peak finders.


Subjects
Chromatin Immunoprecipitation , Computational Biology/methods , Genetic Databases , DNA Sequence Analysis/methods , Software , Transcription Factors/genetics , Transcription Factors/metabolism
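
One simple way to derive consensus peaks from several peak finders, as entry 4 describes, is to keep the regions covered by at least a chosen number of callers. The sketch below does this with a sweep over interval endpoints. It is not the PFMS algorithm, which the abstract does not specify; the function name `consensus_peaks`, the `min_support` parameter and the toy coordinates are assumptions. Intervals are half-open `(start, end)` pairs on one chromosome with `start < end`, and each caller's own peaks are assumed not to overlap each other.

```python
def consensus_peaks(peak_sets, min_support=2):
    """Return regions covered by peaks from at least `min_support` callers."""
    events = []
    for peaks in peak_sets:
        for start, end in peaks:
            events.append((start, 1))   # a peak opens
            events.append((end, -1))    # a peak closes
    # At equal coordinates (pos, -1) sorts before (pos, 1), so peaks that
    # merely touch are not counted as overlapping.
    events.sort()

    consensus = []
    depth = 0
    region_start = None
    for position, delta in events:
        previous = depth
        depth += delta
        if previous < min_support <= depth:
            region_start = position
        elif depth < min_support <= previous:
            consensus.append((region_start, position))
            region_start = None
    return consensus


if __name__ == "__main__":
    # Hypothetical peak calls from three unnamed callers on one chromosome.
    caller_a = [(100, 200), (500, 600)]
    caller_b = [(150, 250), (580, 650)]
    caller_c = [(180, 220), (900, 950)]
    print(consensus_peaks([caller_a, caller_b, caller_c], min_support=2))
```

Raising `min_support` trades sensitivity for specificity: with the toy input above, requiring all three callers leaves only the narrow region where every call overlaps.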
5.
Mol Oncol ; 17(2): 238-260, 2023 02.
Article in English | MEDLINE | ID: mdl-36495079

ABSTRACT

Glioblastoma (GBM) cancer stem cells (GSCs) contribute to GBM's origin, recurrence, and resistance to treatment. However, the understanding of how mRNA expression patterns of GBM subtypes are reflected at global proteome level in GSCs is limited. To characterize protein expression in GSCs, we performed in-depth proteogenomic analysis of patient-derived GSCs by RNA-sequencing and mass-spectrometry. We quantified > 10 000 proteins in two independent GSC panels and propose a GSC-associated proteomic signature characterizing two distinct phenotypic conditions; one defined by proteins upregulated in proneural and classical GSCs (GPC-like), and another by proteins upregulated in mesenchymal GSCs (GM-like). The GM-like protein set in GBM tissue was associated with necrosis, recurrence, and worse overall survival. Through proteogenomics, we discovered 252 non-canonical peptides in the GSCs, i.e., protein sequences that are variant or derive from genome regions previously considered non-protein-coding, including variants of the heterogeneous ribonucleoproteins implicated in RNA splicing. In summary, GSCs express two protein sets that have an inverse association with clinical outcomes in GBM. The discovery of non-canonical protein sequences questions existing gene models and pinpoints new protein targets for research in GBM.


Subjects
Brain Neoplasms , Glioblastoma , Humans , Glioblastoma/genetics , Glioblastoma/metabolism , Proteomics , Brain Neoplasms/metabolism , Neoplastic Stem Cells/metabolism , Tumor Cell Line
6.
Leukemia ; 37(3): 550-559, 2023 03.
Article in English | MEDLINE | ID: mdl-36572751

ABSTRACT

Despite improvement of current treatment strategies and novel targeted drugs, relapse and treatment resistance largely determine the outcome for acute myeloid leukemia (AML) patients. To identify the underlying molecular characteristics, numerous studies have aimed to decipher the genomic and transcriptomic landscape of AML. Nevertheless, further molecular changes allowing malignant cells to escape treatment remain to be elucidated. Mass spectrometry is a powerful tool enabling detailed insights into proteomic changes that could explain AML relapse and resistance. Here, we investigated AML samples from 47 adult and 22 pediatric patients at serial time-points during disease progression using mass spectrometry-based in-depth proteomics. We show that the proteomic profile at relapse is enriched for mitochondrial ribosomal proteins and subunits of the respiratory chain complex, indicative of reprogrammed energy metabolism from diagnosis to relapse. Further, higher levels of granzymes and lower levels of the anti-inflammatory protein CR1/CD35 suggest an inflammatory signature promoting disease progression. Finally, through a proteogenomic approach, we detected novel peptides, which present a promising repertoire in the search for biomarkers and tumor-specific druggable targets. Altogether, this study highlights the importance of proteomic studies in holistic approaches to improve treatment and survival of AML patients.


Subjects
Acute Myeloid Leukemia , Proteogenomics , Humans , Child , Adult , Proteomics/methods , Acute Myeloid Leukemia/drug therapy , Acute Myeloid Leukemia/genetics , Acute Myeloid Leukemia/pathology , Recurrence , Disease Progression
7.
Life Sci Alliance ; 4(9)2021 09.
Article in English | MEDLINE | ID: mdl-34282050

ABSTRACT

In a cancer genome, the noncoding sequence contains the vast majority of somatic mutations. While very few are expected to be cancer drivers, those affecting regulatory elements have the potential to have downstream effects on gene regulation that may contribute to cancer progression. To prioritize regulatory mutations, we screened somatic mutations in the Pan-Cancer Analysis of Whole Genomes cohort of 2,515 cancer genomes on individual bases to assess their potential regulatory roles in their respective cancer types. We found a highly significant enrichment of regulatory mutations associated with the deamination signature overlapping a CpG site in the CCAAT/Enhancer Binding Protein β recognition sites in many cancer types. Overall, 5,749 mutated regulatory elements were identified in 1,844 tumor samples from 39 cohorts containing 11,962 candidate regulatory mutations. Our analysis indicated 20 or more regulatory mutations in 5.5% of the samples, and an overall average of six per tumor. Several recurrent elements were identified, and major cancer-related pathways were significantly enriched for genes nearby the mutated regulatory elements. Our results provide a detailed view of the role of regulatory elements in cancer genomes.


Subjects
Computational Biology , Genomics , Molecular Sequence Annotation , Mutation , Neoplasms/genetics , Untranslated Regions , Binding Sites , Tumor Biomarkers , Computational Biology/methods , Disease Susceptibility , Neoplastic Gene Expression Regulation , Genetic Predisposition to Disease , Genomics/methods , Humans , Mutation Rate , Neoplasms/metabolism , Nucleotide Motifs , Protein Binding , Nucleic Acid Regulatory Sequences , Signal Transduction , Transcription Factors/metabolism
8.
Cancers (Basel) ; 13(14)2021 Jul 07.
Article in English | MEDLINE | ID: mdl-34298622

ABSTRACT

Knowledge of clinically targetable tumor antigens is becoming vital for broader design and utility of therapeutic cancer vaccines. This information is obtained reliably by directly interrogating the MHC-I presented peptide ligands, the immunopeptidome, with state-of-the-art mass spectrometry. Our manuscript describes direct identification of novel tumor antigens for an aggressive triple-negative breast cancer model. Immunopeptidome profiling revealed 2481 unique antigens, among them a novel ERV antigen originating from an endogenous retrovirus element. The clinical benefit and tumor control potential of the identified tumor antigens and ERV antigen were studied in a preclinical model using two vaccine platforms and therapeutic settings. Prominent control of established tumors was achieved using an oncolytic adenovirus platform designed for flexible and specific tumor targeting, namely PeptiCRAd. Our study presents a pipeline integrating immunopeptidome analysis-driven antigen discovery with a therapeutic cancer vaccine platform for improved personalized oncolytic immunotherapy.

9.
Nat Cancer ; 2(11): 1224-1242, 2021 11.
Article in English | MEDLINE | ID: mdl-34870237

ABSTRACT

Despite major advancements in lung cancer treatment, long-term survival is still rare, and a deeper understanding of molecular phenotypes would allow the identification of specific cancer dependencies and immune evasion mechanisms. Here we performed in-depth mass spectrometry (MS)-based proteogenomic analysis of 141 tumors representing all major histologies of non-small cell lung cancer (NSCLC). We identified six distinct proteome subtypes with striking differences in immune cell composition and subtype-specific expression of immune checkpoints. Unexpectedly, high neoantigen burden was linked to global hypomethylation and complex neoantigens mapped to genomic regions, such as endogenous retroviral elements and introns, in immune-cold subtypes. Further, we linked immune evasion with LAG3 via STK11 mutation-dependent HNF1A activation and FGL1 expression. Finally, we develop a data-independent acquisition MS-based NSCLC subtype classification method, validate it in an independent cohort of 208 NSCLC cases and demonstrate its clinical utility by analyzing an additional cohort of 84 late-stage NSCLC biopsy samples.


Subjects
Non-Small-Cell Lung Carcinoma , Lung Neoplasms , Proteogenomics , Non-Small-Cell Lung Carcinoma/genetics , Fibrinogen/therapeutic use , Genomics/methods , Humans , Immune Evasion/genetics , Lung Neoplasms/genetics
10.
Sci Rep ; 9(1): 2695, 2019 02 25.
Article in English | MEDLINE | ID: mdl-30804403

ABSTRACT

Several Genome Wide Association Studies (GWAS) have reported variants associated with immune diseases. However, the identified variants are rarely the drivers of the associations and the molecular mechanisms behind the genetic contributions remain poorly understood. ChIP-seq data for TFs and histone modifications provide snapshots of protein-DNA interactions allowing the identification of heterozygous SNPs showing significant allele-specific signals (AS-SNPs). AS-SNPs can change a TF binding site resulting in altered gene regulation and are primary candidates to explain associations observed in GWAS and expression studies. We identified 17,293 unique AS-SNPs across 7 lymphoblastoid cell lines. In this set of cell lines we interrogated 85% of common genetic variants in the population for potential regulatory effect and we identified 237 AS-SNPs associated with immune GWAS traits and 714 with gene expression in B cells. To elucidate possible regulatory mechanisms, we integrated long-range 3D interaction data to identify putative target genes and motif predictions to identify TFs whose binding may be affected by AS-SNPs, yielding a collection of 173 AS-SNPs associated with gene expression and 60 with B cell-related traits. We present a systems strategy to find functional gene regulatory variants, the TFs that bind differentially between alleles and novel strategies to detect the regulated genes.


Subjects
Chromatin/metabolism , Alleles , Binding Sites , Chromatin/genetics , Genetic Predisposition to Disease , Genome-Wide Association Study/methods , HLA Antigens/genetics , HLA Antigens/metabolism , Humans , Single Nucleotide Polymorphism/genetics , Protein Binding/genetics , Protein Binding/physiology
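
The allele-specific signal at a heterozygous SNP described in entry 10 is commonly assessed by asking whether ChIP-seq reads split evenly between the two alleles. The sketch below uses an exact two-sided binomial test for that purpose. This is an assumption for illustration: the abstract does not state which test the authors used, and the function name and read counts are hypothetical.

```python
from math import comb


def allelic_imbalance_p(ref_reads, alt_reads, expected_ref_fraction=0.5):
    """Exact two-sided binomial p-value for an uneven split of reads between alleles.

    Sums the probabilities of all outcomes that are no more likely than the
    observed one under the null hypothesis of balanced binding.
    """
    n = ref_reads + alt_reads
    p = expected_ref_fraction
    pmf = [comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n + 1)]
    observed = pmf[ref_reads]
    # Small tolerance so ties are not lost to floating-point rounding.
    tail = sum(x for x in pmf if x <= observed * (1 + 1e-9))
    return min(1.0, tail)


if __name__ == "__main__":
    # Hypothetical read counts at two heterozygous SNPs.
    print(allelic_imbalance_p(5, 5))    # balanced coverage
    print(allelic_imbalance_p(10, 0))   # all reads on the reference allele
```

A genome-wide analysis would additionally need to correct for multiple testing and for reference-allele mapping bias, neither of which this sketch addresses.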