Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
1.
Mol Cell ; 73(2): 212-223.e7, 2019 01 17.
Artículo en Inglés | MEDLINE | ID: mdl-30554942

RESUMEN

Cohesin subunits are frequently mutated in cancer, but how they function as tumor suppressors is unknown. Cohesin mediates sister chromatid cohesion, but this is not always perturbed in cancer cells. Here, we identify a previously unknown role for cohesin. We find that cohesin is required to repress transcription at DNA double-strand breaks (DSBs). Notably, cohesin represses transcription at DSBs throughout interphase, indicating that this is distinct from its known role in mediating DNA repair through sister chromatid cohesion. We identified a cancer-associated SA2 mutation that supports sister chromatid cohesion but is unable to repress transcription at DSBs. We further show that failure to repress transcription at DSBs leads to large-scale genome rearrangements. Cancer samples lacking SA2 display mutational patterns consistent with loss of this pathway. These findings uncover a new function for cohesin that provides insights into its frequent loss in cancer.


Asunto(s)
Neoplasias Óseas/genética , Proteínas de Ciclo Celular/genética , Proteínas Cromosómicas no Histona/genética , Roturas del ADN de Doble Cadena , Inestabilidad Genómica , Interfase , Osteosarcoma/genética , Transcripción Genética , Antígenos Nucleares/genética , Antígenos Nucleares/metabolismo , Neoplasias Óseas/metabolismo , Neoplasias Óseas/patología , Proteínas de Ciclo Celular/metabolismo , Línea Celular Tumoral , Proteínas Cromosómicas no Histona/metabolismo , Segregación Cromosómica , Reparación del ADN , Regulación hacia Abajo , Fase G1 , Fase G2 , Regulación Neoplásica de la Expresión Génica , Humanos , Osteosarcoma/metabolismo , Osteosarcoma/patología , Transducción de Señal , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Cohesinas
2.
J Exp Biol ; 225(7)2022 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-35403696

RESUMEN

Applications of key technologies in biomedical research, such as qRT-PCR or LC-MS-based proteomics, are generating large biological (-omics) datasets which are useful for the identification and quantification of biomarkers in any research area of interest. Genome, transcriptome and proteome databases are already available for a number of model organisms including vertebrates and invertebrates. However, there is insufficient information available for protein sequences of certain invertebrates, such as the great pond snail Lymnaea stagnalis, a model organism that has been used highly successfully in elucidating evolutionarily conserved mechanisms of memory function and dysfunction. Here, we used a bioinformatics approach to designing and benchmarking a comprehensive central nervous system (CNS) proteomics database (LymCNS-PDB) for the identification of proteins from the CNS of Lymnaea by LC-MS-based proteomics. LymCNS-PDB was created by using the Trinity TransDecoder bioinformatics tool to translate amino acid sequences from mRNA transcript assemblies obtained from a published Lymnaea transcriptomics database. The blast-style MMSeq2 software was used to match all translated sequences to UniProtKB sequences for molluscan proteins, including those from Lymnaea and other molluscs. LymCNS-PDB contains 9628 identified matched proteins that were benchmarked by performing LC-MS-based proteomics analysis with proteins isolated from the Lymnaea CNS. MS/MS analysis using the LymCNS-PDB database led to the identification of 3810 proteins. Only 982 proteins were identified by using a non-specific molluscan database. LymCNS-PDB provides a valuable tool that will enable us to perform quantitative proteomics analysis of protein interactomes involved in several CNS functions in Lymnaea, including learning and memory and age-related memory decline.


Asunto(s)
Biología Computacional , Lymnaea , Animales , Benchmarking , Sistema Nervioso Central , Cromatografía Liquida , Lymnaea/genética , Proteínas/metabolismo , Espectrometría de Masas en Tándem
3.
PLoS Comput Biol ; 15(4): e1006888, 2019 04.
Artículo en Inglés | MEDLINE | ID: mdl-30995217

RESUMEN

In response to a need for improved treatments, a number of promising novel targeted cancer therapies are being developed that exploit human synthetic lethal interactions. This is facilitating personalised medicine strategies in cancers where specific tumour suppressors have become inactivated. Mainly due to the constraints of the experimental procedures, relatively few human synthetic lethal interactions have been identified. Here we describe SLant (Synthetic Lethal analysis via Network topology), a computational systems approach to predicting human synthetic lethal interactions that works by identifying and exploiting conserved patterns in protein interaction network topology both within and across species. SLant out-performs previous attempts to classify human SSL interactions and experimental validation of the models predictions suggests it may provide useful guidance for future SSL screenings and ultimately aid targeted cancer therapy development.


Asunto(s)
Mapas de Interacción de Proteínas/genética , Mutaciones Letales Sintéticas , Algoritmos , Animales , Inteligencia Artificial , Biología Computacional , Descubrimiento de Drogas , Ontología de Genes , Genes Esenciales , Humanos , Modelos Biológicos , Terapia Molecular Dirigida , Familia de Multigenes , Neoplasias/genética , Neoplasias/metabolismo , Neoplasias/terapia , Mapeo de Interacción de Proteínas/estadística & datos numéricos , Mapas de Interacción de Proteínas/efectos de los fármacos , Biología Sintética , Mutaciones Letales Sintéticas/genética , Proteínas Supresoras de Tumor/genética , Proteínas Supresoras de Tumor/metabolismo
4.
Int J Mol Sci ; 20(22)2019 Nov 16.
Artículo en Inglés | MEDLINE | ID: mdl-31744086

RESUMEN

Using pan-cancer data from The Cancer Genome Atlas (TCGA), we investigated how patterns in copy number alterations in cancer cells vary both by tissue type and as a function of genetic alteration. We find that patterns in both chromosomal ploidy and individual arm copy number are dependent on tumour type. We highlight for example, the significant losses in chromosome arm 3p and the gain of ploidy in 5q in kidney clear cell renal cell carcinoma tissue samples. We find that specific gene mutations are associated with genome-wide copy number changes. Using signatures derived from non-negative factorisation, we also find gene mutations that are associated with particular patterns of ploidy change. Finally, utilising a set of machine learning classifiers, we successfully predicted the presence of mutated genes in a sample using arm-wise copy number patterns as features. This demonstrates that mutations in specific genes are correlated and may lead to specific patterns of ploidy loss and gain across chromosome arms. Using these same classifiers, we highlight which arms are most predictive of commonly mutated genes in kidney renal clear cell carcinoma (KIRC).


Asunto(s)
Carcinoma de Células Renales/patología , Variaciones en el Número de Copia de ADN/genética , Neoplasias Renales/patología , Área Bajo la Curva , Carcinoma de Células Renales/genética , Cromosomas/genética , Humanos , Neoplasias Renales/genética , Aprendizaje Automático , Mutación , Ploidias , Curva ROC , Proteína p53 Supresora de Tumor/genética , Proteína Supresora de Tumores del Síndrome de Von Hippel-Lindau/genética
5.
Biochem Soc Trans ; 44(3): 925-31, 2016 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-27284061

RESUMEN

All cancers depend upon mutations in critical genes, which confer a selective advantage to the tumour cell. Knowledge of these mutations is crucial to understanding the biology of cancer initiation and progression, and to the development of targeted therapeutic strategies. The key to understanding the contribution of a disease-associated mutation to the development and progression of cancer, comes from an understanding of the consequences of that mutation on the function of the affected protein, and the impact on the pathways in which that protein is involved. In this paper we examine the mutation patterns observed in oncogenes and tumour suppressors, and discuss different approaches that have been developed to identify driver mutations within cancers that contribute to the disease progress. We also discuss the MOKCa database where we have developed an automatic pipeline that structurally and functionally annotates all proteins from the human proteome that are mutated in cancer.


Asunto(s)
Carcinogénesis/genética , Genes Supresores de Tumor/ética , Mutación , Oncogenes/genética , Humanos
6.
FEBS Lett ; 598(16): 2028-2039, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-38977941

RESUMEN

Mutually exclusive loss-of-function alterations in gene pairs are those that occur together less frequently than may be expected and may denote a synthetically lethal relationship (SSL) between the genes. SSLs can be exploited therapeutically to selectively kill cancer cells. Here, we analysed mutation, copy number variation, and methylation levels in samples from The Cancer Genome Atlas, using the hypergeometric and the Poisson binomial tests to identify mutually exclusive inactivated genes. We focused on gene pairs where one is an inactivated tumour suppressor and the other a gene whose protein product can be inhibited by known drugs. This provided an abundance of potential targeted therapeutics and repositioning opportunities for several cancers. These data are available on the MexDrugs website, https://bioinformaticslab.sussex.ac.uk/mexdrugs.


Asunto(s)
Variaciones en el Número de Copia de ADN , Neoplasias , Humanos , Neoplasias/genética , Metilación de ADN , Mutación , Mutaciones Letales Sintéticas/genética , Genes Supresores de Tumor
7.
Neurooncol Adv ; 6(1): vdae066, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38770219

RESUMEN

Brain metastases remain a challenging and feared complication for patients with cancer and research in this area has lagged behind research into metastases to other organs. Due to their location and the risks associated with neurosurgical biopsies, the biology underpinning brain metastases response to treatment and evolution over time remains poorly understood. Liquid biopsies are proposed to overcome many of the limitations present with tissue biopsies, providing a better representation of tumor heterogeneity, facilitating repeated sampling, and providing a noninvasive assessment of tumor biology. Several different liquid biopsy approaches have been investigated including circulating tumor cells, circulating tumor DNA, extracellular vesicles, and tumor-educated platelets; however, these have generally been less effective in assessing brain metastases compared to metastases to other organs requiring improved techniques to investigate these approaches, studies combining different liquid biopsy approaches and/or novel liquid biopsy approaches. Through this review, we highlight the current state of the art and define key unanswered questions related to brain metastases liquid biopsies.

8.
Adv Sci (Weinh) ; 11(15): e2306027, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38353396

RESUMEN

Temozolomide (TMZ) represents the cornerstone of therapy for glioblastoma (GBM). However, acquisition of resistance limits its therapeutic potential. The human kinome is an undisputable source of druggable targets, still, current knowledge remains confined to a limited fraction of it, with a multitude of under-investigated proteins yet to be characterized. Here, following a kinome-wide RNAi screen, pantothenate kinase 4 (PANK4) isuncovered as a modulator of TMZ resistance in GBM. Validation of PANK4 across various TMZ-resistant GBM cell models, patient-derived GBM cell lines, tissue samples, as well as in vivo studies, corroborates the potential translational significance of these findings. Moreover, PANK4 expression is induced during TMZ treatment, and its expression is associated with a worse clinical outcome. Furthermore, a Tandem Mass Tag (TMT)-based quantitative proteomic approach, reveals that PANK4 abrogation leads to a significant downregulation of a host of proteins with central roles in cellular detoxification and cellular response to oxidative stress. More specifically, as cells undergo genotoxic stress during TMZ exposure, PANK4 depletion represents a crucial event that can lead to accumulation of intracellular reactive oxygen species (ROS) and subsequent cell death. Collectively, a previously unreported role for PANK4 in mediating therapeutic resistance to TMZ in GBM is unveiled.


Asunto(s)
Neoplasias Encefálicas , Glioblastoma , Humanos , Temozolomida/farmacología , Temozolomida/uso terapéutico , Glioblastoma/tratamiento farmacológico , Glioblastoma/metabolismo , Proteómica , Antineoplásicos Alquilantes/farmacología , Antineoplásicos Alquilantes/uso terapéutico , Resistencia a Antineoplásicos , Neoplasias Encefálicas/tratamiento farmacológico , Neoplasias Encefálicas/metabolismo , Línea Celular Tumoral
9.
Bioinform Adv ; 2(1): vbac084, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36699394

RESUMEN

Motivation: Protein-protein interaction (PPI) networks have been shown to successfully predict essential proteins. However, such networks are derived generically from experiments on many thousands of different cells. Consequently, conventional PPI networks cannot capture the variation of genetic dependencies that exists across different cell types, let alone those that emerge as a result of the massive cell restructuring that occurs during carcinogenesis. Predicting cell-specific dependencies is of considerable therapeutic benefit, facilitating the use of drugs to inhibit those proteins on which the cancer cells have become specifically dependent. In order to go beyond the limitations of the generic PPI, we have attempted to personalise PPI networks to reflect cell-specific patterns of gene expression and mutation. By using 12 topological features of the resulting PPIs, together with matched gene dependency data from DepMap, we trained random-forest classifiers (DependANT) to predict novel gene dependencies. Results: We found that DependANT improves the power of the baseline generic PPI models in predicting common gene dependencies, by up to 10.8% and is more sensitive than the baseline generic model when predicting genes on which only a small number of cell types are dependent. Availability and implementation: Software available at https://bitbucket.org/bioinformatics_lab_sussex/dependant2. Supplementary information: Supplementary data are available at Bioinformatics Advances online.

10.
Nat Commun ; 13(1): 1731, 2022 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-35365638

RESUMEN

Aneuploidy results in decreased cellular fitness in many species and model systems. However, aneuploidy is commonly found in cancer cells and often correlates with aggressive growth, suggesting that the impact of aneuploidy on cellular fitness is context dependent. The BRG1 (SMARCA4) subunit of the SWI/SNF chromatin remodelling complex is frequently lost in cancer. Here, we use a chromosomally stable cell line to test the effect of BRG1 loss on the evolution of aneuploidy. BRG1 deletion leads to an initial loss of fitness in this cell line that improves over time. Notably, we find increased tolerance to aneuploidy immediately upon loss of BRG1, and the fitness recovery over time correlates with chromosome gain. These data show that BRG1 loss creates an environment where karyotype changes can be explored without a fitness penalty. At least in some genetic backgrounds, therefore, BRG1 loss can affect the progression of tumourigenesis through tolerance of aneuploidy.


Asunto(s)
Aneuploidia , Ensamble y Desensamble de Cromatina , Línea Celular , Aberraciones Cromosómicas , Cromosomas , ADN Helicasas/genética , Humanos , Proteínas Nucleares/genética , Factores de Transcripción/genética
11.
Nucleic Acids Res ; 37(Database issue): D824-31, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18986996

RESUMEN

Members of the protein kinase family are amongst the most commonly mutated genes in human cancer, and both mutated and activated protein kinases have proved to be tractable targets for the development of new anticancer therapies The MoKCa database (Mutations of Kinases in Cancer, http://strubiol.icr.ac.uk/extra/mokca) has been developed to structurally and functionally annotate, and where possible predict, the phenotypic consequences of mutations in protein kinases implicated in cancer. Somatic mutation data from tumours and tumour cell lines have been mapped onto the crystal structures of the affected protein domains. Positions of the mutated amino-acids are highlighted on a sequence-based domain pictogram, as well as a 3D-image of the protein structure, and in a molecular graphics package, integrated for interactive viewing. The data associated with each mutation is presented in the Web interface, along with expert annotation of the detailed molecular functional implications of the mutation. Proteins are linked to functional annotation resources and are annotated with structural and functional features such as domains and phosphorylation sites. MoKCa aims to provide assessments available from multiple sources and algorithms for each potential cancer-associated mutation, and present these together in a consistent and coherent fashion to facilitate authoritative annotation by cancer biologists and structural biologists, directly involved in the generation and analysis of new mutational data.


Asunto(s)
Bases de Datos de Proteínas , Mutación , Neoplasias/genética , Proteínas Quinasas/genética , Humanos , Neoplasias/enzimología , Mapeo de Interacción de Proteínas , Proteínas Quinasas/química , Proteínas Quinasas/metabolismo , Estructura Terciaria de Proteína , Interfaz Usuario-Computador
12.
Elife ; 102021 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-33647232

RESUMEN

BLM (Bloom syndrome protein) is a RECQ-family helicase involved in the dissolution of complex DNA structures and repair intermediates. Synthetic lethality analysis implicates BLM as a promising target in a range of cancers with defects in the DNA damage response; however, selective small molecule inhibitors of defined mechanism are currently lacking. Here, we identify and characterise a specific inhibitor of BLM's ATPase-coupled DNA helicase activity, by allosteric trapping of a DNA-bound translocation intermediate. Crystallographic structures of BLM-DNA-ADP-inhibitor complexes identify a hitherto unknown interdomain interface, whose opening and closing are integral to translocation of ssDNA, and which provides a highly selective pocket for drug discovery. Comparison with structures of other RECQ helicases provides a model for branch migration of Holliday junctions by BLM.


Asunto(s)
RecQ Helicasas/antagonistas & inhibidores , Bibliotecas de Moléculas Pequeñas/química , Bibliotecas de Moléculas Pequeñas/farmacología , ADN/metabolismo , ADN Cruciforme , ADN de Cadena Simple , Descubrimiento de Drogas , Inhibidores Enzimáticos/química , Inhibidores Enzimáticos/farmacología , Escherichia coli , Ensayos Analíticos de Alto Rendimiento , Humanos , RecQ Helicasas/metabolismo
13.
J Comput Biol ; 27(5): 786-795, 2020 05.
Artículo en Inglés | MEDLINE | ID: mdl-31460787

RESUMEN

Inframe insertion and deletion mutations (indels) are commonly observed in cancer samples accounting for over 1% of all reported mutations. Few somatic inframe indels have been clinically documented as pathogenic and at present there are few tools to predict which indels drive cancer development. However, indels are a common feature of hereditary disease and several tools have been developed to predict the impact of inframe indels on protein function. In this study, we test whether six of the popular prediction tools can be adapted to test for cancer driver mutations and then develop a new algorithm (IndelRF) that discriminates between recurrent indels in known cancer genes and indels not associated with disease. IndelRF was developed to try and identify somatic, driver, and inframe indel mutations. Using a random forest classifier with 11 features, IndelRF achieved accuracies of 0.995 and 0.968 for insertion and deletion mutations, respectively. Finally, we use IndelRF to classify the inframe indel cancer mutations in the MOKCa database.


Asunto(s)
Biología Computacional/métodos , Mutación INDEL/genética , Proteínas de Neoplasias/genética , Neoplasias/genética , Algoritmos , Bases de Datos Genéticas , Genoma Humano/genética , Humanos , Neoplasias/patología , Oncogenes/genética
14.
J Clin Invest ; 130(6): 3188-3204, 2020 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-32125284

RESUMEN

As there is growing evidence for the tumor microenvironment's role in tumorigenesis, we investigated the role of fibroblast-expressed kinases in triple-negative breast cancer (TNBC). Using a high-throughput kinome screen combined with 3D invasion assays, we identified fibroblast-expressed PIK3Cδ (f-PIK3Cδ) as a key regulator of cancer progression. Although PIK3Cδ was expressed in primary fibroblasts derived from TNBC patients, it was barely detectable in breast cancer (BC) cell lines. Genetic and pharmacological gain- and loss-of-function experiments verified the contribution of f-PIK3Cδ in TNBC cell invasion. Integrated secretomics and transcriptomics analyses revealed a paracrine mechanism via which f-PIK3Cδ confers its protumorigenic effects. Inhibition of f-PIK3Cδ promoted the secretion of factors, including PLGF and BDNF, that led to upregulation of NR4A1 in TNBC cells, where it acts as a tumor suppressor. Inhibition of PIK3Cδ in an orthotopic BC mouse model reduced tumor growth only after inoculation with fibroblasts, indicating a role of f-PIK3Cδ in cancer progression. Similar results were observed in the MMTV-PyMT transgenic BC mouse model, along with a decrease in tumor metastasis, emphasizing the potential immune-independent effects of PIK3Cδ inhibition. Finally, analysis of BC patient cohorts and TCGA data sets identified f-PIK3Cδ (protein and mRNA levels) as an independent prognostic factor for overall and disease-free survival, highlighting it as a therapeutic target for TNBC.


Asunto(s)
Fosfatidilinositol 3-Quinasa Clase I/biosíntesis , Fibroblastos/enzimología , Regulación Enzimológica de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Proteínas de Neoplasias/biosíntesis , Neoplasias de la Mama Triple Negativas/enzimología , Animales , Fosfatidilinositol 3-Quinasa Clase I/genética , Femenino , Fibroblastos/patología , Xenoinjertos , Humanos , Ratones , Ratones Transgénicos , Invasividad Neoplásica , Metástasis de la Neoplasia , Proteínas de Neoplasias/genética , Trasplante de Neoplasias , Neoplasias de la Mama Triple Negativas/genética , Neoplasias de la Mama Triple Negativas/patología
15.
Nucleic Acids Res ; 35(Database issue): D291-7, 2007 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-17135200

RESUMEN

We report the latest release (version 3.0) of the CATH protein domain database (http://www.cathdb.info). There has been a 20% increase in the number of structural domains classified in CATH, up to 86 151 domains. Release 3.0 comprises 1110 fold groups and 2147 homologous superfamilies. To cope with the increases in diverse structural homologues being determined by the structural genomics initiatives, more sensitive methods have been developed for identifying boundaries in multi-domain proteins and for recognising homologues. The CATH classification update is now being driven by an integrated pipeline that links these automated procedures with validation steps, that have been made easier by the provision of information rich web pages summarising comparison scores and relevant links to external sites for each domain being classified. An analysis of the population of domains in the CATH hierarchy and several domain characteristics are presented for version 3.0. We also report an update of the CATH Dictionary of homologous structures (CATH-DHS) which now contains multiple structural alignments, consensus information and functional annotations for 1459 well populated superfamilies in CATH. CATH is directly linked to the Gene3D database which is a projection of CATH structural data onto approximately 2 million sequences in completed genomes and UniProt.


Asunto(s)
Bases de Datos de Proteínas , Estructura Terciaria de Proteína , Clasificación/métodos , Evolución Molecular , Internet , Pliegue de Proteína , Estructura Terciaria de Proteína/genética , Proteínas/clasificación , Homología de Secuencia de Aminoácido , Homología Estructural de Proteína , Interfaz Usuario-Computador
16.
Commun Biol ; 2: 315, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-31453379

RESUMEN

Glioblastoma (GBM) is one of the most aggressive solid tumors for which treatment options and biomarkers are limited. Small extracellular vesicles (sEVs) produced by both GBM and stromal cells are central in the inter-cellular communication that is taking place in the tumor bulk. As tumor sEVs are accessible in biofluids, recent reports have suggested that sEVs contain valuable biomarkers for GBM patient diagnosis and follow-up. The aim of the current study was to describe the protein content of sEVs produced by different GBM cell lines and patient-derived stem cells. Our results reveal that the content of the sEVs mirrors the phenotypic signature of the respective GBM cells, leading to the description of potential informative sEV-associated biomarkers for GBM subtyping, such as CD44. Overall, these data could assist future GBM in vitro studies and provide insights for the development of new diagnostic and therapeutic methods as well as personalized treatment strategies.


Asunto(s)
Biomarcadores de Tumor/metabolismo , Neoplasias Encefálicas/clasificación , Neoplasias Encefálicas/metabolismo , Vesículas Extracelulares/metabolismo , Glioblastoma/clasificación , Glioblastoma/metabolismo , Astrocitos/metabolismo , Astrocitos/patología , Biomarcadores de Tumor/genética , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Línea Celular Tumoral , Vesículas Extracelulares/ultraestructura , Regulación Neoplásica de la Expresión Génica , Glioblastoma/genética , Glioblastoma/patología , Humanos , Invasividad Neoplásica
17.
PLoS Comput Biol ; 3(11): e232, 2007 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-18052539

RESUMEN

We present CATHEDRAL, an iterative protocol for determining the location of previously observed protein folds in novel multidomain protein structures. CATHEDRAL builds on the features of a fast secondary-structure-based method (using graph theory) to locate known folds within a multidomain context and a residue-based, double-dynamic programming algorithm, which is used to align members of the target fold groups against the query protein structure to identify the closest relative and assign domain boundaries. To increase the fidelity of the assignments, a support vector machine is used to provide an optimal scoring scheme. Once a domain is verified, it is excised, and the search protocol is repeated in an iterative fashion until all recognisable domains have been identified. We have performed an initial benchmark of CATHEDRAL against other publicly available structure comparison methods using a consensus dataset of domains derived from the CATH and SCOP domain classifications. CATHEDRAL shows superior performance in fold recognition and alignment accuracy when compared with many equivalent methods. If a novel multidomain structure contains a known fold, CATHEDRAL will locate it in 90% of cases, with <1% false positives. For nearly 80% of assigned domains in a manually validated test set, the boundaries were correctly delineated within a tolerance of ten residues. For the remaining cases, previously classified domains were very remotely related to the query chain so that embellishments to the core of the fold caused significant differences in domain sizes and manual refinement of the boundaries was necessary. To put this performance in context, a well-established sequence method based on hidden Markov models was only able to detect 65% of domains, with 33% of the subsequent boundaries assigned within ten residues. Since, on average, 50% of newly determined protein structures contain more than one domain unit, and typically 90% or more of these domains are already classified in CATH, CATHEDRAL will considerably facilitate the automation of protein structure classification.


Asunto(s)
Algoritmos , Modelos Químicos , Modelos Moleculares , Proteínas/química , Proteínas/ultraestructura , Análisis de Secuencia de Proteína/métodos , Programas Informáticos , Secuencia de Aminoácidos , Simulación por Computador , Datos de Secuencia Molecular , Conformación Proteica , Pliegue de Proteína , Estructura Terciaria de Proteína , Alineación de Secuencia/métodos
18.
Nucleic Acids Res ; 33(Database issue): D247-51, 2005 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-15608188

RESUMEN

The CATH database of protein domain structures (http://www.biochem.ucl.ac.uk/bsm/cath/) currently contains 43,229 domains classified into 1467 superfamilies and 5107 sequence families. Each structural family is expanded with sequence relatives from GenBank and completed genomes, using a variety of efficient sequence search protocols and reliable thresholds. This extended CATH protein family database contains 616,470 domain sequences classified into 23,876 sequence families. This results in the significant expansion of the CATH HMM model library to include models built from the CATH sequence relatives, giving a 10% increase in coverage for detecting remote homologues. An improved Dictionary of Homologous superfamilies (DHS) (http://www.biochem.ucl.ac.uk/bsm/dhs/) containing specific sequence, structural and functional information for each superfamily in CATH considerably assists manual validation of homologues. Information on sequence relatives in CATH superfamilies, GenBank and completed genomes is presented in the CATH associated DHS and Gene3D resources. Domain partnership information can be obtained from Gene3D (http://www.biochem.ucl.ac.uk/bsm/cath/Gene3D/). A new CATH server has been implemented (http://www.biochem.ucl.ac.uk/cgi-bin/cath/CathServer.pl) providing automatic classification of newly determined sequences and structures using a suite of rapid sequence and structure comparison methods. The statistical significance of matches is assessed and links are provided to the putative superfamily or fold group to which the query sequence or structure is assigned.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Bases de Datos de Proteínas , Genómica , Estructura Terciaria de Proteína , Proteínas/clasificación , Análisis de Secuencia de Proteína , Bases de Datos de Proteínas/estadística & datos numéricos , Internet , Proteínas/genética , Homología de Secuencia de Aminoácido , Integración de Sistemas , Interfaz Usuario-Computador
19.
Oncotarget ; 8(13): 21290-21304, 2017 Mar 28.
Artículo en Inglés | MEDLINE | ID: mdl-28423505

RESUMEN

BACKGROUND: The key to interpreting the contribution of a disease-associated mutation in the development and progression of cancer is an understanding of the consequences of that mutation both on the function of the affected protein and on the pathways in which that protein is involved. Protein domains encapsulate function and position-specific domain based analysis of mutations have been shown to help elucidate their phenotypes. RESULTS: In this paper we examine the domain biases in oncogenes and tumour suppressors, and find that their domain compositions substantially differ. Using data from over 30 different cancers from whole-exome sequencing cancer genomic projects we mapped over one million mutations to their respective Pfam domains to identify which domains are enriched in any of three different classes of mutation; missense, indels or truncations. Next, we identified the mutational hotspots within domain families by mapping small mutations to equivalent positions in multiple sequence alignments of protein domainsWe find that gain of function mutations from oncogenes and loss of function mutations from tumour suppressors are normally found in different domain families and when observed in the same domain families, hotspot mutations are located at different positions within the multiple sequence alignment of the domain. CONCLUSIONS: By considering hotspots in tumour suppressors and oncogenes independently, we find that there are different specific positions within domain families that are particularly suited to accommodate either a loss or a gain of function mutation. The position is also dependent on the class of mutation.We find rare mutations co-located with well-known functional mutation hotspots, in members of homologous domain superfamilies, and we detect novel mutation hotspots in domain families previously unconnected with cancer. The results of this analysis can be accessed through the MOKCa database (http://strubiol.icr.ac.uk/extra/MOKCa).


Asunto(s)
Análisis Mutacional de ADN/métodos , Genes Supresores de Tumor , Mutación/genética , Neoplasias/genética , Oncogenes/genética , Biología Computacional/métodos , Humanos , Modelos Moleculares
20.
Expert Opin Drug Discov ; 12(6): 599-609, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28462602

RESUMEN

INTRODUCTION: The development of improved cancer therapies is frequently cited as an urgent unmet medical need. Recent advances in platform technologies and the increasing availability of biological 'big data' are providing an unparalleled opportunity to systematically identify the key genes and pathways involved in tumorigenesis. The discoveries made using these new technologies may lead to novel therapeutic interventions. Areas covered: The authors discuss the current approaches that use 'big data' to identify cancer drivers. These approaches include the analysis of genomic sequencing data, pathway data, multi-platform data, identifying genetic interactions such as synthetic lethality and using cell line data. They review how big data is being used to identify novel drug targets. The authors then provide an overview of the available data repositories and tools being used at the forefront of cancer drug discovery. Expert opinion: Targeted therapies based on the genomic events driving the tumour will eventually inform treatment protocols. However, using a tailored approach to treat all tumour patients may require developing a large repertoire of targeted drugs.


Asunto(s)
Antineoplásicos/farmacología , Descubrimiento de Drogas/métodos , Neoplasias/tratamiento farmacológico , Línea Celular , Diseño de Fármacos , Genómica/métodos , Humanos , Terapia Molecular Dirigida , Neoplasias/patología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA