Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Mol Cancer ; 22(1): 119, 2023 07 29.
Artículo en Inglés | MEDLINE | ID: mdl-37516825

RESUMEN

Newly growing evidence highlights the essential role that epitranscriptomic marks play in the development of many cancers; however, little is known about the role and implications of altered epitranscriptome deposition in prostate cancer. Here, we show that the transfer RNA N7-methylguanosine (m7G) transferase METTL1 is highly expressed in primary and advanced prostate tumours. Mechanistically, we find that METTL1 depletion causes the loss of m7G tRNA methylation and promotes the biogenesis of a novel class of small non-coding RNAs derived from 5'tRNA fragments. 5'tRNA-derived small RNAs steer translation control to favour the synthesis of key regulators of tumour growth suppression, interferon pathway, and immune effectors. Knockdown of Mettl1 in prostate cancer preclinical models increases intratumoural infiltration of pro-inflammatory immune cells and enhances responses to immunotherapy. Collectively, our findings reveal a therapeutically actionable role of METTL1-directed m7G tRNA methylation in cancer cell translation control and tumour biology.


Asunto(s)
Carcinogénesis , Neoplasias de la Próstata , Masculino , Humanos , Carcinogénesis/genética , Transformación Celular Neoplásica , Neoplasias de la Próstata/genética , Transcripción Genética , Procesamiento Postranscripcional del ARN , Metiltransferasas/genética
2.
Mol Oncol ; 16(21): 3881-3908, 2022 11.
Artículo en Inglés | MEDLINE | ID: mdl-35811332

RESUMEN

Tumour heterogeneity is one of the main characteristics of cancer and can be categorised into inter- or intratumour heterogeneity. This heterogeneity has been revealed as one of the key causes of treatment failure and relapse. Precision oncology is an emerging field that seeks to design tailored treatments for each cancer patient according to epidemiological, clinical and omics data. This discipline relies on bioinformatics tools designed to compute scores to prioritise available drugs, with the aim of helping clinicians in treatment selection. In this review, we describe the current approaches for therapy selection depending on which type of tumour heterogeneity is being targeted and the available next-generation sequencing data. We cover intertumour heterogeneity studies and individual treatment selection using genomics variants, expression data or multi-omics strategies. We also describe intratumour dissection through clonal inference and single-cell transcriptomics, in each case providing bioinformatics tools for tailored treatment selection. Finally, we discuss how these therapy selection workflows could be integrated into the clinical practice.


Asunto(s)
Neoplasias , Humanos , Neoplasias/patología , Biología Computacional , Medicina de Precisión , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento
3.
Bioinformatics ; 38(4): 1155-1156, 2022 01 27.
Artículo en Inglés | MEDLINE | ID: mdl-34788788

RESUMEN

SUMMARY: bollito is an automated, flexible and parallelizable computational pipeline for the comprehensive analysis of single-cell RNA-seq data. Starting from FASTQ files or preprocessed expression matrices, bollito performs both basic and advanced tasks in single-cell analysis integrating >30 state-of-the-art tools. This includes quality control, read alignment, dimensionality reduction, clustering, cell-marker detection, differential expression, functional analysis, trajectory inference and RNA velocity. bollito is built using the Snakemake workflow management system, which easily connects each execution step and facilitates the reproducibility of results. bollito's modular design makes it easy to incorporate other packages into the pipeline enabling its expansion with new functionalities. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://gitlab.com/bu_cnio/bollito under the MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Análisis de Expresión Génica de una Sola Célula , Programas Informáticos , Reproducibilidad de los Resultados , ARN , Flujo de Trabajo
4.
Nucleic Acids Res ; 50(D1): D54-D59, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34755885

RESUMEN

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms for a range of species. APPRIS selects principal isoforms based on protein structure and function features and on cross-species conservation. Most coding genes produce a single main protein isoform and the principal isoforms chosen by the APPRIS database best represent this main cellular isoform. Human genetic data, experimental protein evidence and the distribution of clinical variants all support the relevance of APPRIS principal isoforms. APPRIS annotations and principal isoforms have now been expanded to 10 model organisms. In this paper we highlight the most recent updates to the database. APPRIS annotations have been generated for two new species, cow and chicken, the protein structural information has been augmented with reliable models from the EMBL-EBI AlphaFold database, and we have substantially expanded the confirmatory proteomics evidence available for the human genome. The most significant change in APPRIS has been the implementation of TRIFID functional isoform scores. TRIFID functional scores are assigned to all splice isoforms, and APPRIS uses the TRIFID functional scores and proteomics evidence to determine principal isoforms when core methods cannot.


Asunto(s)
Bases de Datos de Proteínas , Isoformas de Proteínas/genética , Proteínas/genética , Proteómica , Animales , Bovinos , Pollos/genética , Humanos , Conformación Proteica , Isoformas de Proteínas/clasificación , Proteínas/química , Proteínas/clasificación
5.
Genome Med ; 13(1): 187, 2021 12 16.
Artículo en Inglés | MEDLINE | ID: mdl-34911571

RESUMEN

We present Beyondcell, a computational methodology for identifying tumour cell subpopulations with distinct drug responses in single-cell RNA-seq data and proposing cancer-specific treatments. Our method calculates an enrichment score in a collection of drug signatures, delineating therapeutic clusters (TCs) within cellular populations. Additionally, Beyondcell determines the therapeutic differences among cell populations and generates a prioritised sensitivity-based ranking in order to guide drug selection. We performed Beyondcell analysis in five single-cell datasets and demonstrated that TCs can be exploited to target malignant cells both in cancer cell lines and tumour patients. Beyondcell is available at: https://gitlab.com/bu_cnio/beyondcell .


Asunto(s)
Neoplasias , Análisis de la Célula Individual , Perfilación de la Expresión Génica/métodos , Humanos , Neoplasias/tratamiento farmacológico , Neoplasias/genética , RNA-Seq , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos
6.
NAR Genom Bioinform ; 3(2): lqab044, 2021 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-34046593

RESUMEN

Alternative splicing of messenger RNA can generate an array of mature transcripts, but it is not clear how many go on to produce functionally relevant protein isoforms. There is only limited evidence for alternative proteins in proteomics analyses and data from population genetic variation studies indicate that most alternative exons are evolving neutrally. Determining which transcripts produce biologically important isoforms is key to understanding isoform function and to interpreting the real impact of somatic mutations and germline variations. Here we have developed a method, TRIFID, to classify the functional importance of splice isoforms. TRIFID was trained on isoforms detected in large-scale proteomics analyses and distinguishes these biologically important splice isoforms with high confidence. Isoforms predicted as functionally important by the algorithm had measurable cross species conservation and significantly fewer broken functional domains. Additionally, exons that code for these functionally important protein isoforms are under purifying selection, while exons from low scoring transcripts largely appear to be evolving neutrally. TRIFID has been developed for the human genome, but it could in principle be applied to other well-annotated species. We believe that this method will generate valuable insights into the cellular importance of alternative splicing.

7.
Eur J Cancer ; 147: 74-83, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33626496

RESUMEN

BACKGROUND: Intraductal (IDC) and cribriform (CRIB) histologies in prostate cancer have been associated with germline BRCA2 (gBRCA2) mutations in small retrospective series, leading to the recommendation of genetic testing for patients with IDC in the primary tumour. PATIENTS AND METHODS: To examine the association of gBRCA2 mutations and other tumour molecular features with IDC and/or cribriform (CRIB) histologies, we conducted a case-control study in which primary prostate tumours from 58 gBRCA2 carriers were matched (1:2) by Gleason Grade Group and specimen type to 116 non-carriers. Presence/absence of IDC and CRIB morphologies was established by two expert uropathologists blinded to gBRCA2 status. Fluorescent in-situ hybridization (FISH) and next-generation sequencing (NGS) were used to detect BRCA2 alterations, PTEN deletions and TMPRSS2-ERG fusions. Chi-squared tests were used to compare the frequency of IDC and CRIB in gBRCA2 carriers and controls and to assess associations with other variables. Logistic regression models were constructed to identify independent factors associated with both histology patterns. RESULTS: No significant differences between gBRCA2 carriers and non-carriers were observed in the prevalence of IDC (36% gBRCA2 versus 50% non-carriers, p = 0.085) or CRIB (53% gBRCA2 versus 43% non-carriers p = 0.197) patterns. However, IDC histology was independently associated with bi-allelic BRCA2 alterations (OR 4.3, 95%CI 1.1-16.2) and PTEN homozygous loss (OR 5.2, 95%CI 2.1-13.1). CRIB morphology was also independently associated with bi-allelic BRCA2 alterations (OR 5.6, 95%CI 1.7-19.3). CONCLUSIONS: While we found no association between gBRCA2 mutations and IDC or CRIB histologies, bi-allelic BRCA2 loss in primary prostate tumours was significantly associated with both variant morphologies, independently of other clinical-pathologic factors.


Asunto(s)
Proteína BRCA2/genética , Biomarcadores de Tumor/genética , Mutación , Neoplasias de la Próstata/genética , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Análisis Mutacional de ADN , Eliminación de Gen , Predisposición Genética a la Enfermedad , Humanos , Masculino , Persona de Mediana Edad , Clasificación del Tumor , Fosfohidrolasa PTEN/genética , Fenotipo , Neoplasias de la Próstata/patología , Medición de Riesgo , Factores de Riesgo , España
8.
Nucleic Acids Res ; 49(D1): D916-D923, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33270111

RESUMEN

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


Asunto(s)
COVID-19/prevención & control , Biología Computacional/métodos , Bases de Datos Genéticas , Genómica/métodos , Anotación de Secuencia Molecular/métodos , SARS-CoV-2/genética , Animales , COVID-19/epidemiología , COVID-19/virología , Epidemias , Humanos , Internet , Ratones , Seudogenes/genética , ARN Largo no Codificante/genética , SARS-CoV-2/metabolismo , SARS-CoV-2/fisiología , Transcripción Genética/genética
10.
PLoS Comput Biol ; 16(10): e1008287, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-33017396

RESUMEN

The role of alternative splicing is one of the great unanswered questions in cellular biology. There is strong evidence for alternative splicing at the transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested that alternative splicing evolved in order to remodel tissue-specific protein-protein networks. Here we investigated the evidence for tissue-specific splicing among splice isoforms detected in a large-scale proteomics analysis. Although the data supporting alternative splicing is limited at the protein level, clear patterns emerged among the small numbers of alternative splice events that we could detect in the proteomics data. More than a third of these splice events were tissue-specific and most were ancient: over 95% of splice events that were tissue-specific in both proteomics and RNAseq analyses evolved prior to the ancestors of lobe-finned fish, at least 400 million years ago. By way of contrast, three in four alternative exons in the human gene set arose in the primate lineage, so our results cannot be extrapolated to the whole genome. Tissue-specific alternative protein forms in the proteomics analysis were particularly abundant in nervous and muscle tissues and their genes had roles related to the cytoskeleton and either the structure of muscle fibres or cell-cell connections. Our results suggest that this conserved tissue-specific alternative splicing may have played a role in the development of the vertebrate brain and heart.


Asunto(s)
Empalme Alternativo/genética , Especificidad de Órganos/genética , Isoformas de Proteínas , Animales , Biología Computacional , Genoma/genética , Humanos , Isoformas de Proteínas/química , Isoformas de Proteínas/clasificación , Isoformas de Proteínas/genética , Proteómica
11.
Cancer Cell ; 37(3): 340-353.e6, 2020 03 16.
Artículo en Inglés | MEDLINE | ID: mdl-32109375

RESUMEN

Inhibition of the cell-cycle kinases CDK4 and CDK6 is now part of the standard treatment in advanced breast cancer. CDK4/6 inhibitors, however, are not expected to cooperate with DNA-damaging or antimitotic chemotherapies as the former prevent cell-cycle entry, thus interfering with S-phase- or mitosis-targeting agents. Here, we report that sequential administration of CDK4/6 inhibitors after taxanes cooperates to prevent cellular proliferation in pancreatic ductal adenocarcinoma (PDAC) cells, patient-derived xenografts, and genetically engineered mice with Kras G12V and Cdkn2a-null mutations frequently observed in PDAC. This effect correlates with the repressive activity of CDK4/6 inhibitors on homologous recombination proteins required for the recovery from chromosomal damage. CDK4/6 inhibitors also prevent recovery from multiple DNA-damaging agents, suggesting broad applicability for their sequential administration after available chemotherapeutic agents.


Asunto(s)
Protocolos de Quimioterapia Combinada Antineoplásica/uso terapéutico , Carcinoma Ductal Pancreático/tratamiento farmacológico , Quinasa 4 Dependiente de la Ciclina/antagonistas & inhibidores , Quinasa 6 Dependiente de la Ciclina/antagonistas & inhibidores , Neoplasias Pancreáticas/tratamiento farmacológico , Inhibidores de Proteínas Quinasas/administración & dosificación , Albúminas/administración & dosificación , Animales , Protocolos de Quimioterapia Combinada Antineoplásica/administración & dosificación , Carcinoma Ductal Pancreático/patología , Línea Celular Tumoral , Inhibidor p16 de la Quinasa Dependiente de Ciclina/genética , Reparación del ADN/efectos de los fármacos , Esquema de Medicación , Recombinación Homóloga/efectos de los fármacos , Humanos , Ratones Desnudos , Ratones Transgénicos , Mutación , Neoplasias Experimentales/tratamiento farmacológico , Neoplasias Experimentales/genética , Paclitaxel/administración & dosificación , Neoplasias Pancreáticas/patología , Piperazinas/administración & dosificación , Inhibidores de Proteínas Quinasas/uso terapéutico , Proteínas Proto-Oncogénicas p21(ras)/genética , Piridinas/administración & dosificación , Ensayos Antitumor por Modelo de Xenoinjerto
12.
Bioinformatics ; 35(22): 4846-4848, 2019 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-31173067

RESUMEN

MOTIVATION: Genetic alterations lead to tumor progression and cell survival but also uncover cancer-specific vulnerabilities on gene dependencies that can be therapeutically exploited. RESULTS: vulcanSpot is a novel computational approach implemented to expand the therapeutic options in cancer beyond known-driver genes unlocking alternative ways to target undruggable genes. The method integrates genome-wide information provided by massive screening experiments to detect genetic vulnerabilities associated to tumors. Then, vulcanSpot prioritizes drugs to target cancer-specific gene dependencies using a weighted scoring system based on well known drug-gene relationships and drug repositioning strategies. AVAILABILITY AND IMPLEMENTATION: http://www.vulcanspot.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Neoplasias , Biología Computacional , Reposicionamiento de Medicamentos , Humanos , Mutación , Programas Informáticos
13.
Nucleic Acids Res ; 47(D1): D766-D773, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30357393

RESUMEN

The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano/genética , Genómica , Seudogenes/genética , Animales , Biología Computacional , Humanos , Internet , Ratones , Anotación de Secuencia Molecular , Programas Informáticos
14.
Nat Struct Mol Biol ; 25(9): 778-786, 2018 09.
Artículo en Inglés | MEDLINE | ID: mdl-30104661

RESUMEN

RNA viruses are a major threat to animals and plants. RNA interference (RNAi) and the interferon response provide innate antiviral defense against RNA viruses. Here, we performed a large-scale screen using Caenorhabditis elegans and its natural pathogen the Orsay virus (OrV), and we identified cde-1 as important for antiviral defense. CDE-1 is a homolog of the mammalian TUT4 and TUT7 terminal uridylyltransferases (collectively called TUT4(7)); its catalytic activity is required for its antiviral function. CDE-1 uridylates the 3' end of the OrV RNA genome and promotes its degradation in a manner independent of the RNAi pathway. Likewise, TUT4(7) enzymes uridylate influenza A virus (IAV) mRNAs in mammalian cells. Deletion of TUT4(7) leads to increased IAV mRNA and protein levels. Collectively, these data implicate 3'-terminal uridylation of viral RNAs as a conserved antiviral defense mechanism.


Asunto(s)
Caenorhabditis elegans/enzimología , Caenorhabditis elegans/virología , Inmunidad Innata , ARN Nucleotidiltransferasas/metabolismo , Virus ARN/metabolismo , Células A549 , Animales , Caenorhabditis elegans/genética , Humanos , Interferencia de ARN , Virus ARN/inmunología , Virus ARN/fisiología , ARN Viral/metabolismo , Transcriptoma , Replicación Viral
15.
Nucleic Acids Res ; 46(D1): D213-D217, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29069475

RESUMEN

The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the 'principal' isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.


Asunto(s)
Bases de Datos Genéticas , Isoformas de Proteínas/genética , Empalme Alternativo , Secuencia de Aminoácidos , Animales , Humanos , Modelos Moleculares , Anotación de Secuencia Molecular , Conformación Proteica , Isoformas de Proteínas/química , Proteoma/genética , Reproducibilidad de los Resultados , Alineación de Secuencia
16.
Dev Cell ; 42(3): 241-255.e6, 2017 08 07.
Artículo en Inglés | MEDLINE | ID: mdl-28787591

RESUMEN

Small RNAs play a crucial role in genome defense against transposable elements and guide Argonaute proteins to nascent RNA transcripts to induce co-transcriptional gene silencing. However, the molecular basis of this process remains unknown. Here, we identify the conserved RNA helicase Aquarius/EMB-4 as a direct and essential link between small RNA pathways and the transcriptional machinery in Caenorhabditis elegans. Aquarius physically interacts with the germline Argonaute HRDE-1. Aquarius is required to initiate small-RNA-induced heritable gene silencing. HRDE-1 and Aquarius silence overlapping sets of genes and transposable elements. Surprisingly, removal of introns from a target gene abolishes the requirement for Aquarius, but not HRDE-1, for small RNA-dependent gene silencing. We conclude that Aquarius allows small RNA pathways to compete for access to nascent transcripts undergoing co-transcriptional splicing in order to detect and silence transposable elements. Thus, Aquarius and HRDE-1 act as gatekeepers coordinating gene expression and genome defense.


Asunto(s)
Proteínas Argonautas/genética , Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , Proteínas Nucleares/genética , Interferencia de ARN , Animales , Proteínas Argonautas/metabolismo , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Elementos Transponibles de ADN , Intrones , Proteínas Nucleares/metabolismo , Unión Proteica
17.
Chembiochem ; 18(11): 979-984, 2017 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-28449301

RESUMEN

More than a hundred distinct modified nucleosides have been identified in RNA, but little is known about their distribution across different organisms, their dynamic nature and their response to cellular and environmental stress. Mass-spectrometry-based methods have been at the forefront of identifying and quantifying modified nucleosides. However, they often require synthetic reference standards, which do not exist in the case of many modified nucleosides, and this therefore impedes their analysis. Here we use a metabolic labelling approach to achieve rapid generation of bio-isotopologues of the complete Caenorhabditis elegans transcriptome and its modifications and use them as reference standards to characterise the RNA modification profile in this multicellular organism through an untargeted liquid-chromatography tandem high-resolution mass spectrometry (LC-HRMS) approach. We furthermore show that several of these RNA modifications have a dynamic response to environmental stress and that, in particular, changes in the tRNA wobble base modification 5-methoxycarbonylmethyl-2-thiouridine (mcm5 s2 U) lead to codon-biased gene-expression changes in starved animals.


Asunto(s)
Procesamiento Postranscripcional del ARN , Estrés Fisiológico/genética , Transcriptoma , Animales , Caenorhabditis elegans , Cromatografía Liquida , Marcaje Isotópico , Espectrometría de Masas en Tándem , Tiouridina/análogos & derivados , Tiouridina/metabolismo
18.
BMC Bioinformatics ; 16 Suppl 2: A1-10, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-25708534

RESUMEN

This report summarizes the scientific content and activities of the annual symposium organized by the Student Council of the International Society for Computational Biology (ISCB), held in conjunction with the Intelligent Systems for Molecular Biology (ISMB) conference in Boston, USA, on July 11th, 2014.


Asunto(s)
Biología Computacional , Resistencia a Múltiples Medicamentos , Secuenciación de Nucleótidos de Alto Rendimiento , Repeticiones de Microsatélite/genética , Revisión de la Investigación por Pares , Edición , ARN Mensajero/metabolismo , Análisis de Secuencia de ADN
19.
Nucleic Acids Res ; 43(Database issue): D315-20, 2015 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-25361972

RESUMEN

MobiDB (http://mobidb.bio.unipd.it/) is a database of intrinsically disordered and mobile proteins. Intrinsically disordered regions are key for the function of numerous proteins. Here we provide a new version of MobiDB, a centralized source aimed at providing the most complete picture on different flavors of disorder in protein structures covering all UniProt sequences (currently over 80 million). The database features three levels of annotation: manually curated, indirect and predicted. Manually curated data is extracted from the DisProt database. Indirect data is inferred from PDB structures that are considered an indication of intrinsic disorder. The 10 predictors currently included (three ESpritz flavors, two IUPred flavors, two DisEMBL flavors, GlobPlot, VSL2b and JRONN) enable MobiDB to provide disorder annotations for every protein in absence of more reliable data. The new version also features a consensus annotation and classification for long disordered regions. In order to complement the disorder annotations, MobiDB features additional annotations from external sources. Annotations from the UniProt database include post-translational modifications and linear motifs. Pfam annotations are displayed in graphical form and are link-enabled, allowing the user to visit the corresponding Pfam page for further information. Experimental protein-protein interactions from STRING are also classified for disorder content.


Asunto(s)
Bases de Datos de Proteínas , Proteínas Intrínsecamente Desordenadas/química , Curaduría de Datos , Conformación Proteica
20.
Bioinformatics ; 31(2): 201-8, 2015 Jan 15.
Artículo en Inglés | MEDLINE | ID: mdl-25246432

RESUMEN

MOTIVATION: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures. RESULTS: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed. AVAILABILITY: The datasets are available from Web site at URL: http://mobidb.bio.unipd.it/lsd. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Proteínas/química , Análisis de Secuencia de Proteína/métodos , Proteína p53 Supresora de Tumor/química , Cristalografía por Rayos X , Bases de Datos de Proteínas , Humanos , Anotación de Secuencia Molecular , Estructura Terciaria de Proteína
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...