Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 30
Filtrar
1.
Mol Cancer ; 22(1): 119, 2023 07 29.
Artigo em Inglês | MEDLINE | ID: mdl-37516825

RESUMO

Newly growing evidence highlights the essential role that epitranscriptomic marks play in the development of many cancers; however, little is known about the role and implications of altered epitranscriptome deposition in prostate cancer. Here, we show that the transfer RNA N7-methylguanosine (m7G) transferase METTL1 is highly expressed in primary and advanced prostate tumours. Mechanistically, we find that METTL1 depletion causes the loss of m7G tRNA methylation and promotes the biogenesis of a novel class of small non-coding RNAs derived from 5'tRNA fragments. 5'tRNA-derived small RNAs steer translation control to favour the synthesis of key regulators of tumour growth suppression, interferon pathway, and immune effectors. Knockdown of Mettl1 in prostate cancer preclinical models increases intratumoural infiltration of pro-inflammatory immune cells and enhances responses to immunotherapy. Collectively, our findings reveal a therapeutically actionable role of METTL1-directed m7G tRNA methylation in cancer cell translation control and tumour biology.


Assuntos
Carcinogênese , Neoplasias da Próstata , Masculino , Humanos , Carcinogênese/genética , Transformação Celular Neoplásica , Neoplasias da Próstata/genética , Transcrição Gênica , Processamento Pós-Transcricional do RNA , Metiltransferases/genética
2.
Mol Oncol ; 16(21): 3881-3908, 2022 11.
Artigo em Inglês | MEDLINE | ID: mdl-35811332

RESUMO

Tumour heterogeneity is one of the main characteristics of cancer and can be categorised into inter- or intratumour heterogeneity. This heterogeneity has been revealed as one of the key causes of treatment failure and relapse. Precision oncology is an emerging field that seeks to design tailored treatments for each cancer patient according to epidemiological, clinical and omics data. This discipline relies on bioinformatics tools designed to compute scores to prioritise available drugs, with the aim of helping clinicians in treatment selection. In this review, we describe the current approaches for therapy selection depending on which type of tumour heterogeneity is being targeted and the available next-generation sequencing data. We cover intertumour heterogeneity studies and individual treatment selection using genomics variants, expression data or multi-omics strategies. We also describe intratumour dissection through clonal inference and single-cell transcriptomics, in each case providing bioinformatics tools for tailored treatment selection. Finally, we discuss how these therapy selection workflows could be integrated into the clinical practice.


Assuntos
Neoplasias , Humanos , Neoplasias/patologia , Biologia Computacional , Medicina de Precisão , Genômica , Sequenciamento de Nucleotídeos em Larga Escala
3.
Nucleic Acids Res ; 50(D1): D54-D59, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34755885

RESUMO

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms for a range of species. APPRIS selects principal isoforms based on protein structure and function features and on cross-species conservation. Most coding genes produce a single main protein isoform and the principal isoforms chosen by the APPRIS database best represent this main cellular isoform. Human genetic data, experimental protein evidence and the distribution of clinical variants all support the relevance of APPRIS principal isoforms. APPRIS annotations and principal isoforms have now been expanded to 10 model organisms. In this paper we highlight the most recent updates to the database. APPRIS annotations have been generated for two new species, cow and chicken, the protein structural information has been augmented with reliable models from the EMBL-EBI AlphaFold database, and we have substantially expanded the confirmatory proteomics evidence available for the human genome. The most significant change in APPRIS has been the implementation of TRIFID functional isoform scores. TRIFID functional scores are assigned to all splice isoforms, and APPRIS uses the TRIFID functional scores and proteomics evidence to determine principal isoforms when core methods cannot.


Assuntos
Bases de Dados de Proteínas , Isoformas de Proteínas/genética , Proteínas/genética , Proteômica , Animais , Bovinos , Galinhas/genética , Humanos , Conformação Proteica , Isoformas de Proteínas/classificação , Proteínas/química , Proteínas/classificação
4.
Bioinformatics ; 38(4): 1155-1156, 2022 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-34788788

RESUMO

SUMMARY: bollito is an automated, flexible and parallelizable computational pipeline for the comprehensive analysis of single-cell RNA-seq data. Starting from FASTQ files or preprocessed expression matrices, bollito performs both basic and advanced tasks in single-cell analysis integrating >30 state-of-the-art tools. This includes quality control, read alignment, dimensionality reduction, clustering, cell-marker detection, differential expression, functional analysis, trajectory inference and RNA velocity. bollito is built using the Snakemake workflow management system, which easily connects each execution step and facilitates the reproducibility of results. bollito's modular design makes it easy to incorporate other packages into the pipeline enabling its expansion with new functionalities. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://gitlab.com/bu_cnio/bollito under the MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Análise da Expressão Gênica de Célula Única , Software , Reprodutibilidade dos Testes , RNA , Fluxo de Trabalho
5.
Genome Med ; 13(1): 187, 2021 12 16.
Artigo em Inglês | MEDLINE | ID: mdl-34911571

RESUMO

We present Beyondcell, a computational methodology for identifying tumour cell subpopulations with distinct drug responses in single-cell RNA-seq data and proposing cancer-specific treatments. Our method calculates an enrichment score in a collection of drug signatures, delineating therapeutic clusters (TCs) within cellular populations. Additionally, Beyondcell determines the therapeutic differences among cell populations and generates a prioritised sensitivity-based ranking in order to guide drug selection. We performed Beyondcell analysis in five single-cell datasets and demonstrated that TCs can be exploited to target malignant cells both in cancer cell lines and tumour patients. Beyondcell is available at: https://gitlab.com/bu_cnio/beyondcell .


Assuntos
Neoplasias , Análise de Célula Única , Perfilação da Expressão Gênica/métodos , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/genética , RNA-Seq , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos
6.
NAR Genom Bioinform ; 3(2): lqab044, 2021 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-34046593

RESUMO

Alternative splicing of messenger RNA can generate an array of mature transcripts, but it is not clear how many go on to produce functionally relevant protein isoforms. There is only limited evidence for alternative proteins in proteomics analyses and data from population genetic variation studies indicate that most alternative exons are evolving neutrally. Determining which transcripts produce biologically important isoforms is key to understanding isoform function and to interpreting the real impact of somatic mutations and germline variations. Here we have developed a method, TRIFID, to classify the functional importance of splice isoforms. TRIFID was trained on isoforms detected in large-scale proteomics analyses and distinguishes these biologically important splice isoforms with high confidence. Isoforms predicted as functionally important by the algorithm had measurable cross species conservation and significantly fewer broken functional domains. Additionally, exons that code for these functionally important protein isoforms are under purifying selection, while exons from low scoring transcripts largely appear to be evolving neutrally. TRIFID has been developed for the human genome, but it could in principle be applied to other well-annotated species. We believe that this method will generate valuable insights into the cellular importance of alternative splicing.

7.
Eur J Cancer ; 147: 74-83, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33626496

RESUMO

BACKGROUND: Intraductal (IDC) and cribriform (CRIB) histologies in prostate cancer have been associated with germline BRCA2 (gBRCA2) mutations in small retrospective series, leading to the recommendation of genetic testing for patients with IDC in the primary tumour. PATIENTS AND METHODS: To examine the association of gBRCA2 mutations and other tumour molecular features with IDC and/or cribriform (CRIB) histologies, we conducted a case-control study in which primary prostate tumours from 58 gBRCA2 carriers were matched (1:2) by Gleason Grade Group and specimen type to 116 non-carriers. Presence/absence of IDC and CRIB morphologies was established by two expert uropathologists blinded to gBRCA2 status. Fluorescent in-situ hybridization (FISH) and next-generation sequencing (NGS) were used to detect BRCA2 alterations, PTEN deletions and TMPRSS2-ERG fusions. Chi-squared tests were used to compare the frequency of IDC and CRIB in gBRCA2 carriers and controls and to assess associations with other variables. Logistic regression models were constructed to identify independent factors associated with both histology patterns. RESULTS: No significant differences between gBRCA2 carriers and non-carriers were observed in the prevalence of IDC (36% gBRCA2 versus 50% non-carriers, p = 0.085) or CRIB (53% gBRCA2 versus 43% non-carriers p = 0.197) patterns. However, IDC histology was independently associated with bi-allelic BRCA2 alterations (OR 4.3, 95%CI 1.1-16.2) and PTEN homozygous loss (OR 5.2, 95%CI 2.1-13.1). CRIB morphology was also independently associated with bi-allelic BRCA2 alterations (OR 5.6, 95%CI 1.7-19.3). CONCLUSIONS: While we found no association between gBRCA2 mutations and IDC or CRIB histologies, bi-allelic BRCA2 loss in primary prostate tumours was significantly associated with both variant morphologies, independently of other clinical-pathologic factors.


Assuntos
Proteína BRCA2/genética , Biomarcadores Tumorais/genética , Mutação , Neoplasias da Próstata/genética , Adulto , Idoso , Idoso de 80 Anos ou mais , Estudos de Casos e Controles , Análise Mutacional de DNA , Deleção de Genes , Predisposição Genética para Doença , Humanos , Masculino , Pessoa de Meia-Idade , Gradação de Tumores , PTEN Fosfo-Hidrolase/genética , Fenótipo , Neoplasias da Próstata/patologia , Medição de Risco , Fatores de Risco , Espanha
8.
Nucleic Acids Res ; 49(D1): D916-D923, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33270111

RESUMO

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


Assuntos
COVID-19/prevenção & controle , Biologia Computacional/métodos , Bases de Dados Genéticas , Genômica/métodos , Anotação de Sequência Molecular/métodos , SARS-CoV-2/genética , Animais , COVID-19/epidemiologia , COVID-19/virologia , Epidemias , Humanos , Internet , Camundongos , Pseudogenes/genética , RNA Longo não Codificante/genética , SARS-CoV-2/metabolismo , SARS-CoV-2/fisiologia , Transcrição Gênica/genética
9.
PLoS Comput Biol ; 16(10): e1008287, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-33017396

RESUMO

The role of alternative splicing is one of the great unanswered questions in cellular biology. There is strong evidence for alternative splicing at the transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested that alternative splicing evolved in order to remodel tissue-specific protein-protein networks. Here we investigated the evidence for tissue-specific splicing among splice isoforms detected in a large-scale proteomics analysis. Although the data supporting alternative splicing is limited at the protein level, clear patterns emerged among the small numbers of alternative splice events that we could detect in the proteomics data. More than a third of these splice events were tissue-specific and most were ancient: over 95% of splice events that were tissue-specific in both proteomics and RNAseq analyses evolved prior to the ancestors of lobe-finned fish, at least 400 million years ago. By way of contrast, three in four alternative exons in the human gene set arose in the primate lineage, so our results cannot be extrapolated to the whole genome. Tissue-specific alternative protein forms in the proteomics analysis were particularly abundant in nervous and muscle tissues and their genes had roles related to the cytoskeleton and either the structure of muscle fibres or cell-cell connections. Our results suggest that this conserved tissue-specific alternative splicing may have played a role in the development of the vertebrate brain and heart.


Assuntos
Processamento Alternativo/genética , Especificidade de Órgãos/genética , Isoformas de Proteínas , Animais , Biologia Computacional , Genoma/genética , Humanos , Isoformas de Proteínas/química , Isoformas de Proteínas/classificação , Isoformas de Proteínas/genética , Proteômica
11.
Cancer Cell ; 37(3): 340-353.e6, 2020 03 16.
Artigo em Inglês | MEDLINE | ID: mdl-32109375

RESUMO

Inhibition of the cell-cycle kinases CDK4 and CDK6 is now part of the standard treatment in advanced breast cancer. CDK4/6 inhibitors, however, are not expected to cooperate with DNA-damaging or antimitotic chemotherapies as the former prevent cell-cycle entry, thus interfering with S-phase- or mitosis-targeting agents. Here, we report that sequential administration of CDK4/6 inhibitors after taxanes cooperates to prevent cellular proliferation in pancreatic ductal adenocarcinoma (PDAC) cells, patient-derived xenografts, and genetically engineered mice with Kras G12V and Cdkn2a-null mutations frequently observed in PDAC. This effect correlates with the repressive activity of CDK4/6 inhibitors on homologous recombination proteins required for the recovery from chromosomal damage. CDK4/6 inhibitors also prevent recovery from multiple DNA-damaging agents, suggesting broad applicability for their sequential administration after available chemotherapeutic agents.


Assuntos
Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico , Carcinoma Ductal Pancreático/tratamento farmacológico , Quinase 4 Dependente de Ciclina/antagonistas & inibidores , Quinase 6 Dependente de Ciclina/antagonistas & inibidores , Neoplasias Pancreáticas/tratamento farmacológico , Inibidores de Proteínas Quinases/administração & dosagem , Albuminas/administração & dosagem , Animais , Protocolos de Quimioterapia Combinada Antineoplásica/administração & dosagem , Carcinoma Ductal Pancreático/patologia , Linhagem Celular Tumoral , Inibidor p16 de Quinase Dependente de Ciclina/genética , Reparo do DNA/efeitos dos fármacos , Esquema de Medicação , Recombinação Homóloga/efeitos dos fármacos , Humanos , Camundongos Nus , Camundongos Transgênicos , Mutação , Neoplasias Experimentais/tratamento farmacológico , Neoplasias Experimentais/genética , Paclitaxel/administração & dosagem , Neoplasias Pancreáticas/patologia , Piperazinas/administração & dosagem , Inibidores de Proteínas Quinases/uso terapêutico , Proteínas Proto-Oncogênicas p21(ras)/genética , Piridinas/administração & dosagem , Ensaios Antitumorais Modelo de Xenoenxerto
12.
Bioinformatics ; 35(22): 4846-4848, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31173067

RESUMO

MOTIVATION: Genetic alterations lead to tumor progression and cell survival but also uncover cancer-specific vulnerabilities on gene dependencies that can be therapeutically exploited. RESULTS: vulcanSpot is a novel computational approach implemented to expand the therapeutic options in cancer beyond known-driver genes unlocking alternative ways to target undruggable genes. The method integrates genome-wide information provided by massive screening experiments to detect genetic vulnerabilities associated to tumors. Then, vulcanSpot prioritizes drugs to target cancer-specific gene dependencies using a weighted scoring system based on well known drug-gene relationships and drug repositioning strategies. AVAILABILITY AND IMPLEMENTATION: http://www.vulcanspot.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Neoplasias , Biologia Computacional , Reposicionamento de Medicamentos , Humanos , Mutação , Software
13.
Nucleic Acids Res ; 47(D1): D766-D773, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30357393

RESUMO

The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.


Assuntos
Bases de Dados Genéticas , Genoma Humano/genética , Genômica , Pseudogenes/genética , Animais , Biologia Computacional , Humanos , Internet , Camundongos , Anotação de Sequência Molecular , Software
14.
Nat Struct Mol Biol ; 25(9): 778-786, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-30104661

RESUMO

RNA viruses are a major threat to animals and plants. RNA interference (RNAi) and the interferon response provide innate antiviral defense against RNA viruses. Here, we performed a large-scale screen using Caenorhabditis elegans and its natural pathogen the Orsay virus (OrV), and we identified cde-1 as important for antiviral defense. CDE-1 is a homolog of the mammalian TUT4 and TUT7 terminal uridylyltransferases (collectively called TUT4(7)); its catalytic activity is required for its antiviral function. CDE-1 uridylates the 3' end of the OrV RNA genome and promotes its degradation in a manner independent of the RNAi pathway. Likewise, TUT4(7) enzymes uridylate influenza A virus (IAV) mRNAs in mammalian cells. Deletion of TUT4(7) leads to increased IAV mRNA and protein levels. Collectively, these data implicate 3'-terminal uridylation of viral RNAs as a conserved antiviral defense mechanism.


Assuntos
Caenorhabditis elegans/enzimologia , Caenorhabditis elegans/virologia , Imunidade Inata , RNA Nucleotidiltransferases/metabolismo , Vírus de RNA/metabolismo , Células A549 , Animais , Caenorhabditis elegans/genética , Humanos , Interferência de RNA , Vírus de RNA/imunologia , Vírus de RNA/fisiologia , RNA Viral/metabolismo , Transcriptoma , Replicação Viral
15.
Nucleic Acids Res ; 46(D1): D213-D217, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29069475

RESUMO

The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the 'principal' isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.


Assuntos
Bases de Dados Genéticas , Isoformas de Proteínas/genética , Processamento Alternativo , Sequência de Aminoácidos , Animais , Humanos , Modelos Moleculares , Anotação de Sequência Molecular , Conformação Proteica , Isoformas de Proteínas/química , Proteoma/genética , Reprodutibilidade dos Testes , Alinhamento de Sequência
16.
Dev Cell ; 42(3): 241-255.e6, 2017 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-28787591

RESUMO

Small RNAs play a crucial role in genome defense against transposable elements and guide Argonaute proteins to nascent RNA transcripts to induce co-transcriptional gene silencing. However, the molecular basis of this process remains unknown. Here, we identify the conserved RNA helicase Aquarius/EMB-4 as a direct and essential link between small RNA pathways and the transcriptional machinery in Caenorhabditis elegans. Aquarius physically interacts with the germline Argonaute HRDE-1. Aquarius is required to initiate small-RNA-induced heritable gene silencing. HRDE-1 and Aquarius silence overlapping sets of genes and transposable elements. Surprisingly, removal of introns from a target gene abolishes the requirement for Aquarius, but not HRDE-1, for small RNA-dependent gene silencing. We conclude that Aquarius allows small RNA pathways to compete for access to nascent transcripts undergoing co-transcriptional splicing in order to detect and silence transposable elements. Thus, Aquarius and HRDE-1 act as gatekeepers coordinating gene expression and genome defense.


Assuntos
Proteínas Argonautas/genética , Proteínas de Caenorhabditis elegans/genética , Caenorhabditis elegans/genética , Proteínas Nucleares/genética , Interferência de RNA , Animais , Proteínas Argonautas/metabolismo , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Elementos de DNA Transponíveis , Íntrons , Proteínas Nucleares/metabolismo , Ligação Proteica
17.
Chembiochem ; 18(11): 979-984, 2017 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-28449301

RESUMO

More than a hundred distinct modified nucleosides have been identified in RNA, but little is known about their distribution across different organisms, their dynamic nature and their response to cellular and environmental stress. Mass-spectrometry-based methods have been at the forefront of identifying and quantifying modified nucleosides. However, they often require synthetic reference standards, which do not exist in the case of many modified nucleosides, and this therefore impedes their analysis. Here we use a metabolic labelling approach to achieve rapid generation of bio-isotopologues of the complete Caenorhabditis elegans transcriptome and its modifications and use them as reference standards to characterise the RNA modification profile in this multicellular organism through an untargeted liquid-chromatography tandem high-resolution mass spectrometry (LC-HRMS) approach. We furthermore show that several of these RNA modifications have a dynamic response to environmental stress and that, in particular, changes in the tRNA wobble base modification 5-methoxycarbonylmethyl-2-thiouridine (mcm5 s2 U) lead to codon-biased gene-expression changes in starved animals.


Assuntos
Processamento Pós-Transcricional do RNA , Estresse Fisiológico/genética , Transcriptoma , Animais , Caenorhabditis elegans , Cromatografia Líquida , Marcação por Isótopo , Espectrometria de Massas em Tandem , Tiouridina/análogos & derivados , Tiouridina/metabolismo
18.
BMC Bioinformatics ; 16 Suppl 2: A1-10, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25708534

RESUMO

This report summarizes the scientific content and activities of the annual symposium organized by the Student Council of the International Society for Computational Biology (ISCB), held in conjunction with the Intelligent Systems for Molecular Biology (ISMB) conference in Boston, USA, on July 11th, 2014.


Assuntos
Biologia Computacional , Resistência a Múltiplos Medicamentos , Sequenciamento de Nucleotídeos em Larga Escala , Repetições de Microssatélites/genética , Revisão da Pesquisa por Pares , Editoração , RNA Mensageiro/metabolismo , Análise de Sequência de DNA
19.
Bioinformatics ; 31(2): 201-8, 2015 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-25246432

RESUMO

MOTIVATION: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures. RESULTS: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed. AVAILABILITY: The datasets are available from Web site at URL: http://mobidb.bio.unipd.it/lsd. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Proteínas/química , Análise de Sequência de Proteína/métodos , Proteína Supressora de Tumor p53/química , Cristalografia por Raios X , Bases de Dados de Proteínas , Humanos , Anotação de Sequência Molecular , Estrutura Terciária de Proteína
20.
Nucleic Acids Res ; 43(Database issue): D315-20, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25361972

RESUMO

MobiDB (http://mobidb.bio.unipd.it/) is a database of intrinsically disordered and mobile proteins. Intrinsically disordered regions are key for the function of numerous proteins. Here we provide a new version of MobiDB, a centralized source aimed at providing the most complete picture on different flavors of disorder in protein structures covering all UniProt sequences (currently over 80 million). The database features three levels of annotation: manually curated, indirect and predicted. Manually curated data is extracted from the DisProt database. Indirect data is inferred from PDB structures that are considered an indication of intrinsic disorder. The 10 predictors currently included (three ESpritz flavors, two IUPred flavors, two DisEMBL flavors, GlobPlot, VSL2b and JRONN) enable MobiDB to provide disorder annotations for every protein in absence of more reliable data. The new version also features a consensus annotation and classification for long disordered regions. In order to complement the disorder annotations, MobiDB features additional annotations from external sources. Annotations from the UniProt database include post-translational modifications and linear motifs. Pfam annotations are displayed in graphical form and are link-enabled, allowing the user to visit the corresponding Pfam page for further information. Experimental protein-protein interactions from STRING are also classified for disorder content.


Assuntos
Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/química , Curadoria de Dados , Conformação Proteica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA