Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 30
Filter
1.
Mol Cancer ; 22(1): 119, 2023 07 29.
Article in English | MEDLINE | ID: mdl-37516825

ABSTRACT

Newly growing evidence highlights the essential role that epitranscriptomic marks play in the development of many cancers; however, little is known about the role and implications of altered epitranscriptome deposition in prostate cancer. Here, we show that the transfer RNA N7-methylguanosine (m7G) transferase METTL1 is highly expressed in primary and advanced prostate tumours. Mechanistically, we find that METTL1 depletion causes the loss of m7G tRNA methylation and promotes the biogenesis of a novel class of small non-coding RNAs derived from 5'tRNA fragments. 5'tRNA-derived small RNAs steer translation control to favour the synthesis of key regulators of tumour growth suppression, interferon pathway, and immune effectors. Knockdown of Mettl1 in prostate cancer preclinical models increases intratumoural infiltration of pro-inflammatory immune cells and enhances responses to immunotherapy. Collectively, our findings reveal a therapeutically actionable role of METTL1-directed m7G tRNA methylation in cancer cell translation control and tumour biology.


Subject(s)
Carcinogenesis , Prostatic Neoplasms , Male , Humans , Carcinogenesis/genetics , Cell Transformation, Neoplastic , Prostatic Neoplasms/genetics , Transcription, Genetic , RNA Processing, Post-Transcriptional , Methyltransferases/genetics
2.
Mol Oncol ; 16(21): 3881-3908, 2022 11.
Article in English | MEDLINE | ID: mdl-35811332

ABSTRACT

Tumour heterogeneity is one of the main characteristics of cancer and can be categorised into inter- or intratumour heterogeneity. This heterogeneity has been revealed as one of the key causes of treatment failure and relapse. Precision oncology is an emerging field that seeks to design tailored treatments for each cancer patient according to epidemiological, clinical and omics data. This discipline relies on bioinformatics tools designed to compute scores to prioritise available drugs, with the aim of helping clinicians in treatment selection. In this review, we describe the current approaches for therapy selection depending on which type of tumour heterogeneity is being targeted and the available next-generation sequencing data. We cover intertumour heterogeneity studies and individual treatment selection using genomics variants, expression data or multi-omics strategies. We also describe intratumour dissection through clonal inference and single-cell transcriptomics, in each case providing bioinformatics tools for tailored treatment selection. Finally, we discuss how these therapy selection workflows could be integrated into the clinical practice.


Subject(s)
Neoplasms , Humans , Neoplasms/pathology , Computational Biology , Precision Medicine , Genomics , High-Throughput Nucleotide Sequencing
3.
Bioinformatics ; 38(4): 1155-1156, 2022 01 27.
Article in English | MEDLINE | ID: mdl-34788788

ABSTRACT

SUMMARY: bollito is an automated, flexible and parallelizable computational pipeline for the comprehensive analysis of single-cell RNA-seq data. Starting from FASTQ files or preprocessed expression matrices, bollito performs both basic and advanced tasks in single-cell analysis integrating >30 state-of-the-art tools. This includes quality control, read alignment, dimensionality reduction, clustering, cell-marker detection, differential expression, functional analysis, trajectory inference and RNA velocity. bollito is built using the Snakemake workflow management system, which easily connects each execution step and facilitates the reproducibility of results. bollito's modular design makes it easy to incorporate other packages into the pipeline enabling its expansion with new functionalities. AVAILABILITY AND IMPLEMENTATION: Source code is freely available at https://gitlab.com/bu_cnio/bollito under the MIT license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Single-Cell Gene Expression Analysis , Software , Reproducibility of Results , RNA , Workflow
4.
Nucleic Acids Res ; 50(D1): D54-D59, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34755885

ABSTRACT

APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms for a range of species. APPRIS selects principal isoforms based on protein structure and function features and on cross-species conservation. Most coding genes produce a single main protein isoform and the principal isoforms chosen by the APPRIS database best represent this main cellular isoform. Human genetic data, experimental protein evidence and the distribution of clinical variants all support the relevance of APPRIS principal isoforms. APPRIS annotations and principal isoforms have now been expanded to 10 model organisms. In this paper we highlight the most recent updates to the database. APPRIS annotations have been generated for two new species, cow and chicken, the protein structural information has been augmented with reliable models from the EMBL-EBI AlphaFold database, and we have substantially expanded the confirmatory proteomics evidence available for the human genome. The most significant change in APPRIS has been the implementation of TRIFID functional isoform scores. TRIFID functional scores are assigned to all splice isoforms, and APPRIS uses the TRIFID functional scores and proteomics evidence to determine principal isoforms when core methods cannot.


Subject(s)
Databases, Protein , Protein Isoforms/genetics , Proteins/genetics , Proteomics , Animals , Cattle , Chickens/genetics , Humans , Protein Conformation , Protein Isoforms/classification , Proteins/chemistry , Proteins/classification
5.
Genome Med ; 13(1): 187, 2021 12 16.
Article in English | MEDLINE | ID: mdl-34911571

ABSTRACT

We present Beyondcell, a computational methodology for identifying tumour cell subpopulations with distinct drug responses in single-cell RNA-seq data and proposing cancer-specific treatments. Our method calculates an enrichment score in a collection of drug signatures, delineating therapeutic clusters (TCs) within cellular populations. Additionally, Beyondcell determines the therapeutic differences among cell populations and generates a prioritised sensitivity-based ranking in order to guide drug selection. We performed Beyondcell analysis in five single-cell datasets and demonstrated that TCs can be exploited to target malignant cells both in cancer cell lines and tumour patients. Beyondcell is available at: https://gitlab.com/bu_cnio/beyondcell .


Subject(s)
Neoplasms , Single-Cell Analysis , Gene Expression Profiling/methods , Humans , Neoplasms/drug therapy , Neoplasms/genetics , RNA-Seq , Sequence Analysis, RNA/methods , Single-Cell Analysis/methods
6.
NAR Genom Bioinform ; 3(2): lqab044, 2021 Jun.
Article in English | MEDLINE | ID: mdl-34046593

ABSTRACT

Alternative splicing of messenger RNA can generate an array of mature transcripts, but it is not clear how many go on to produce functionally relevant protein isoforms. There is only limited evidence for alternative proteins in proteomics analyses and data from population genetic variation studies indicate that most alternative exons are evolving neutrally. Determining which transcripts produce biologically important isoforms is key to understanding isoform function and to interpreting the real impact of somatic mutations and germline variations. Here we have developed a method, TRIFID, to classify the functional importance of splice isoforms. TRIFID was trained on isoforms detected in large-scale proteomics analyses and distinguishes these biologically important splice isoforms with high confidence. Isoforms predicted as functionally important by the algorithm had measurable cross species conservation and significantly fewer broken functional domains. Additionally, exons that code for these functionally important protein isoforms are under purifying selection, while exons from low scoring transcripts largely appear to be evolving neutrally. TRIFID has been developed for the human genome, but it could in principle be applied to other well-annotated species. We believe that this method will generate valuable insights into the cellular importance of alternative splicing.

7.
Eur J Cancer ; 147: 74-83, 2021 04.
Article in English | MEDLINE | ID: mdl-33626496

ABSTRACT

BACKGROUND: Intraductal (IDC) and cribriform (CRIB) histologies in prostate cancer have been associated with germline BRCA2 (gBRCA2) mutations in small retrospective series, leading to the recommendation of genetic testing for patients with IDC in the primary tumour. PATIENTS AND METHODS: To examine the association of gBRCA2 mutations and other tumour molecular features with IDC and/or cribriform (CRIB) histologies, we conducted a case-control study in which primary prostate tumours from 58 gBRCA2 carriers were matched (1:2) by Gleason Grade Group and specimen type to 116 non-carriers. Presence/absence of IDC and CRIB morphologies was established by two expert uropathologists blinded to gBRCA2 status. Fluorescent in-situ hybridization (FISH) and next-generation sequencing (NGS) were used to detect BRCA2 alterations, PTEN deletions and TMPRSS2-ERG fusions. Chi-squared tests were used to compare the frequency of IDC and CRIB in gBRCA2 carriers and controls and to assess associations with other variables. Logistic regression models were constructed to identify independent factors associated with both histology patterns. RESULTS: No significant differences between gBRCA2 carriers and non-carriers were observed in the prevalence of IDC (36% gBRCA2 versus 50% non-carriers, p = 0.085) or CRIB (53% gBRCA2 versus 43% non-carriers p = 0.197) patterns. However, IDC histology was independently associated with bi-allelic BRCA2 alterations (OR 4.3, 95%CI 1.1-16.2) and PTEN homozygous loss (OR 5.2, 95%CI 2.1-13.1). CRIB morphology was also independently associated with bi-allelic BRCA2 alterations (OR 5.6, 95%CI 1.7-19.3). CONCLUSIONS: While we found no association between gBRCA2 mutations and IDC or CRIB histologies, bi-allelic BRCA2 loss in primary prostate tumours was significantly associated with both variant morphologies, independently of other clinical-pathologic factors.


Subject(s)
BRCA2 Protein/genetics , Biomarkers, Tumor/genetics , Mutation , Prostatic Neoplasms/genetics , Adult , Aged , Aged, 80 and over , Case-Control Studies , DNA Mutational Analysis , Gene Deletion , Genetic Predisposition to Disease , Humans , Male , Middle Aged , Neoplasm Grading , PTEN Phosphohydrolase/genetics , Phenotype , Prostatic Neoplasms/pathology , Risk Assessment , Risk Factors , Spain
8.
Nucleic Acids Res ; 49(D1): D916-D923, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33270111

ABSTRACT

The GENCODE project annotates human and mouse genes and transcripts supported by experimental data with high accuracy, providing a foundational resource that supports genome biology and clinical genomics. GENCODE annotation processes make use of primary data and bioinformatic tools and analysis generated both within the consortium and externally to support the creation of transcript structures and the determination of their function. Here, we present improvements to our annotation infrastructure, bioinformatics tools, and analysis, and the advances they support in the annotation of the human and mouse genomes including: the completion of first pass manual annotation for the mouse reference genome; targeted improvements to the annotation of genes associated with SARS-CoV-2 infection; collaborative projects to achieve convergence across reference annotation databases for the annotation of human and mouse protein-coding genes; and the first GENCODE manually supervised automated annotation of lncRNAs. Our annotation is accessible via Ensembl, the UCSC Genome Browser and https://www.gencodegenes.org.


Subject(s)
COVID-19/prevention & control , Computational Biology/methods , Databases, Genetic , Genomics/methods , Molecular Sequence Annotation/methods , SARS-CoV-2/genetics , Animals , COVID-19/epidemiology , COVID-19/virology , Epidemics , Humans , Internet , Mice , Pseudogenes/genetics , RNA, Long Noncoding/genetics , SARS-CoV-2/metabolism , SARS-CoV-2/physiology , Transcription, Genetic/genetics
9.
PLoS Comput Biol ; 16(10): e1008287, 2020 10.
Article in English | MEDLINE | ID: mdl-33017396

ABSTRACT

The role of alternative splicing is one of the great unanswered questions in cellular biology. There is strong evidence for alternative splicing at the transcript level, and transcriptomics experiments show that many splice events are tissue specific. It has been suggested that alternative splicing evolved in order to remodel tissue-specific protein-protein networks. Here we investigated the evidence for tissue-specific splicing among splice isoforms detected in a large-scale proteomics analysis. Although the data supporting alternative splicing is limited at the protein level, clear patterns emerged among the small numbers of alternative splice events that we could detect in the proteomics data. More than a third of these splice events were tissue-specific and most were ancient: over 95% of splice events that were tissue-specific in both proteomics and RNAseq analyses evolved prior to the ancestors of lobe-finned fish, at least 400 million years ago. By way of contrast, three in four alternative exons in the human gene set arose in the primate lineage, so our results cannot be extrapolated to the whole genome. Tissue-specific alternative protein forms in the proteomics analysis were particularly abundant in nervous and muscle tissues and their genes had roles related to the cytoskeleton and either the structure of muscle fibres or cell-cell connections. Our results suggest that this conserved tissue-specific alternative splicing may have played a role in the development of the vertebrate brain and heart.


Subject(s)
Alternative Splicing/genetics , Organ Specificity/genetics , Protein Isoforms , Animals , Computational Biology , Genome/genetics , Humans , Protein Isoforms/chemistry , Protein Isoforms/classification , Protein Isoforms/genetics , Proteomics
11.
Cancer Cell ; 37(3): 340-353.e6, 2020 03 16.
Article in English | MEDLINE | ID: mdl-32109375

ABSTRACT

Inhibition of the cell-cycle kinases CDK4 and CDK6 is now part of the standard treatment in advanced breast cancer. CDK4/6 inhibitors, however, are not expected to cooperate with DNA-damaging or antimitotic chemotherapies as the former prevent cell-cycle entry, thus interfering with S-phase- or mitosis-targeting agents. Here, we report that sequential administration of CDK4/6 inhibitors after taxanes cooperates to prevent cellular proliferation in pancreatic ductal adenocarcinoma (PDAC) cells, patient-derived xenografts, and genetically engineered mice with Kras G12V and Cdkn2a-null mutations frequently observed in PDAC. This effect correlates with the repressive activity of CDK4/6 inhibitors on homologous recombination proteins required for the recovery from chromosomal damage. CDK4/6 inhibitors also prevent recovery from multiple DNA-damaging agents, suggesting broad applicability for their sequential administration after available chemotherapeutic agents.


Subject(s)
Antineoplastic Combined Chemotherapy Protocols/therapeutic use , Carcinoma, Pancreatic Ductal/drug therapy , Cyclin-Dependent Kinase 4/antagonists & inhibitors , Cyclin-Dependent Kinase 6/antagonists & inhibitors , Pancreatic Neoplasms/drug therapy , Protein Kinase Inhibitors/administration & dosage , Albumins/administration & dosage , Animals , Antineoplastic Combined Chemotherapy Protocols/administration & dosage , Carcinoma, Pancreatic Ductal/pathology , Cell Line, Tumor , Cyclin-Dependent Kinase Inhibitor p16/genetics , DNA Repair/drug effects , Drug Administration Schedule , Homologous Recombination/drug effects , Humans , Mice, Nude , Mice, Transgenic , Mutation , Neoplasms, Experimental/drug therapy , Neoplasms, Experimental/genetics , Paclitaxel/administration & dosage , Pancreatic Neoplasms/pathology , Piperazines/administration & dosage , Protein Kinase Inhibitors/therapeutic use , Proto-Oncogene Proteins p21(ras)/genetics , Pyridines/administration & dosage , Xenograft Model Antitumor Assays
12.
Bioinformatics ; 35(22): 4846-4848, 2019 11 01.
Article in English | MEDLINE | ID: mdl-31173067

ABSTRACT

MOTIVATION: Genetic alterations lead to tumor progression and cell survival but also uncover cancer-specific vulnerabilities on gene dependencies that can be therapeutically exploited. RESULTS: vulcanSpot is a novel computational approach implemented to expand the therapeutic options in cancer beyond known-driver genes unlocking alternative ways to target undruggable genes. The method integrates genome-wide information provided by massive screening experiments to detect genetic vulnerabilities associated to tumors. Then, vulcanSpot prioritizes drugs to target cancer-specific gene dependencies using a weighted scoring system based on well known drug-gene relationships and drug repositioning strategies. AVAILABILITY AND IMPLEMENTATION: http://www.vulcanspot.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Neoplasms , Computational Biology , Drug Repositioning , Humans , Mutation , Software
13.
Nucleic Acids Res ; 47(D1): D766-D773, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30357393

ABSTRACT

The accurate identification and description of the genes in the human and mouse genomes is a fundamental requirement for high quality analysis of data informing both genome biology and clinical genomics. Over the last 15 years, the GENCODE consortium has been producing reference quality gene annotations to provide this foundational resource. The GENCODE consortium includes both experimental and computational biology groups who work together to improve and extend the GENCODE gene annotation. Specifically, we generate primary data, create bioinformatics tools and provide analysis to support the work of expert manual gene annotators and automated gene annotation pipelines. In addition, manual and computational annotation workflows use any and all publicly available data and analysis, along with the research literature to identify and characterise gene loci to the highest standard. GENCODE gene annotations are accessible via the Ensembl and UCSC Genome Browsers, the Ensembl FTP site, Ensembl Biomart, Ensembl Perl and REST APIs as well as https://www.gencodegenes.org.


Subject(s)
Databases, Genetic , Genome, Human/genetics , Genomics , Pseudogenes/genetics , Animals , Computational Biology , Humans , Internet , Mice , Molecular Sequence Annotation , Software
14.
Nat Struct Mol Biol ; 25(9): 778-786, 2018 09.
Article in English | MEDLINE | ID: mdl-30104661

ABSTRACT

RNA viruses are a major threat to animals and plants. RNA interference (RNAi) and the interferon response provide innate antiviral defense against RNA viruses. Here, we performed a large-scale screen using Caenorhabditis elegans and its natural pathogen the Orsay virus (OrV), and we identified cde-1 as important for antiviral defense. CDE-1 is a homolog of the mammalian TUT4 and TUT7 terminal uridylyltransferases (collectively called TUT4(7)); its catalytic activity is required for its antiviral function. CDE-1 uridylates the 3' end of the OrV RNA genome and promotes its degradation in a manner independent of the RNAi pathway. Likewise, TUT4(7) enzymes uridylate influenza A virus (IAV) mRNAs in mammalian cells. Deletion of TUT4(7) leads to increased IAV mRNA and protein levels. Collectively, these data implicate 3'-terminal uridylation of viral RNAs as a conserved antiviral defense mechanism.


Subject(s)
Caenorhabditis elegans/enzymology , Caenorhabditis elegans/virology , Immunity, Innate , RNA Nucleotidyltransferases/metabolism , RNA Viruses/metabolism , A549 Cells , Animals , Caenorhabditis elegans/genetics , Humans , RNA Interference , RNA Viruses/immunology , RNA Viruses/physiology , RNA, Viral/metabolism , Transcriptome , Virus Replication
15.
Nucleic Acids Res ; 46(D1): D213-D217, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29069475

ABSTRACT

The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the 'principal' isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.


Subject(s)
Databases, Genetic , Protein Isoforms/genetics , Alternative Splicing , Amino Acid Sequence , Animals , Humans , Models, Molecular , Molecular Sequence Annotation , Protein Conformation , Protein Isoforms/chemistry , Proteome/genetics , Reproducibility of Results , Sequence Alignment
16.
Dev Cell ; 42(3): 241-255.e6, 2017 08 07.
Article in English | MEDLINE | ID: mdl-28787591

ABSTRACT

Small RNAs play a crucial role in genome defense against transposable elements and guide Argonaute proteins to nascent RNA transcripts to induce co-transcriptional gene silencing. However, the molecular basis of this process remains unknown. Here, we identify the conserved RNA helicase Aquarius/EMB-4 as a direct and essential link between small RNA pathways and the transcriptional machinery in Caenorhabditis elegans. Aquarius physically interacts with the germline Argonaute HRDE-1. Aquarius is required to initiate small-RNA-induced heritable gene silencing. HRDE-1 and Aquarius silence overlapping sets of genes and transposable elements. Surprisingly, removal of introns from a target gene abolishes the requirement for Aquarius, but not HRDE-1, for small RNA-dependent gene silencing. We conclude that Aquarius allows small RNA pathways to compete for access to nascent transcripts undergoing co-transcriptional splicing in order to detect and silence transposable elements. Thus, Aquarius and HRDE-1 act as gatekeepers coordinating gene expression and genome defense.


Subject(s)
Argonaute Proteins/genetics , Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Nuclear Proteins/genetics , RNA Interference , Animals , Argonaute Proteins/metabolism , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/metabolism , DNA Transposable Elements , Introns , Nuclear Proteins/metabolism , Protein Binding
17.
Chembiochem ; 18(11): 979-984, 2017 06 01.
Article in English | MEDLINE | ID: mdl-28449301

ABSTRACT

More than a hundred distinct modified nucleosides have been identified in RNA, but little is known about their distribution across different organisms, their dynamic nature and their response to cellular and environmental stress. Mass-spectrometry-based methods have been at the forefront of identifying and quantifying modified nucleosides. However, they often require synthetic reference standards, which do not exist in the case of many modified nucleosides, and this therefore impedes their analysis. Here we use a metabolic labelling approach to achieve rapid generation of bio-isotopologues of the complete Caenorhabditis elegans transcriptome and its modifications and use them as reference standards to characterise the RNA modification profile in this multicellular organism through an untargeted liquid-chromatography tandem high-resolution mass spectrometry (LC-HRMS) approach. We furthermore show that several of these RNA modifications have a dynamic response to environmental stress and that, in particular, changes in the tRNA wobble base modification 5-methoxycarbonylmethyl-2-thiouridine (mcm5 s2 U) lead to codon-biased gene-expression changes in starved animals.


Subject(s)
RNA Processing, Post-Transcriptional , Stress, Physiological/genetics , Transcriptome , Animals , Caenorhabditis elegans , Chromatography, Liquid , Isotope Labeling , Tandem Mass Spectrometry , Thiouridine/analogs & derivatives , Thiouridine/metabolism
18.
BMC Bioinformatics ; 16 Suppl 2: A1-10, 2015.
Article in English | MEDLINE | ID: mdl-25708534

ABSTRACT

This report summarizes the scientific content and activities of the annual symposium organized by the Student Council of the International Society for Computational Biology (ISCB), held in conjunction with the Intelligent Systems for Molecular Biology (ISMB) conference in Boston, USA, on July 11th, 2014.


Subject(s)
Computational Biology , Drug Resistance, Multiple , High-Throughput Nucleotide Sequencing , Microsatellite Repeats/genetics , Peer Review, Research , Publishing , RNA, Messenger/metabolism , Sequence Analysis, DNA
19.
Bioinformatics ; 31(2): 201-8, 2015 Jan 15.
Article in English | MEDLINE | ID: mdl-25246432

ABSTRACT

MOTIVATION: Intrinsically disordered regions are key for the function of numerous proteins. Due to the difficulties in experimental disorder characterization, many computational predictors have been developed with various disorder flavors. Their performance is generally measured on small sets mainly from experimentally solved structures, e.g. Protein Data Bank (PDB) chains. MobiDB has only recently started to collect disorder annotations from multiple experimental structures. RESULTS: MobiDB annotates disorder for UniProt sequences, allowing us to conduct the first large-scale assessment of fast disorder predictors on 25 833 different sequences with X-ray crystallographic structures. In addition to a comprehensive ranking of predictors, this analysis produced the following interesting observations. (i) The predictors cluster according to their disorder definition, with a consensus giving more confidence. (ii) Previous assessments appear over-reliant on data annotated at the PDB chain level and performance is lower on entire UniProt sequences. (iii) Long disordered regions are harder to predict. (iv) Depending on the structural and functional types of the proteins, differences in prediction performance of up to 10% are observed. AVAILABILITY: The datasets are available from Web site at URL: http://mobidb.bio.unipd.it/lsd. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Proteins/chemistry , Sequence Analysis, Protein/methods , Tumor Suppressor Protein p53/chemistry , Crystallography, X-Ray , Databases, Protein , Humans , Molecular Sequence Annotation , Protein Structure, Tertiary
20.
Nucleic Acids Res ; 43(Database issue): D315-20, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25361972

ABSTRACT

MobiDB (http://mobidb.bio.unipd.it/) is a database of intrinsically disordered and mobile proteins. Intrinsically disordered regions are key for the function of numerous proteins. Here we provide a new version of MobiDB, a centralized source aimed at providing the most complete picture on different flavors of disorder in protein structures covering all UniProt sequences (currently over 80 million). The database features three levels of annotation: manually curated, indirect and predicted. Manually curated data is extracted from the DisProt database. Indirect data is inferred from PDB structures that are considered an indication of intrinsic disorder. The 10 predictors currently included (three ESpritz flavors, two IUPred flavors, two DisEMBL flavors, GlobPlot, VSL2b and JRONN) enable MobiDB to provide disorder annotations for every protein in absence of more reliable data. The new version also features a consensus annotation and classification for long disordered regions. In order to complement the disorder annotations, MobiDB features additional annotations from external sources. Annotations from the UniProt database include post-translational modifications and linear motifs. Pfam annotations are displayed in graphical form and are link-enabled, allowing the user to visit the corresponding Pfam page for further information. Experimental protein-protein interactions from STRING are also classified for disorder content.


Subject(s)
Databases, Protein , Intrinsically Disordered Proteins/chemistry , Data Curation , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL
...