Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 90
Filter
1.
J Proteomics ; : 105246, 2024 Jul 02.
Article in English | MEDLINE | ID: mdl-38964537

ABSTRACT

The 2023 European Bioinformatics Community for Mass Spectrometry (EuBIC-MS) Developers Meeting was held from January 15th to January 20th, 2023, in Congressi Stefano Franscin at Monte Verità in Ticino, Switzerland. The participants were scientists and developers working in computational mass spectrometry (MS), metabolomics, and proteomics. The 5-day program was split between introductory keynote lectures and parallel hackathon sessions focusing on "Artificial Intelligence in proteomics" to stimulate future directions in the MS-driven omics areas. During the latter, the participants developed bioinformatics tools and resources addressing outstanding needs in the community. The hackathons allowed less experienced participants to learn from more advanced computational MS experts and actively contribute to highly relevant research projects. We successfully produced several new tools applicable to the proteomics community by improving data analysis and facilitating future research.

2.
bioRxiv ; 2024 May 29.
Article in English | MEDLINE | ID: mdl-38854010

ABSTRACT

Genome sequencing efforts have led to the discovery of tens of millions of protein missense variants found in the human population with the majority of these having no annotated role and some likely contributing to trait variation and disease. Sequence-based artificial intelligence approaches have become highly accurate at predicting variants that are detrimental to the function of proteins but they do not inform on mechanisms of disruption. Here we combined sequence and structure-based methods to perform proteome-wide prediction of deleterious variants with information on their impact on protein stability, protein-protein interactions and small-molecule binding pockets. AlphaFold2 structures were used to predict approximately 100,000 small-molecule binding pockets and stability changes for over 200 million variants. To inform on protein-protein interfaces we used AlphaFold2 to predict structures for nearly 500,000 protein complexes. We illustrate the value of mechanism-aware variant effect predictions to study the relation between protein stability and abundance and the structural properties of interfaces underlying trans protein quantitative trait loci (pQTLs). We characterised the distribution of mechanistic impacts of protein variants found in patients and experimentally studied example disease linked variants in FGFR1.

3.
Dev Cell ; 2024 Jun 18.
Article in English | MEDLINE | ID: mdl-38906138

ABSTRACT

Sexually reproducing eukaryotes employ a developmentally regulated cell division program-meiosis-to generate haploid gametes from diploid germ cells. To understand how gametes arise, we generated a proteomic census encompassing the entire meiotic program of budding yeast. We found that concerted waves of protein expression and phosphorylation modify nearly all cellular pathways to support meiotic entry, meiotic progression, and gamete morphogenesis. Leveraging this comprehensive resource, we pinpointed dynamic changes in mitochondrial components and showed that phosphorylation of the FoF1-ATP synthase complex is required for efficient gametogenesis. Furthermore, using cryoET as an orthogonal approach to visualize mitochondria, we uncovered highly ordered filament arrays of Ald4ALDH2, a conserved aldehyde dehydrogenase that is highly expressed and phosphorylated during meiosis. Notably, phosphorylation-resistant mutants failed to accumulate filaments, suggesting that phosphorylation regulates context-specific Ald4ALDH2 polymerization. Overall, this proteomic census constitutes a broad resource to guide the exploration of the unique sequence of events underpinning gametogenesis.

4.
Nucleic Acids Res ; 52(W1): W140-W147, 2024 Jul 05.
Article in English | MEDLINE | ID: mdl-38769064

ABSTRACT

Genomic variation can impact normal biological function in complex ways and so understanding variant effects requires a broad range of data to be coherently assimilated. Whilst the volume of human variant data and relevant annotations has increased, the corresponding increase in the breadth of participating fields, standards and versioning mean that moving between genomic, coding, protein and structure positions is increasingly complex. In turn this makes investigating variants in diverse formats and assimilating annotations from different resources challenging. ProtVar addresses these issues to facilitate the contextualization and interpretation of human missense variation with unparalleled flexibility and ease of accessibility for use by the broadest range of researchers. By precalculating all possible variants in the human proteome it offers near instantaneous mapping between all relevant data types. It also combines data and analyses from a plethora of resources to bring together genomic, protein sequence and function annotations as well as structural insights and predictions to better understand the likely effect of missense variation in humans. It is offered as an intuitive web server https://www.ebi.ac.uk/protvar where data can be explored and downloaded, and can be accessed programmatically via an API.


Subject(s)
Mutation, Missense , Software , Humans , Databases, Protein , Molecular Sequence Annotation , Proteome/genetics , Proteins/genetics , Proteins/chemistry , Internet , Genomics/methods
5.
Mol Syst Biol ; 20(6): 651-675, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38702390

ABSTRACT

The physical interactome of a protein can be altered upon perturbation, modulating cell physiology and contributing to disease. Identifying interactome differences of normal and disease states of proteins could help understand disease mechanisms, but current methods do not pinpoint structure-specific PPIs and interaction interfaces proteome-wide. We used limited proteolysis-mass spectrometry (LiP-MS) to screen for structure-specific PPIs by probing for protease susceptibility changes of proteins in cellular extracts upon treatment with specific structural states of a protein. We first demonstrated that LiP-MS detects well-characterized PPIs, including antibody-target protein interactions and interactions with membrane proteins, and that it pinpoints interfaces, including epitopes. We then applied the approach to study conformation-specific interactors of the Parkinson's disease hallmark protein alpha-synuclein (aSyn). We identified known interactors of aSyn monomer and amyloid fibrils and provide a resource of novel putative conformation-specific aSyn interactors for validation in further studies. We also used our approach on GDP- and GTP-bound forms of two Rab GTPases, showing detection of differential candidate interactors of conformationally similar proteins. This approach is applicable to screen for structure-specific interactomes of any protein, including posttranslationally modified and unmodified, or metabolite-bound and unbound protein states.


Subject(s)
alpha-Synuclein , Humans , alpha-Synuclein/metabolism , alpha-Synuclein/chemistry , Protein Interaction Mapping , Mass Spectrometry , Protein Binding , Proteolysis , Parkinson Disease/metabolism , rab GTP-Binding Proteins/metabolism , Protein Interaction Maps , Protein Conformation , Amyloid/metabolism , Amyloid/chemistry , Proteome/metabolism
6.
Article in English | MEDLINE | ID: mdl-38621234

ABSTRACT

The last five years have seen impressive progress in deep learning models applied to protein research. Most notably, sequence-based structure predictions have seen transformative gains in the form of AlphaFold2 and related approaches. Millions of missense protein variants in the human population lack annotations, and these computational methods are a valuable means to prioritize variants for further analysis. Here, we review the recent progress in deep learning models applied to the prediction of protein structure and protein variants, with particular emphasis on their implications for human genetics and health. Improved prediction of protein structures facilitates annotations of the impact of variants on protein stability, protein-protein interaction interfaces, and small-molecule binding pockets. Moreover, it contributes to the study of host-pathogen interactions and the characterization of protein function. As genome sequencing in large cohorts becomes increasingly prevalent, we believe that better integration of state-of-the-art protein informatics technologies into human genetics research is of paramount importance.

7.
Cancer Cell ; 42(2): 301-316.e9, 2024 02 12.
Article in English | MEDLINE | ID: mdl-38215750

ABSTRACT

Genetic screens in cancer cell lines inform gene function and drug discovery. More comprehensive screen datasets with multi-omics data are needed to enhance opportunities to functionally map genetic vulnerabilities. Here, we construct a second-generation map of cancer dependencies by annotating 930 cancer cell lines with multi-omic data and analyze relationships between molecular markers and cancer dependencies derived from CRISPR-Cas9 screens. We identify dependency-associated gene expression markers beyond driver genes, and observe many gene addiction relationships driven by gain of function rather than synthetic lethal effects. By combining clinically informed dependency-marker associations with protein-protein interaction networks, we identify 370 anti-cancer priority targets for 27 cancer types, many of which have network-based evidence of a functional link with a marker in a cancer type. Mapping these targets to sequenced tumor cohorts identifies tractable targets in different cancer types. This target prioritization map enhances understanding of gene dependencies and identifies candidate anti-cancer targets for drug development.


Subject(s)
Genetic Testing , Neoplasms , Humans , Phenotype , Drug Discovery , Neoplasms/genetics , Neoplasms/pathology , Cell Line, Tumor , CRISPR-Cas Systems
8.
Cancer Cell ; 42(2): 209-224.e9, 2024 02 12.
Article in English | MEDLINE | ID: mdl-38215748

ABSTRACT

Although immunotherapy with PD-(L)1 blockade is routine for lung cancer, little is known about acquired resistance. Among 1,201 patients with non-small cell lung cancer (NSCLC) treated with PD-(L)1 blockade, acquired resistance is common, occurring in >60% of initial responders. Acquired resistance shows differential expression of inflammation and interferon (IFN) signaling. Relapsed tumors can be separated by upregulated or stable expression of IFNγ response genes. Upregulation of IFNγ response genes is associated with putative routes of resistance characterized by signatures of persistent IFN signaling, immune dysfunction, and mutations in antigen presentation genes which can be recapitulated in multiple murine models of acquired resistance to PD-(L)1 blockade after in vitro IFNγ treatment. Acquired resistance to PD-(L)1 blockade in NSCLC is associated with an ongoing, but altered IFN response. The persistently inflamed, rather than excluded or deserted, tumor microenvironment of acquired resistance may inform therapeutic strategies to effectively reprogram and reverse acquired resistance.


Subject(s)
Carcinoma, Non-Small-Cell Lung , Lung Neoplasms , Humans , Animals , Mice , Carcinoma, Non-Small-Cell Lung/drug therapy , Carcinoma, Non-Small-Cell Lung/genetics , Lung Neoplasms/drug therapy , Lung Neoplasms/genetics , Signal Transduction , Immunotherapy , Antigen Presentation , B7-H1 Antigen/metabolism , Tumor Microenvironment
9.
Mol Syst Biol ; 20(3): 162-169, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38291232

ABSTRACT

Proteins are the key molecular machines that orchestrate all biological processes of the cell. Most proteins fold into three-dimensional shapes that are critical for their function. Studying the 3D shape of proteins can inform us of the mechanisms that underlie biological processes in living cells and can have practical applications in the study of disease mutations or the discovery of novel drug treatments. Here, we review the progress made in sequence-based prediction of protein structures with a focus on applications that go beyond the prediction of single monomer structures. This includes the application of deep learning methods for the prediction of structures of protein complexes, different conformations, the evolution of protein structures and the application of these methods to protein design. These developments create new opportunities for research that will have impact across many areas of biomedical research.


Subject(s)
Deep Learning , Proteins/metabolism , Protein Conformation
10.
bioRxiv ; 2024 Feb 08.
Article in English | MEDLINE | ID: mdl-38076945

ABSTRACT

Translating high-confidence (hc) autism spectrum disorder (ASD) genes into viable treatment targets remains elusive. We constructed a foundational protein-protein interaction (PPI) network in HEK293T cells involving 100 hcASD risk genes, revealing over 1,800 PPIs (87% novel). Interactors, expressed in the human brain and enriched for ASD but not schizophrenia genetic risk, converged on protein complexes involved in neurogenesis, tubulin biology, transcriptional regulation, and chromatin modification. A PPI map of 54 patient-derived missense variants identified differential physical interactions, and we leveraged AlphaFold-Multimer predictions to prioritize direct PPIs and specific variants for interrogation in Xenopus tropicalis and human forebrain organoids. A mutation in the transcription factor FOXP1 led to reconfiguration of DNA binding sites and altered development of deep cortical layer neurons in forebrain organoids. This work offers new insights into molecular mechanisms underlying ASD and describes a powerful platform to develop and test therapeutic strategies for many genetically-defined conditions.

11.
Nature ; 622(7983): 637-645, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37704730

ABSTRACT

Proteins are key to all cellular processes and their structure is important in understanding their function and evolution. Sequence-based predictions of protein structures have increased in accuracy1, and over 214 million predicted structures are available in the AlphaFold database2. However, studying protein structures at this scale requires highly efficient methods. Here, we developed a structural-alignment-based clustering algorithm-Foldseek cluster-that can cluster hundreds of millions of structures. Using this method, we have clustered all of the structures in the AlphaFold database, identifying 2.30 million non-singleton structural clusters, of which 31% lack annotations representing probable previously undescribed structures. Clusters without annotation tend to have few representatives covering only 4% of all proteins in the AlphaFold database. Evolutionary analysis suggests that most clusters are ancient in origin but 4% seem to be species specific, representing lower-quality predictions or examples of de novo gene birth. We also show how structural comparisons can be used to predict domain families and their relationships, identifying examples of remote structural similarity. On the basis of these analyses, we identify several examples of human immune-related proteins with putative remote homology in prokaryotic species, illustrating the value of this resource for studying protein function and evolution across the tree of life.


Subject(s)
Algorithms , Cluster Analysis , Proteins , Structural Homology, Protein , Humans , Databases, Protein , Proteins/chemistry , Proteins/classification , Proteins/metabolism , Sequence Alignment , Molecular Sequence Annotation , Prokaryotic Cells/chemistry , Phylogeny , Species Specificity , Evolution, Molecular
12.
Blood ; 142(24): 2055-2068, 2023 12 14.
Article in English | MEDLINE | ID: mdl-37647632

ABSTRACT

Rare genetic diseases affect millions, and identifying causal DNA variants is essential for patient care. Therefore, it is imperative to estimate the effect of each independent variant and improve their pathogenicity classification. Our study of 140 214 unrelated UK Biobank (UKB) participants found that each of them carries a median of 7 variants previously reported as pathogenic or likely pathogenic. We focused on 967 diagnostic-grade gene (DGG) variants for rare bleeding, thrombotic, and platelet disorders (BTPDs) observed in 12 367 UKB participants. By association analysis, for a subset of these variants, we estimated effect sizes for platelet count and volume, and odds ratios for bleeding and thrombosis. Variants causal of some autosomal recessive platelet disorders revealed phenotypic consequences in carriers. Loss-of-function variants in MPL, which cause chronic amegakaryocytic thrombocytopenia if biallelic, were unexpectedly associated with increased platelet counts in carriers. We also demonstrated that common variants identified by genome-wide association studies (GWAS) for platelet count or thrombosis risk may influence the penetrance of rare variants in BTPD DGGs on their associated hemostasis disorders. Network-propagation analysis applied to an interactome of 18 410 nodes and 571 917 edges showed that GWAS variants with large effect sizes are enriched in DGGs and their first-order interactors. Finally, we illustrate the modifying effect of polygenic scores for platelet count and thrombosis risk on disease severity in participants carrying rare variants in TUBB1 or PROC and PROS1, respectively. Our findings demonstrate the power of association analyses using large population datasets in improving pathogenicity classifications of rare variants.


Subject(s)
Genome-Wide Association Study , Thrombosis , Humans , Biological Specimen Banks , Hemostasis , Hemorrhage/genetics , Rare Diseases
13.
FEBS Lett ; 597(15): 1921-1927, 2023 08.
Article in English | MEDLINE | ID: mdl-37487655

ABSTRACT

The systematic identification of tumour vulnerabilities through perturbational experiments on cancer models, including genome editing and drug screens, is playing a crucial role in combating cancer. This collective effort is known as the Cancer Dependency Map (DepMap). The 1st European Cancer Dependency Map Symposium (EuroDepMap), held in Milan last May, featured talks, a roundtable discussion, and a poster session, showcasing the latest discoveries and future challenges related to the DepMap. The symposium aimed to facilitate interactions among participants across Europe, encourage idea exchange with leading experts, and present their work and future projects. Importantly, it sparked discussions on future endeavours, such as screening more complex cancer models and accounting for tumour evolution.


Subject(s)
Neoplasms , Humans , Neoplasms/genetics , Europe
14.
Commun Biol ; 6(1): 678, 2023 06 29.
Article in English | MEDLINE | ID: mdl-37386082

ABSTRACT

Genome-wide association studies identified several disease-causing mutations in neurodegenerative diseases, including amyotrophic lateral sclerosis (ALS). However, the contribution of genetic variants to pathway disturbances and their cell type-specific variations, especially in glia, is poorly understood. We integrated ALS GWAS-linked gene networks with human astrocyte-specific multi-omics datasets to elucidate pathognomonic signatures. It predicts that KIF5A, a motor protein kinesin-1 heavy-chain isoform, previously detected only in neurons, can also potentiate disease pathways in astrocytes. Using postmortem tissue and super-resolution structured illumination microscopy in cell-based perturbation platforms, we provide evidence that KIF5A is present in astrocyte processes and its deficiency disrupts structural integrity and mitochondrial transport. We show that this may underly cytoskeletal and trafficking changes in SOD1 ALS astrocytes characterised by low KIF5A levels, which can be rescued by c-Jun N-terminal Kinase-1 (JNK1), a kinesin transport regulator. Altogether, our pipeline reveals a mechanism controlling astrocyte process integrity, a pre-requisite for synapse maintenance and suggests a targetable loss-of-function in ALS.


Subject(s)
Amyotrophic Lateral Sclerosis , Proteogenomics , Humans , Amyotrophic Lateral Sclerosis/genetics , Astrocytes , Genome-Wide Association Study , Kinesins/genetics
15.
Genome Biol ; 24(1): 110, 2023 05 09.
Article in English | MEDLINE | ID: mdl-37161576

ABSTRACT

Understanding coding mutations is important for many applications in biology and medicine but the vast mutation space makes comprehensive experimental characterisation impossible. Current predictors are often computationally intensive and difficult to scale, including recent deep learning models. We introduce Sequence UNET, a highly scalable deep learning architecture that classifies and predicts variant frequency from sequence alone using multi-scale representations from a fully convolutional compression/expansion architecture. It achieves comparable pathogenicity prediction to recent methods. We demonstrate scalability by analysing 8.3B variants in 904,134 proteins detected through large-scale proteomics. Sequence UNET runs on modest hardware with a simple Python package.


Subject(s)
Data Compression , Deep Learning , Mutation , Proteomics
16.
Nat Genet ; 55(3): 389-398, 2023 03.
Article in English | MEDLINE | ID: mdl-36823319

ABSTRACT

Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.


Subject(s)
Cell Biology , Cells , Disease , Genetic Association Studies , Genetic Pleiotropy , Genetic Association Studies/methods , Humans , Ubiquitination/genetics , RNA Processing, Post-Transcriptional/genetics , Cells/metabolism , Cells/pathology , Drug Repositioning/methods , Drug Repositioning/trends , Disease/genetics , Inflammatory Bowel Diseases/genetics , Inflammatory Bowel Diseases/pathology , Genome-Wide Association Study , Phenotype , Autoimmune Diseases/genetics , Autoimmune Diseases/pathology
17.
Nat Struct Mol Biol ; 30(2): 216-225, 2023 02.
Article in English | MEDLINE | ID: mdl-36690744

ABSTRACT

Cellular functions are governed by molecular machines that assemble through protein-protein interactions. Their atomic details are critical to studying their molecular mechanisms. However, fewer than 5% of hundreds of thousands of human protein interactions have been structurally characterized. Here we test the potential and limitations of recent progress in deep-learning methods using AlphaFold2 to predict structures for 65,484 human protein interactions. We show that experiments can orthogonally confirm higher-confidence models. We identify 3,137 high-confidence models, of which 1,371 have no homology to a known structure. We identify interface residues harboring disease mutations, suggesting potential mechanisms for pathogenic variants. Groups of interface phosphorylation sites show patterns of co-regulation across conditions, suggestive of coordinated tuning of multiple protein interactions as signaling responses. Finally, we provide examples of how the predicted binary complexes can be used to build larger assemblies helping to expand our understanding of human cell biology.


Subject(s)
Protein Interaction Maps , Signal Transduction , Humans , Mutation , Computational Biology/methods
18.
Mol Syst Biol ; 19(3): e10631, 2023 03 09.
Article in English | MEDLINE | ID: mdl-36688815

ABSTRACT

Genetic alterations in cancer cells trigger oncogenic transformation, a process largely mediated by the dysregulation of kinase and transcription factor (TF) activities. While the mutational profiles of thousands of tumours have been extensively characterised, the measurements of protein activities have been technically limited until recently. We compiled public data of matched genomics and (phospho)proteomics measurements for 1,110 tumours and 77 cell lines that we used to estimate activity changes in 218 kinases and 292 TFs. Co-regulation of kinase and TF activities reflects previously known regulatory relationships and allows us to dissect genetic drivers of signalling changes in cancer. We find that loss-of-function mutations are not often associated with the dysregulation of downstream targets, suggesting frequent compensatory mechanisms. Finally, we identified the activities most differentially regulated in cancer subtypes and showed how these can be linked to differences in patient survival. Our results provide broad insights into the dysregulation of protein activities in cancer and their contribution to disease severity.


Subject(s)
Neoplasms , Humans , Neoplasms/genetics , Signal Transduction , Genomics , Proteomics/methods , Gene Expression Regulation
19.
J Cell Biol ; 222(2)2023 02 06.
Article in English | MEDLINE | ID: mdl-36515990

ABSTRACT

Nuclear pore complexes (NPCs) are large proteinaceous assemblies that mediate nuclear compartmentalization. NPCs undergo large-scale structural rearrangements during mitosis in metazoans and some fungi. However, our understanding of NPC remodeling beyond mitosis remains limited. Using time-lapse fluorescence microscopy, we discovered that NPCs undergo two mechanistically separable remodeling events during budding yeast meiosis in which parts or all of the nuclear basket transiently dissociate from the NPC core during meiosis I and II, respectively. Meiosis I detachment, observed for Nup60 and Nup2, is driven by Polo kinase-mediated phosphorylation of Nup60 at its interface with the Y-complex. Subsequent reattachment of Nup60-Nup2 to the NPC core is facilitated by a lipid-binding amphipathic helix in Nup60. Preventing Nup60-Nup2 reattachment causes misorganization of the entire nuclear basket in gametes. Strikingly, meiotic nuclear basket remodeling also occurs in the distantly related fission yeast, Schizosaccharomyces pombe. Our study reveals a conserved and developmentally programmed aspect of NPC plasticity, providing key mechanistic insights into the nuclear basket organization.


Subject(s)
Nuclear Pore Complex Proteins , Nuclear Pore , Saccharomyces cerevisiae Proteins , Saccharomyces cerevisiae , Meiosis , Mitosis , Nuclear Pore Complex Proteins/genetics , Nuclear Pore Complex Proteins/chemistry , Schizosaccharomyces , Saccharomyces cerevisiae/chemistry , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/genetics
20.
Nat Struct Mol Biol ; 29(11): 1056-1067, 2022 11.
Article in English | MEDLINE | ID: mdl-36344848

ABSTRACT

Most proteins fold into 3D structures that determine how they function and orchestrate the biological processes of the cell. Recent developments in computational methods for protein structure predictions have reached the accuracy of experimentally determined models. Although this has been independently verified, the implementation of these methods across structural-biology applications remains to be tested. Here, we evaluate the use of AlphaFold2 (AF2) predictions in the study of characteristic structural elements; the impact of missense variants; function and ligand binding site predictions; modeling of interactions; and modeling of experimental structural data. For 11 proteomes, an average of 25% additional residues can be confidently modeled when compared with homology modeling, identifying structural features rarely seen in the Protein Data Bank. AF2-based predictions of protein disorder and complexes surpass dedicated tools, and AF2 models can be used across diverse applications equally well compared with experimentally determined structures, when the confidence metrics are critically considered. In summary, we find that these advances are likely to have a transformative impact in structural biology and broader life-science research.


Subject(s)
Computational Biology , Furylfuramide , Computational Biology/methods , Binding Sites , Proteins/chemistry , Databases, Protein , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL
...