Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 1.768
Filter
Add more filters

Publication year range
1.
Cell ; 187(16): 4389-4407.e15, 2024 Aug 08.
Article in English | MEDLINE | ID: mdl-38917788

ABSTRACT

Fewer than 200 proteins are targeted by cancer drugs approved by the Food and Drug Administration (FDA). We integrate Clinical Proteomic Tumor Analysis Consortium (CPTAC) proteogenomics data from 1,043 patients across 10 cancer types with additional public datasets to identify potential therapeutic targets. Pan-cancer analysis of 2,863 druggable proteins reveals a wide abundance range and identifies biological factors that affect mRNA-protein correlation. Integration of proteomic data from tumors and genetic screen data from cell lines identifies protein overexpression- or hyperactivation-driven druggable dependencies, enabling accurate predictions of effective drug targets. Proteogenomic identification of synthetic lethality provides a strategy to target tumor suppressor gene loss. Combining proteogenomic analysis and MHC binding prediction prioritizes mutant KRAS peptides as promising public neoantigens. Computational identification of shared tumor-associated antigens followed by experimental confirmation nominates peptides as immunotherapy targets. These analyses, summarized at https://targets.linkedomics.org, form a comprehensive landscape of protein and peptide targets for companion diagnostics, drug repurposing, and therapy development.


Subject(s)
Neoplasms , Proteogenomics , Humans , Proteogenomics/methods , Neoplasms/genetics , Neoplasms/drug therapy , Neoplasms/therapy , Neoplasms/metabolism , Molecular Targeted Therapy , Immunotherapy/methods , Antigens, Neoplasm/metabolism , Antigens, Neoplasm/genetics , Cell Line, Tumor , Antineoplastic Agents/therapeutic use , Antineoplastic Agents/pharmacology , Peptides/metabolism , Proteomics , Proto-Oncogene Proteins p21(ras)/genetics , Proto-Oncogene Proteins p21(ras)/metabolism
2.
Cell ; 186(26): 5876-5891.e20, 2023 12 21.
Article in English | MEDLINE | ID: mdl-38134877

ABSTRACT

Harmonizing cell types across the single-cell community and assembling them into a common framework is central to building a standardized Human Cell Atlas. Here, we present CellHint, a predictive clustering tree-based tool to resolve cell-type differences in annotation resolution and technical biases across datasets. CellHint accurately quantifies cell-cell transcriptomic similarities and places cell types into a relationship graph that hierarchically defines shared and unique cell subtypes. Application to multiple immune datasets recapitulates expert-curated annotations. CellHint also reveals underexplored relationships between healthy and diseased lung cell states in eight diseases. Furthermore, we present a workflow for fast cross-dataset integration guided by harmonized cell types and cell hierarchy, which uncovers underappreciated cell types in adult human hippocampus. Finally, we apply CellHint to 12 tissues from 38 datasets, providing a deeply curated cross-tissue database with ∼3.7 million cells and various machine learning models for automatic cell annotation across human tissues.


Subject(s)
Gene Expression Profiling , Transcriptome , Humans , Databases, Factual , Single-Cell Analysis
3.
Cell ; 184(18): 4819-4837.e22, 2021 09 02.
Article in English | MEDLINE | ID: mdl-34380046

ABSTRACT

Animal bodies are composed of cell types with unique expression programs that implement their distinct locations, shapes, structures, and functions. Based on these properties, cell types assemble into specific tissues and organs. To systematically explore the link between cell-type-specific gene expression and morphology, we registered an expression atlas to a whole-body electron microscopy volume of the nereid Platynereis dumerilii. Automated segmentation of cells and nuclei identifies major cell classes and establishes a link between gene activation, chromatin topography, and nuclear size. Clustering of segmented cells according to gene expression reveals spatially coherent tissues. In the brain, genetically defined groups of neurons match ganglionic nuclei with coherent projections. Besides interneurons, we uncover sensory-neurosecretory cells in the nereid mushroom bodies, which thus qualify as sensory organs. They furthermore resemble the vertebrate telencephalon by molecular anatomy. We provide an integrated browser as a Fiji plugin for remote exploration of all available multimodal datasets.


Subject(s)
Cell Shape , Gene Expression Regulation , Polychaeta/cytology , Polychaeta/genetics , Single-Cell Analysis , Animals , Cell Nucleus/metabolism , Ganglia, Invertebrate/metabolism , Gene Expression Profiling , Multigene Family , Multimodal Imaging , Mushroom Bodies/metabolism , Polychaeta/ultrastructure
4.
Cell ; 181(2): 236-249, 2020 04 16.
Article in English | MEDLINE | ID: mdl-32302568

ABSTRACT

Crucial transitions in cancer-including tumor initiation, local expansion, metastasis, and therapeutic resistance-involve complex interactions between cells within the dynamic tumor ecosystem. Transformative single-cell genomics technologies and spatial multiplex in situ methods now provide an opportunity to interrogate this complexity at unprecedented resolution. The Human Tumor Atlas Network (HTAN), part of the National Cancer Institute (NCI) Cancer Moonshot Initiative, will establish a clinical, experimental, computational, and organizational framework to generate informative and accessible three-dimensional atlases of cancer transitions for a diverse set of tumor types. This effort complements both ongoing efforts to map healthy organs and previous large-scale cancer genomics approaches focused on bulk sequencing at a single point in time. Generating single-cell, multiparametric, longitudinal atlases and integrating them with clinical outcomes should help identify novel predictive biomarkers and features as well as therapeutically relevant cell types, cell states, and cellular interactions across transitions. The resulting tumor atlases should have a profound impact on our understanding of cancer biology and have the potential to improve cancer detection, prevention, and therapeutic discovery for better precision-medicine treatments of cancer patients and those at risk for cancer.


Subject(s)
Cell Transformation, Neoplastic/metabolism , Neoplasms/metabolism , Tumor Microenvironment/physiology , Atlases as Topic , Cell Transformation, Neoplastic/pathology , Genomics/methods , Humans , Precision Medicine/methods , Single-Cell Analysis/methods
5.
Cell ; 177(7): 1873-1887.e17, 2019 06 13.
Article in English | MEDLINE | ID: mdl-31178122

ABSTRACT

Defining cell types requires integrating diverse single-cell measurements from multiple experiments and biological contexts. To flexibly model single-cell datasets, we developed LIGER, an algorithm that delineates shared and dataset-specific features of cell identity. We applied it to four diverse and challenging analyses of human and mouse brain cells. First, we defined region-specific and sexually dimorphic gene expression in the mouse bed nucleus of the stria terminalis. Second, we analyzed expression in the human substantia nigra, comparing cell states in specific donors and relating cell types to those in the mouse. Third, we integrated in situ and single-cell expression data to spatially locate fine subtypes of cells present in the mouse frontal cortex. Finally, we jointly defined mouse cortical cell types using single-cell RNA-seq and DNA methylation profiles, revealing putative mechanisms of cell-type-specific epigenomic regulation. Integrative analyses using LIGER promise to accelerate investigations of cell-type definition, gene regulation, and disease states.


Subject(s)
DNA Methylation , Gene Expression Regulation , Septal Nuclei , Sequence Analysis, RNA , Single-Cell Analysis , Substantia Nigra , Adolescent , Adult , Aged , Animals , Female , Humans , Male , Mice , Middle Aged , Septal Nuclei/cytology , Septal Nuclei/metabolism , Substantia Nigra/cytology , Substantia Nigra/metabolism
6.
Trends Biochem Sci ; 49(7): 559-563, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38670884

ABSTRACT

In January 2024, a targeted conference, 'CellVis2', was held at Scripps Research in La Jolla, USA, the second in a series designed to explore the promise, practices, roadblocks, and prospects of creating, visualizing, sharing, and communicating physical representations of entire biological cells at scales down to the atom.

7.
Proc Natl Acad Sci U S A ; 121(37): e2316256121, 2024 Sep 10.
Article in English | MEDLINE | ID: mdl-39226366

ABSTRACT

Trajectory inference methods are essential for analyzing the developmental paths of cells in single-cell sequencing datasets. It provides insights into cellular differentiation, transitions, and lineage hierarchies, helping unravel the dynamic processes underlying development and disease progression. However, many existing tools lack a coherent statistical model and reliable uncertainty quantification, limiting their utility and robustness. In this paper, we introduce VITAE (Variational Inference for Trajectory by AutoEncoder), a statistical approach that integrates a latent hierarchical mixture model with variational autoencoders to infer trajectories. The statistical hierarchical model enhances the interpretability of our framework, while the posterior approximations generated by our variational autoencoder ensure computational efficiency and provide uncertainty quantification of cell projections along trajectories. Specifically, VITAE enables simultaneous trajectory inference and data integration, improving the accuracy of learning a joint trajectory structure in the presence of biological and technical heterogeneity across datasets. We show that VITAE outperforms other state-of-the-art trajectory inference methods on both real and synthetic data under various trajectory topologies. Furthermore, we apply VITAE to jointly analyze three distinct single-cell RNA sequencing datasets of the mouse neocortex, unveiling comprehensive developmental lineages of projection neurons. VITAE effectively reduces batch effects within and across datasets and uncovers finer structures that might be overlooked in individual datasets. Additionally, we showcase VITAE's efficacy in integrative analyses of multiomic datasets with continuous cell population structures.


Subject(s)
Deep Learning , Genomics , Single-Cell Analysis , Single-Cell Analysis/methods , Animals , Mice , Genomics/methods , Sequence Analysis, RNA/methods , Humans
8.
Brief Bioinform ; 25(5)2024 Jul 25.
Article in English | MEDLINE | ID: mdl-39163205

ABSTRACT

Network inference or reconstruction algorithms play an integral role in successfully analyzing and identifying causal relationships between omics hits for detecting dysregulated and altered signaling components in various contexts, encompassing disease states and drug perturbations. However, accurate representation of signaling networks and identification of context-specific interactions within sparse omics datasets in complex interactomes pose significant challenges in integrative approaches. To address these challenges, we present pyPARAGON (PAgeRAnk-flux on Graphlet-guided network for multi-Omic data integratioN), a novel tool that combines network propagation with graphlets. pyPARAGON enhances accuracy and minimizes the inclusion of nonspecific interactions in signaling networks by utilizing network rather than relying on pairwise connections among proteins. Through comprehensive evaluations on benchmark signaling pathways, we demonstrate that pyPARAGON outperforms state-of-the-art approaches in node propagation and edge inference. Furthermore, pyPARAGON exhibits promising performance in discovering cancer driver networks. Notably, we demonstrate its utility in network-based stratification of patient tumors by integrating phosphoproteomic data from 105 breast cancer tumors with the interactome and demonstrating tumor-specific signaling pathways. Overall, pyPARAGON is a novel tool for analyzing and integrating multi-omic data in the context of signaling networks. pyPARAGON is available at https://github.com/netlab-ku/pyPARAGON.


Subject(s)
Algorithms , Signal Transduction , Humans , Computational Biology/methods , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Software , Protein Interaction Maps , Gene Regulatory Networks , Proteomics/methods , Female
9.
Brief Bioinform ; 25(2)2024 Jan 22.
Article in English | MEDLINE | ID: mdl-38261342

ABSTRACT

Accurate identification of cell cycle phases in single-cell RNA-sequencing (scRNA-seq) data is crucial for biomedical research. Many methods have been developed to tackle this challenge, employing diverse approaches to predict cell cycle phases. In this review article, we delve into the standard processes in identifying cell cycle phases within scRNA-seq data and present several representative methods for comparison. To rigorously assess the accuracy of these methods, we propose an error function and employ multiple benchmarking datasets encompassing human and mouse data. Our evaluation results reveal a key finding: the fit between the reference data and the dataset being analyzed profoundly impacts the effectiveness of cell cycle phase identification methods. Therefore, researchers must carefully consider the compatibility between the reference data and their dataset to achieve optimal results. Furthermore, we explore the potential benefits of incorporating benchmarking data with multiple known cell cycle phases into the analysis. Merging such data with the target dataset shows promise in enhancing prediction accuracy. By shedding light on the accuracy and performance of cell cycle phase prediction methods across diverse datasets, this review aims to motivate and guide future methodological advancements. Our findings offer valuable insights for researchers seeking to improve their understanding of cellular dynamics through scRNA-seq analysis, ultimately fostering the development of more robust and widely applicable cell cycle identification methods.


Subject(s)
Benchmarking , Biomedical Research , Humans , Animals , Mice , Cell Cycle , Research Personnel
10.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-38985929

ABSTRACT

Recent advances in sequencing, mass spectrometry, and cytometry technologies have enabled researchers to collect multiple 'omics data types from a single sample. These large datasets have led to a growing consensus that a holistic approach is needed to identify new candidate biomarkers and unveil mechanisms underlying disease etiology, a key to precision medicine. While many reviews and benchmarks have been conducted on unsupervised approaches, their supervised counterparts have received less attention in the literature and no gold standard has emerged yet. In this work, we present a thorough comparison of a selection of six methods, representative of the main families of intermediate integrative approaches (matrix factorization, multiple kernel methods, ensemble learning, and graph-based methods). As non-integrative control, random forest was performed on concatenated and separated data types. Methods were evaluated for classification performance on both simulated and real-world datasets, the latter being carefully selected to cover different medical applications (infectious diseases, oncology, and vaccines) and data modalities. A total of 15 simulation scenarios were designed from the real-world datasets to explore a large and realistic parameter space (e.g. sample size, dimensionality, class imbalance, effect size). On real data, the method comparison showed that integrative approaches performed better or equally well than their non-integrative counterpart. By contrast, DIABLO and the four random forest alternatives outperform the others across the majority of simulation scenarios. The strengths and limitations of these methods are discussed in detail as well as guidelines for future applications.


Subject(s)
Computational Biology , Humans , Computational Biology/methods , Algorithms , Genomics/methods , Genomics/statistics & numerical data , Multiomics
11.
Brief Bioinform ; 25(5)2024 Jul 25.
Article in English | MEDLINE | ID: mdl-39133097

ABSTRACT

Constructing gene regulatory networks is a widely adopted approach for investigating gene regulation, offering diverse applications in biology and medicine. A great deal of research focuses on using time series data or single-cell RNA-sequencing data to infer gene regulatory networks. However, such gene expression data lack either cellular or temporal information. Fortunately, the advent of time-lapse confocal laser microscopy enables biologists to obtain tree-shaped gene expression data of Caenorhabditis elegans, achieving both cellular and temporal resolution. Although such tree-shaped data provide abundant knowledge, they pose challenges like non-pairwise time series, laying the inaccuracy of downstream analysis. To address this issue, a comprehensive framework for data integration and a novel Bayesian approach based on Boolean network with time delay are proposed. The pre-screening process and Markov Chain Monte Carlo algorithm are applied to obtain the parameter estimates. Simulation studies show that our method outperforms existing Boolean network inference algorithms. Leveraging the proposed approach, gene regulatory networks for five subtrees are reconstructed based on the real tree-shaped datatsets of Caenorhabditis elegans, where some gene regulatory relationships confirmed in previous genetic studies are recovered. Also, heterogeneity of regulatory relationships in different cell lineage subtrees is detected. Furthermore, the exploration of potential gene regulatory relationships that bear importance in human diseases is undertaken. All source code is available at the GitHub repository https://github.com/edawu11/BBTD.git.


Subject(s)
Algorithms , Caenorhabditis elegans , Gene Regulatory Networks , Caenorhabditis elegans/genetics , Animals , Bayes Theorem , Computational Biology/methods , Markov Chains , Gene Expression Profiling/methods
12.
Brief Bioinform ; 25(5)2024 Jul 25.
Article in English | MEDLINE | ID: mdl-39082647

ABSTRACT

Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery. Application of STREAM to an Alzheimer's disease dataset and a diffuse small lymphocytic lymphoma dataset reveals its ability to identify TF-enhancer-gene relations associated with pseudotime, as well as key TF-enhancer-gene relations and TF cooperation underlying tumor cells.


Subject(s)
Enhancer Elements, Genetic , Gene Regulatory Networks , RNA-Seq , Single-Cell Analysis , Single-Cell Analysis/methods , Humans , Transcription Factors/metabolism , Transcription Factors/genetics , Chromatin Immunoprecipitation Sequencing , Algorithms , Computational Biology/methods , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , Single-Cell Gene Expression Analysis
13.
Brief Bioinform ; 25(3)2024 Mar 27.
Article in English | MEDLINE | ID: mdl-38678587

ABSTRACT

Deep learning-based multi-omics data integration methods have the capability to reveal the mechanisms of cancer development, discover cancer biomarkers and identify pathogenic targets. However, current methods ignore the potential correlations between samples in integrating multi-omics data. In addition, providing accurate biological explanations still poses significant challenges due to the complexity of deep learning models. Therefore, there is an urgent need for a deep learning-based multi-omics integration method to explore the potential correlations between samples and provide model interpretability. Herein, we propose a novel interpretable multi-omics data integration method (DeepKEGG) for cancer recurrence prediction and biomarker discovery. In DeepKEGG, a biological hierarchical module is designed for local connections of neuron nodes and model interpretability based on the biological relationship between genes/miRNAs and pathways. In addition, a pathway self-attention module is constructed to explore the correlation between different samples and generate the potential pathway feature representation for enhancing the prediction performance of the model. Lastly, an attribution-based feature importance calculation method is utilized to discover biomarkers related to cancer recurrence and provide a biological interpretation of the model. Experimental results demonstrate that DeepKEGG outperforms other state-of-the-art methods in 5-fold cross validation. Furthermore, case studies also indicate that DeepKEGG serves as an effective tool for biomarker discovery. The code is available at https://github.com/lanbiolab/DeepKEGG.


Subject(s)
Biomarkers, Tumor , Deep Learning , Neoplasm Recurrence, Local , Humans , Biomarkers, Tumor/metabolism , Biomarkers, Tumor/genetics , Neoplasm Recurrence, Local/metabolism , Neoplasm Recurrence, Local/genetics , Computational Biology/methods , Neoplasms/genetics , Neoplasms/metabolism , Neoplasms/pathology , Genomics/methods , Multiomics
14.
Brief Bioinform ; 25(2)2024 Jan 22.
Article in English | MEDLINE | ID: mdl-38426322

ABSTRACT

Cancer is a complex and high-mortality disease regulated by multiple factors. Accurate cancer subtyping is crucial for formulating personalized treatment plans and improving patient survival rates. The underlying mechanisms that drive cancer progression can be comprehensively understood by analyzing multi-omics data. However, the high noise levels in omics data often pose challenges in capturing consistent representations and adequately integrating their information. This paper proposed a novel variational autoencoder-based deep learning model, named Deeply Integrating Latent Consistent Representations (DILCR). Firstly, multiple independent variational autoencoders and contrastive loss functions were designed to separate noise from omics data and capture latent consistent representations. Subsequently, an Attention Deep Integration Network was proposed to integrate consistent representations across different omics levels effectively. Additionally, we introduced the Improved Deep Embedded Clustering algorithm to make integrated variable clustering friendly. The effectiveness of DILCR was evaluated using 10 typical cancer datasets from The Cancer Genome Atlas and compared with 14 state-of-the-art integration methods. The results demonstrated that DILCR effectively captures the consistent representations in omics data and outperforms other integration methods in cancer subtyping. In the Kidney Renal Clear Cell Carcinoma case study, cancer subtypes were identified by DILCR with significant biological significance and interpretability.


Subject(s)
Carcinoma, Renal Cell , Kidney Neoplasms , Neoplasms , Humans , Multiomics , Neoplasms/genetics , Carcinoma, Renal Cell/genetics , Algorithms , Cluster Analysis , Kidney Neoplasms/genetics
15.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-38783706

ABSTRACT

RNA Polymerase II (Pol II) transcriptional elongation pausing is an integral part of the dynamic regulation of gene transcription in the genome of metazoans. It plays a pivotal role in many vital biological processes and disease progression. However, experimentally measuring genome-wide Pol II pausing is technically challenging and the precise governing mechanism underlying this process is not fully understood. Here, we develop RP3 (RNA Polymerase II Pausing Prediction), a network regularized logistic regression machine learning method, to predict Pol II pausing events by integrating genome sequence, histone modification, gene expression, chromatin accessibility, and protein-protein interaction data. RP3 can accurately predict Pol II pausing in diverse cellular contexts and unveil the transcription factors that are associated with the Pol II pausing machinery. Furthermore, we utilize a forward feature selection framework to systematically identify the combination of histone modification signals associated with Pol II pausing. RP3 is freely available at https://github.com/AMSSwanglab/RP3.


Subject(s)
Histone Code , RNA Polymerase II , RNA Polymerase II/metabolism , Humans , Transcription Elongation, Genetic , Chromatin/metabolism , Chromatin/genetics , Histones/metabolism , Machine Learning , Animals
16.
Trends Immunol ; 44(5): 329-332, 2023 05.
Article in English | MEDLINE | ID: mdl-36997459

ABSTRACT

Profiling immune responses across several dimensions, including time, patients, molecular features, and tissue sites, can deepen our understanding of immunity as an integrated system. These studies require new analytical approaches to realize their full potential. We highlight recent applications of tensor methods and discuss several future opportunities.


Subject(s)
Communicable Diseases , Immunity , Humans
17.
EMBO Rep ; 25(1): 254-285, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38177910

ABSTRACT

Midbrain dopaminergic neurons (mDANs) control voluntary movement, cognition, and reward behavior under physiological conditions and are implicated in human diseases such as Parkinson's disease (PD). Many transcription factors (TFs) controlling human mDAN differentiation during development have been described, but much of the regulatory landscape remains undefined. Using a tyrosine hydroxylase (TH) human iPSC reporter line, we here generate time series transcriptomic and epigenomic profiles of purified mDANs during differentiation. Integrative analysis predicts novel regulators of mDAN differentiation and super-enhancers are used to identify key TFs. We find LBX1, NHLH1 and NR2F1/2 to promote mDAN differentiation and show that overexpression of either LBX1 or NHLH1 can also improve mDAN specification. A more detailed investigation of TF targets reveals that NHLH1 promotes the induction of neuronal miR-124, LBX1 regulates cholesterol biosynthesis, and NR2F1/2 controls neuronal activity.


Subject(s)
Dopaminergic Neurons , Induced Pluripotent Stem Cells , Humans , Dopaminergic Neurons/metabolism , Multiomics , Mesencephalon , Transcription Factors/genetics , Transcription Factors/metabolism , Induced Pluripotent Stem Cells/metabolism , Cell Differentiation/genetics , Basic Helix-Loop-Helix Transcription Factors/genetics
18.
Mol Cell Proteomics ; 23(8): 100811, 2024 Aug.
Article in English | MEDLINE | ID: mdl-38996918

ABSTRACT

Highly specialized cells are fundamental for the proper functioning of complex organs. Variations in cell-type-specific gene expression and protein composition have been linked to a variety of diseases. Investigation of the distinctive molecular makeup of these cells within tissues is therefore critical in biomedical research. Although several technologies have emerged as valuable tools to address this cellular heterogeneity, most workflows lack sufficient in situ resolution and are associated with high costs and extremely long analysis times. Here, we present a combination of experimental and computational approaches that allows a more comprehensive investigation of molecular heterogeneity within tissues than by either shotgun LC-MS/MS or MALDI imaging alone. We applied our pipeline to the mouse brain, which contains a wide variety of cell types that not only perform unique functions but also exhibit varying sensitivities to insults. We explored the distinct neuronal populations within the hippocampus, a brain region crucial for learning and memory that is involved in various neurological disorders. As an example, we identified the groups of proteins distinguishing the neuronal populations of the dentate gyrus (DG) and the cornu ammonis (CA) in the same brain section. Most of the annotated proteins matched the regional enrichment of their transcripts, thereby validating the method. As the method is highly reproducible, the identification of individual masses through the combination of MALDI-IMS and LC-MS/MS methods can be used for the much faster and more precise interpretation of MALDI-IMS measurements only. This greatly speeds up spatial proteomic analyses and allows the detection of local protein variations within the same population of cells. The method's general applicability has the potential to be used to investigate different biological conditions and tissues and a much higher throughput than other techniques making it a promising approach for clinical routine applications.


Subject(s)
Proteomics , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization , Tandem Mass Spectrometry , Animals , Proteomics/methods , Chromatography, Liquid/methods , Tandem Mass Spectrometry/methods , Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization/methods , Mice , Mice, Inbred C57BL , Hippocampus/metabolism , Male , Neurons/metabolism , Brain/metabolism , Dentate Gyrus/metabolism , Liquid Chromatography-Mass Spectrometry
19.
Trends Genet ; 38(2): 128-139, 2022 02.
Article in English | MEDLINE | ID: mdl-34561102

ABSTRACT

A wealth of single-cell protocols makes it possible to characterize different molecular layers at unprecedented resolution. Integrating the resulting multimodal single-cell data to find cell-to-cell correspondences remains a challenge. We argue that data integration needs to happen at a meaningful biological level of abstraction and that it is necessary to consider the inherent discrepancies between modalities to strike a balance between biological discovery and noise removal. A survey of current methods reveals that a distinction between technical and biological origins of presumed unwanted variation between datasets is not yet commonly considered. The increasing availability of paired multimodal data will aid the development of improved methods by providing a ground truth on cell-to-cell matches.

20.
Am J Hum Genet ; 109(6): 1153-1174, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35659930

ABSTRACT

BRCA1 is a high-risk susceptibility gene for breast and ovarian cancer. Pathogenic protein-truncating variants are scattered across the open reading frame, but all known missense substitutions that are pathogenic because of missense dysfunction are located in either the amino-terminal RING domain or the carboxy-terminal BRCT domain. Heterodimerization of the BRCA1 and BARD1 RING domains is a molecularly defined obligate activity. Hence, we tested every BRCA1 RING domain missense substitution that can be created by a single nucleotide change for heterodimerization with BARD1 in a mammalian two-hybrid assay. Downstream of the laboratory assay, we addressed three additional challenges: assay calibration, validation thereof, and integration of the calibrated results with other available data, such as computational evidence and patient/population observational data to achieve clinically applicable classification. Overall, we found that 15%-20% of BRCA1 RING domain missense substitutions are pathogenic. Using a Bayesian point system for data integration and variant classification, we achieved clinical classification of 89% of observed missense substitutions. Moreover, among missense substitutions not present in the human observational data used here, we find an additional 45 with concordant computational and functional assay evidence in favor of pathogenicity plus 223 with concordant evidence in favor of benignity; these are particularly likely to be classified as likely pathogenic and likely benign, respectively, once human observational data become available.


Subject(s)
Breast Neoplasms , Ovarian Neoplasms , Animals , BRCA1 Protein/genetics , Bayes Theorem , Breast Neoplasms/genetics , Female , Humans , Mammals , Mutation, Missense/genetics , Ovarian Neoplasms/genetics , Protein Domains
SELECTION OF CITATIONS
SEARCH DETAIL