Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 1.461
Filtrar
Mais filtros

Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 187(5): 1255-1277.e27, 2024 Feb 29.
Artigo em Inglês | MEDLINE | ID: mdl-38359819

RESUMO

Despite the successes of immunotherapy in cancer treatment over recent decades, less than <10%-20% cancer cases have demonstrated durable responses from immune checkpoint blockade. To enhance the efficacy of immunotherapies, combination therapies suppressing multiple immune evasion mechanisms are increasingly contemplated. To better understand immune cell surveillance and diverse immune evasion responses in tumor tissues, we comprehensively characterized the immune landscape of more than 1,000 tumors across ten different cancers using CPTAC pan-cancer proteogenomic data. We identified seven distinct immune subtypes based on integrative learning of cell type compositions and pathway activities. We then thoroughly categorized unique genomic, epigenetic, transcriptomic, and proteomic changes associated with each subtype. Further leveraging the deep phosphoproteomic data, we studied kinase activities in different immune subtypes, which revealed potential subtype-specific therapeutic targets. Insights from this work will facilitate the development of future immunotherapy strategies and enhance precision targeting with existing agents.


Assuntos
Neoplasias , Proteogenômica , Humanos , Terapia Combinada , Genômica , Neoplasias/genética , Neoplasias/imunologia , Neoplasias/terapia , Proteômica , Evasão Tumoral
2.
Cell ; 184(21): 5482-5496.e28, 2021 10 14.
Artigo em Inglês | MEDLINE | ID: mdl-34597583

RESUMO

Determining how cells vary with their local signaling environment and organize into distinct cellular communities is critical for understanding processes as diverse as development, aging, and cancer. Here we introduce EcoTyper, a machine learning framework for large-scale identification and validation of cell states and multicellular communities from bulk, single-cell, and spatially resolved gene expression data. When applied to 12 major cell lineages across 16 types of human carcinoma, EcoTyper identified 69 transcriptionally defined cell states. Most states were specific to neoplastic tissue, ubiquitous across tumor types, and significantly prognostic. By analyzing cell-state co-occurrence patterns, we discovered ten clinically distinct multicellular communities with unexpectedly strong conservation, including three with myeloid and stromal elements linked to adverse survival, one enriched in normal tissue, and two associated with early cancer development. This study elucidates fundamental units of cellular organization in human carcinoma and provides a framework for large-scale profiling of cellular ecosystems in any tissue.


Assuntos
Neoplasias/patologia , Microambiente Tumoral , Sobrevivência Celular , Regulação Neoplásica da Expressão Gênica , Humanos , Imunoterapia , Inflamação/patologia , Ligantes , Neoplasias/genética , Fenótipo , Prognóstico , Transcrição Gênica
3.
Cell ; 177(2): 463-477.e15, 2019 04 04.
Artigo em Inglês | MEDLINE | ID: mdl-30951672

RESUMO

To develop a map of cell-cell communication mediated by extracellular RNA (exRNA), the NIH Extracellular RNA Communication Consortium created the exRNA Atlas resource (https://exrna-atlas.org). The Atlas version 4P1 hosts 5,309 exRNA-seq and exRNA qPCR profiles from 19 studies and a suite of analysis and visualization tools. To analyze variation between profiles, we apply computational deconvolution. The analysis leads to a model with six exRNA cargo types (CT1, CT2, CT3A, CT3B, CT3C, CT4), each detectable in multiple biofluids (serum, plasma, CSF, saliva, urine). Five of the cargo types associate with known vesicular and non-vesicular (lipoprotein and ribonucleoprotein) exRNA carriers. To validate utility of this model, we re-analyze an exercise response study by deconvolution to identify physiologically relevant response pathways that were not detected previously. To enable wide application of this model, as part of the exRNA Atlas resource, we provide tools for deconvolution and analysis of user-provided case-control studies.


Assuntos
Comunicação Celular/fisiologia , RNA/metabolismo , Adulto , Líquidos Corporais/química , Ácidos Nucleicos Livres/metabolismo , MicroRNA Circulante/metabolismo , Vesículas Extracelulares/metabolismo , Feminino , Humanos , Masculino , Reprodutibilidade dos Testes , Análise de Sequência de RNA/métodos , Software
4.
Cell ; 172(1-2): 358-372.e23, 2018 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-29307493

RESUMO

Metabolite-protein interactions control a variety of cellular processes, thereby playing a major role in maintaining cellular homeostasis. Metabolites comprise the largest fraction of molecules in cells, but our knowledge of the metabolite-protein interactome lags behind our understanding of protein-protein or protein-DNA interactomes. Here, we present a chemoproteomic workflow for the systematic identification of metabolite-protein interactions directly in their native environment. The approach identified a network of known and novel interactions and binding sites in Escherichia coli, and we demonstrated the functional relevance of a number of newly identified interactions. Our data enabled identification of new enzyme-substrate relationships and cases of metabolite-induced remodeling of protein complexes. Our metabolite-protein interactome consists of 1,678 interactions and 7,345 putative binding sites. Our data reveal functional and structural principles of chemical communication, shed light on the prevalence and mechanisms of enzyme promiscuity, and enable extraction of quantitative parameters of metabolite binding on a proteome-wide scale.


Assuntos
Metaboloma , Proteoma/metabolismo , Proteômica/métodos , Transdução de Sinais , Software , Regulação Alostérica , Sítios de Ligação , Escherichia coli , Metabolômica/métodos , Ligação Proteica , Mapas de Interação de Proteínas , Proteoma/química , Saccharomyces cerevisiae , Análise de Sequência de Proteína/métodos
5.
Mol Cell ; 82(9): 1708-1723.e10, 2022 05 05.
Artigo em Inglês | MEDLINE | ID: mdl-35320755

RESUMO

7SK is a conserved noncoding RNA that regulates transcription by sequestering the transcription factor P-TEFb. 7SK function entails complex changes in RNA structure, but characterizing RNA dynamics in cells remains an unsolved challenge. We developed a single-molecule chemical probing strategy, DANCE-MaP (deconvolution and annotation of ribonucleic conformational ensembles), that defines per-nucleotide reactivity, direct base pairing interactions, tertiary interactions, and thermodynamic populations for each state in RNA structural ensembles from a single experiment. DANCE-MaP reveals that 7SK RNA encodes a large-scale structural switch that couples dissolution of the P-TEFb binding site to structural remodeling at distal release factor binding sites. The 7SK structural equilibrium shifts in response to cell growth and stress and can be targeted to modulate expression of P-TEFbresponsive genes. Our study reveals that RNA structural dynamics underlie 7SK function as an integrator of diverse cellular signals to control transcription and establishes the power of DANCE-MaP to define RNA dynamics in cells.


Assuntos
Fator B de Elongação Transcricional Positiva , Proteínas de Ligação a RNA , Sítios de Ligação/genética , Células HeLa , Humanos , Fator B de Elongação Transcricional Positiva/genética , RNA Nuclear Pequeno/genética , RNA não Traduzido , Proteínas de Ligação a RNA/genética
6.
Am J Hum Genet ; 111(2): 323-337, 2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38306997

RESUMO

Genome-wide association studies (GWASs) have uncovered susceptibility loci associated with psychiatric disorders such as bipolar disorder (BP) and schizophrenia (SCZ). However, most of these loci are in non-coding regions of the genome, and the causal mechanisms of the link between genetic variation and disease risk is unknown. Expression quantitative trait locus (eQTL) analysis of bulk tissue is a common approach used for deciphering underlying mechanisms, although this can obscure cell-type-specific signals and thus mask trait-relevant mechanisms. Although single-cell sequencing can be prohibitively expensive in large cohorts, computationally inferred cell-type proportions and cell-type gene expression estimates have the potential to overcome these problems and advance mechanistic studies. Using bulk RNA-seq from 1,730 samples derived from whole blood in a cohort ascertained from individuals with BP and SCZ, this study estimated cell-type proportions and their relation with disease status and medication. For each cell type, we found between 2,875 and 4,629 eGenes (genes with an associated eQTL), including 1,211 that are not found on the basis of bulk expression alone. We performed a colocalization test between cell-type eQTLs and various traits and identified hundreds of associations that occur between cell-type eQTLs and GWASs but that are not detected in bulk eQTLs. Finally, we investigated the effects of lithium use on the regulation of cell-type expression loci and found examples of genes that are differentially regulated according to lithium use. Our study suggests that applying computational methods to large bulk RNA-seq datasets of non-brain tissue can identify disease-relevant, cell-type-specific biology of psychiatric disorders and psychiatric medication.


Assuntos
Estudo de Associação Genômica Ampla , Lítio , Humanos , Estudo de Associação Genômica Ampla/métodos , RNA-Seq , Locos de Características Quantitativas/genética , Fenótipo , Polimorfismo de Nucleotídeo Único , Predisposição Genética para Doença
7.
Proc Natl Acad Sci U S A ; 121(29): e2313851121, 2024 Jul 16.
Artigo em Inglês | MEDLINE | ID: mdl-38976734

RESUMO

Mass spectrometry-based omics technologies are increasingly used in perturbation studies to map drug effects to biological pathways by identifying significant molecular events. Significance is influenced by fold change and variation of each molecular parameter, but also by multiple testing corrections. While the fold change is largely determined by the biological system, the variation is determined by experimental workflows. Here, it is shown that memory effects of prior subculture can influence the variation of perturbation profiles using the two colon carcinoma cell lines SW480 and HCT116. These memory effects are largely driven by differences in growth states that persist into the perturbation experiment. In SW480 cells, memory effects combined with moderate treatment effects amplify the variation in multiple omics levels, including eicosadomics, proteomics, and phosphoproteomics. With stronger treatment effects, the memory effect was less pronounced, as demonstrated in HCT116 cells. Subculture homogeneity was controlled by real-time monitoring of cell growth. Controlled homogeneous subculture resulted in a perturbation network of 321 causal conjectures based on combined proteomic and phosphoproteomic data, compared to only 58 causal conjectures without controlling subculture homogeneity in SW480 cells. Some cellular responses and regulatory events were identified that extend the mode of action of arsenic trioxide (ATO) only when accounting for these memory effects. Controlled prior subculture led to the finding of a synergistic combination treatment of ATO with the thioredoxin reductase 1 inhibitor auranofin, which may prove useful in the management of NRF2-mediated resistance mechanisms.


Assuntos
Proteômica , Humanos , Proteômica/métodos , Linhagem Celular Tumoral , Células HCT116 , Técnicas de Cultura de Células/métodos , Neoplasias do Colo/metabolismo , Neoplasias do Colo/tratamento farmacológico , Neoplasias do Colo/patologia , Trióxido de Arsênio/farmacologia , Auranofina/farmacologia , Proliferação de Células/efeitos dos fármacos , Espectrometria de Massas/métodos
8.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38762790

RESUMO

In this review, we provide a comprehensive overview of the different computational tools that have been published for the deconvolution of bulk DNA methylation (DNAm) data. Here, deconvolution refers to the estimation of cell-type proportions that constitute a mixed sample. The paper reviews and compares 25 deconvolution methods (supervised, unsupervised or hybrid) developed between 2012 and 2023 and compares the strengths and limitations of each approach. Moreover, in this study, we describe the impact of the platform used for the generation of methylation data (including microarrays and sequencing), the applied data pre-processing steps and the used reference dataset on the deconvolution performance. Next to reference-based methods, we also examine methods that require only partial reference datasets or require no reference set at all. In this review, we provide guidelines for the use of specific methods dependent on the DNA methylation data type and data availability.


Assuntos
Biologia Computacional , Metilação de DNA , Humanos , Biologia Computacional/métodos , DNA/genética , Algoritmos
9.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38557675

RESUMO

Spatial transcriptomics (ST) data have emerged as a pivotal approach to comprehending the function and interplay of cells within intricate tissues. Nonetheless, analyses of ST data are restricted by the low spatial resolution and limited number of ribonucleic acid transcripts that can be detected with several popular ST techniques. In this study, we propose that both of the above issues can be significantly improved by introducing a deep graph co-embedding framework. First, we establish a self-supervised, co-graph convolution network-based deep learning model termed SpatialcoGCN, which leverages single-cell data to deconvolve the cell mixtures in spatial data. Evaluations of SpatialcoGCN on a series of simulated ST data and real ST datasets from human ductal carcinoma in situ, developing human heart and mouse brain suggest that SpatialcoGCN could outperform other state-of-the-art cell type deconvolution methods in estimating per-spot cell composition. Moreover, with competitive accuracy, SpatialcoGCN could also recover the spatial distribution of transcripts that are not detected by raw ST data. With a similar co-embedding framework, we further established a spatial information-aware ST data simulation method, SpatialcoGCN-Sim. SpatialcoGCN-Sim could generate simulated ST data with high similarity to real datasets. Together, our approaches provide efficient tools for studying the spatial organization of heterogeneous cells within complex tissues.


Assuntos
Perfilação da Expressão Gênica , RNA , Humanos , Animais , Camundongos , Simulação por Computador , Transcriptoma
10.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38982642

RESUMO

Inferring cell type proportions from bulk transcriptome data is crucial in immunology and oncology. Here, we introduce guided LDA deconvolution (GLDADec), a bulk deconvolution method that guides topics using cell type-specific marker gene names to estimate topic distributions for each sample. Through benchmarking using blood-derived datasets, we demonstrate its high estimation performance and robustness. Moreover, we apply GLDADec to heterogeneous tissue bulk data and perform comprehensive cell type analysis in a data-driven manner. We show that GLDADec outperforms existing methods in estimation performance and evaluate its biological interpretability by examining enrichment of biological processes for topics. Finally, we apply GLDADec to The Cancer Genome Atlas tumor samples, enabling subtype stratification and survival analysis based on estimated cell type proportions, thus proving its practical utility in clinical settings. This approach, utilizing marker gene names as partial prior information, can be applied to various scenarios for bulk data deconvolution. GLDADec is available as an open-source Python package at https://github.com/mizuno-group/GLDADec.


Assuntos
Software , Humanos , Perfilação da Expressão Gênica/métodos , Algoritmos , Transcriptoma , Biologia Computacional/métodos , Neoplasias/genética , Biomarcadores Tumorais/genética , Marcadores Genéticos
11.
Proc Natl Acad Sci U S A ; 120(51): e2300474120, 2023 Dec 19.
Artigo em Inglês | MEDLINE | ID: mdl-38100417

RESUMO

Seasonal influenza results in 3 to 5 million cases of severe disease and 250,000 to 500,000 deaths annually. Macrophages have been implicated in both the resolution and progression of the disease, but the drivers of these outcomes are poorly understood. We probed mouse lung transcriptomic datasets using the Digital Cell Quantifier algorithm to predict immune cell subsets that correlated with mild or severe influenza A virus (IAV) infection outcomes. We identified a unique lung macrophage population that transcriptionally resembled small serosal cavity macrophages and whose presence correlated with mild disease. Until now, the study of serosal macrophage translocation in the context of viral infections has been neglected. Here, we show that pleural macrophages (PMs) migrate from the pleural cavity to the lung after infection with IAV. We found that the depletion of PMs increased morbidity and pulmonary inflammation. There were increased proinflammatory cytokines in the pleural cavity and an influx of neutrophils within the lung. Our results show that PMs are recruited to the lung during IAV infection and contribute to recovery from influenza. This study expands our knowledge of PM plasticity and identifies a source of lung macrophages independent of monocyte recruitment and local proliferation.


Assuntos
Vírus da Influenza A , Influenza Humana , Infecções por Orthomyxoviridae , Animais , Camundongos , Humanos , Influenza Humana/genética , Pulmão , Macrófagos , Macrófagos Alveolares
12.
Proc Natl Acad Sci U S A ; 120(28): e2305236120, 2023 07 11.
Artigo em Inglês | MEDLINE | ID: mdl-37399400

RESUMO

Plasma cell-free DNA (cfDNA) is a noninvasive biomarker for cell death of all organs. Deciphering the tissue origin of cfDNA can reveal abnormal cell death because of diseases, which has great clinical potential in disease detection and monitoring. Despite the great promise, the sensitive and accurate quantification of tissue-derived cfDNA remains challenging to existing methods due to the limited characterization of tissue methylation and the reliance on unsupervised methods. To fully exploit the clinical potential of tissue-derived cfDNA, here we present one of the largest comprehensive and high-resolution methylation atlas based on 521 noncancer tissue samples spanning 29 major types of human tissues. We systematically identified fragment-level tissue-specific methylation patterns and extensively validated them in orthogonal datasets. Based on the rich tissue methylation atlas, we develop the first supervised tissue deconvolution approach, a deep-learning-powered model, cfSort, for sensitive and accurate tissue deconvolution in cfDNA. On the benchmarking data, cfSort showed superior sensitivity and accuracy compared to the existing methods. We further demonstrated the clinical utilities of cfSort with two potential applications: aiding disease diagnosis and monitoring treatment side effects. The tissue-derived cfDNA fraction estimated from cfSort reflected the clinical outcomes of the patients. In summary, the tissue methylation atlas and cfSort enhanced the performance of tissue deconvolution in cfDNA, thus facilitating cfDNA-based disease detection and longitudinal treatment monitoring.


Assuntos
Ácidos Nucleicos Livres , Aprendizado Profundo , Humanos , Ácidos Nucleicos Livres/genética , Metilação de DNA , Biomarcadores , Regiões Promotoras Genéticas , Biomarcadores Tumorais/genética
13.
J Biol Chem ; 300(6): 107317, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38677514

RESUMO

It has become increasingly evident that the structures RNAs adopt are conformationally dynamic; the various structured states that RNAs sample govern their interactions with other nucleic acids, proteins, and ligands to regulate a myriad of biological processes. Although several biophysical approaches have been developed and used to study the dynamic landscape of structured RNAs, technical limitations have limited their application to all classes of RNA due to variable size and flexibility. Recent advances combining chemical probing experiments with next-generation- and direct sequencing have emerged as an alternative approach to exploring the conformational dynamics of RNA. In this review, we provide a methodological overview of the sequencing-based techniques used to study RNA conformational dynamics. We discuss how different techniques have enabled us to better understand the propensity of RNAs from a variety of different classes to sample multiple conformational states. Finally, we present examples of the ways these techniques have reshaped how we think about RNA structure.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Conformação de Ácido Nucleico , RNA , RNA/química , RNA/metabolismo , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Nanoporos , Humanos , Análise de Sequência de RNA/métodos
14.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36472568

RESUMO

Accounting for cell type compositions has been very successful at analyzing high-throughput data from heterogeneous tissues. Differential gene expression analysis at cell type level is becoming increasingly popular, yielding biomarker discovery in a finer granularity within a particular cell type. Although several computational methods have been developed to identify cell type-specific differentially expressed genes (csDEG) from RNA-seq data, a systematic evaluation is yet to be performed. Here, we thoroughly benchmark six recently published methods: CellDMC, CARseq, TOAST, LRCDE, CeDAR and TCA, together with two classical methods, csSAM and DESeq2, for a comprehensive comparison. We aim to systematically evaluate the performance of popular csDEG detection methods and provide guidance to researchers. In simulation studies, we benchmark available methods under various scenarios of baseline expression levels, sample sizes, cell type compositions, expression level alterations, technical noises and biological dispersions. Real data analyses of three large datasets on inflammatory bowel disease, lung cancer and autism provide evaluation in both the gene level and the pathway level. We find that csDEG calling is strongly affected by effect size, baseline expression level and cell type compositions. Results imply that csDEG discovery is a challenging task itself, with room to improvements on handling low signal-to-noise ratio and low expression genes.


Assuntos
Perfilação da Expressão Gênica , Software , Perfilação da Expressão Gênica/métodos , RNA-Seq , Simulação por Computador , Razão Sinal-Ruído , Análise de Sequência de RNA/métodos
15.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36631398

RESUMO

Computational cell type deconvolution on bulk transcriptomics data can reveal cell type proportion heterogeneity across samples. One critical factor for accurate deconvolution is the reference signature matrix for different cell types. Compared with inferring reference signature matrices from cell lines, rapidly accumulating single-cell RNA-sequencing (scRNA-seq) data provide a richer and less biased resource. However, deriving cell type signature from scRNA-seq data is challenging due to high biological and technical noises. In this article, we introduce a novel Bayesian framework, tranSig, to improve signature matrix inference from scRNA-seq by leveraging shared cell type-specific expression patterns across different tissues and studies. Our simulations show that tranSig is robust to the number of signature genes and tissues specified in the model. Applications of tranSig to bulk RNA sequencing data from peripheral blood, bronchoalveolar lavage and aorta demonstrate its accuracy and power to characterize biological heterogeneity across groups. In summary, tranSig offers an accurate and robust approach to defining gene expression signatures of different cell types, facilitating improved in silico cell type deconvolutions.


Assuntos
Perfilação da Expressão Gênica , Análise de Célula Única , Teorema de Bayes , Transcriptoma , Análise de Sequência de RNA
16.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37183449

RESUMO

Undoubtedly, single-cell RNA sequencing (scRNA-seq) has changed the research landscape by providing insights into heterogeneous, complex and rare cell populations. Given that more such data sets will become available in the near future, their accurate assessment with compatible and robust models for cell type annotation is a prerequisite. Considering this, herein, we developed scAnno (scRNA-seq data annotation), an automated annotation tool for scRNA-seq data sets primarily based on the single-cell cluster levels, using a joint deconvolution strategy and logistic regression. We explicitly constructed a reference profile for human (30 cell types and 50 human tissues) and a reference profile for mouse (26 cell types and 50 mouse tissues) to support this novel methodology (scAnno). scAnno offers a possibility to obtain genes with high expression and specificity in a given cell type as cell type-specific genes (marker genes) by combining co-expression genes with seed genes as a core. Of importance, scAnno can accurately identify cell type-specific genes based on cell type reference expression profiles without any prior information. Particularly, in the peripheral blood mononuclear cell data set, the marker genes identified by scAnno showed cell type-specific expression, and the majority of marker genes matched exactly with those included in the CellMarker database. Besides validating the flexibility and interpretability of scAnno in identifying marker genes, we also proved its superiority in cell type annotation over other cell type annotation tools (SingleR, scPred, CHETAH and scmap-cluster) through internal validation of data sets (average annotation accuracy: 99.05%) and cross-platform data sets (average annotation accuracy: 95.56%). Taken together, we established the first novel methodology that utilizes a deconvolution strategy for automated cell typing and is capable of being a significant application in broader scRNA-seq analysis. scAnno is available at https://github.com/liuhong-jia/scAnno.


Assuntos
Algoritmos , Software , Animais , Camundongos , Humanos , Perfilação da Expressão Gênica/métodos , Leucócitos Mononucleares , Análise de Célula Única/métodos , RNA/genética , Análise de Sequência de RNA/métodos
17.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38180831

RESUMO

The enzyme-linked immunosorbent spot (ELISpot) assay is a powerful in vitro immunoassay that enables cost-effective quantification of antigen-specific T-cell reactivity. It is used widely in the context of cancer and infectious diseases to validate the immunogenicity of predicted epitopes. While technological advances have kept pace with the demand for increased throughput, efforts to increase scale are bottlenecked by current assay design and deconvolution methods, which have remained largely unchanged. Current methods for designing pooled ELISpot experiments offer limited flexibility of assay parameters, lack support for high-throughput scenarios and do not consider peptide identity during pool assignment. We introduce the ACE Configurator for ELISpot (ACE) to address these gaps. ACE generates optimized peptide-pool assignments from highly customizable user inputs and handles the deconvolution of positive peptides using assay readouts. In this study, we present a novel sequence-aware pooling strategy, powered by a fine-tuned ESM-2 model that groups immunologically similar peptides, reducing the number of false positives and subsequent confirmatory assays compared to existing combinatorial approaches. To validate ACE's performance on real-world datasets, we conducted a comprehensive benchmark study, contextualizing design choices with their impact on prediction quality. Our results demonstrate ACE's capacity to further increase precision of identified immunogenic peptides, directly optimizing experimental efficiency. ACE is freely available as an executable with a graphical user interface and command-line interfaces at https://github.com/pirl-unc/ace.


Assuntos
Benchmarking , Imunoadsorventes , Epitopos , Peptídeos
18.
Brief Bioinform ; 24(2)2023 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-36653909

RESUMO

DNA-methylation alterations are common in cancer and display unique characteristics that make them ideal markers for tumor quantification and classification. Here we present MIMESIS, a computational framework exploiting minimal DNA-methylation signatures composed by a few dozen informative DNA-methylation sites to quantify and classify tumor signals in tissue and cell-free DNA samples. Extensive analyses of multiple independent and heterogenous datasets including >7200 samples demonstrate the capability of MIMESIS to provide precise estimations of tumor content and to enable accurate classification of tumor type and molecular subtype. To assess our framework for clinical applications, we designed a MIMESIS-informed assay incorporating the minimal signatures for breast cancer. Using both artificial samples and clinical serial cell-free DNA samples from patients with metastatic breast cancer, we show that our approach provides accurate estimations of tumor content, sensitive detection of tumor signal and the ability to capture clinically relevant molecular subtype in patients' circulation. This study provides evidence that our extremely parsimonious approach can be used to develop cost-effective and highly scalable DNA-methylation assays that could support and facilitate the implementation of precision oncology in clinical practice.


Assuntos
Neoplasias da Mama , Ácidos Nucleicos Livres , Humanos , Feminino , Ácidos Nucleicos Livres/genética , Medicina de Precisão , Metilação de DNA , Neoplasias da Mama/genética , Biomarcadores Tumorais/genética , DNA de Neoplasias/genética
19.
Proc Natl Acad Sci U S A ; 119(14): e2111786119, 2022 04 05.
Artigo em Inglês | MEDLINE | ID: mdl-35363567

RESUMO

The advent of increasingly sophisticated imaging platforms has allowed for the visualization of the murine nervous system at single-cell resolution. However, current experimental approaches have not yet produced whole-brain maps of a comprehensive set of neuronal and nonneuronal types that approaches the cellular diversity of the mammalian cortex. Here, we aim to fill in this gap in knowledge with an open-source computational pipeline, Matrix Inversion and Subset Selection (MISS), that can infer quantitatively validated distributions of diverse collections of neural cell types at 200-µm resolution using a combination of single-cell RNA sequencing (RNAseq) and in situ hybridization datasets. We rigorously demonstrate the accuracy of MISS against literature expectations. Importantly, we show that gene subset selection, a procedure by which we filter out low-information genes prior to performing deconvolution, is a critical preprocessing step that distinguishes MISS from its predecessors and facilitates the production of cell-type maps with significantly higher accuracy. We also show that MISS is generalizable by generating high-quality cell-type maps from a second independently curated single-cell RNAseq dataset. Together, our results illustrate the viability of computational approaches for determining the spatial distributions of a wide variety of cell types from genetic data alone.


Assuntos
Mapeamento Encefálico , Encéfalo , Neurônios , Animais , Encéfalo/citologia , Mapeamento Encefálico/métodos , Camundongos , Neurônios/classificação , Neurônios/metabolismo , RNA-Seq , Análise de Célula Única
20.
Proc Natl Acad Sci U S A ; 119(26): e2119101119, 2022 06 28.
Artigo em Inglês | MEDLINE | ID: mdl-35749363

RESUMO

Cryoelectron tomography of the cell nucleus using scanning transmission electron microscopy and deconvolution processing technology has highlighted a large-scale, 100- to 300-nm interphase chromosome structure, which is present throughout the nucleus. This study further documents and analyzes these chromosome structures. The paper is divided into four parts: 1) evidence (preliminary) for a unified interphase chromosome structure; 2) a proposed unified interphase chromosome architecture; 3) organization as chromosome territories (e.g., fitting the 46 human chromosomes into a 10-µm-diameter nucleus); and 4) structure unification into a polytene chromosome architecture and lampbrush chromosomes. Finally, the paper concludes with a living light microscopy cell study showing that the G1 nucleus contains very similar structures throughout. The main finding is that this chromosome structure appears to coil the 11-nm nucleosome fiber into a defined hollow structure, analogous to a Slinky helical spring [https://en.wikipedia.org/wiki/Slinky; motif used in Bowerman et al., eLife 10, e65587 (2021)]. This Slinky architecture can be used to build chromosome territories, extended to the polytene chromosome structure, as well as to the structure of lampbrush chromosomes.


Assuntos
Núcleo Celular , Cromossomos Humanos , Interfase , Núcleo Celular/genética , Cromatina/genética , Cromossomos Humanos/química , Humanos , Interfase/genética , Nucleossomos/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA