Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
1.
Cell ; 186(3): 591-606.e23, 2023 02 02.
Artículo en Inglés | MEDLINE | ID: mdl-36669483

RESUMEN

Dysregulation of the immune system is a cardinal feature of opioid addiction. Here, we characterize the landscape of peripheral immune cells from patients with opioid use disorder and from healthy controls. Opioid-associated blood exhibited an abnormal distribution of immune cells characterized by a significant expansion of fragile-like regulatory T cells (Tregs), which was positively correlated with the withdrawal score. Analogously, opioid-treated mice also showed enhanced Treg-derived interferon-γ (IFN-γ) expression. IFN-γ signaling reshaped synaptic morphology in nucleus accumbens (NAc) neurons, modulating subsequent withdrawal symptoms. We demonstrate that opioids increase the expression of neuron-derived C-C motif chemokine ligand 2 (Ccl2) and disrupted blood-brain barrier (BBB) integrity through the downregulation of astrocyte-derived fatty-acid-binding protein 7 (Fabp7), which both triggered peripheral Treg infiltration into NAc. Our study demonstrates that opioids drive the expansion of fragile-like Tregs and favor peripheral Treg diapedesis across the BBB, which leads to IFN-γ-mediated synaptic instability and subsequent withdrawal symptoms.


Asunto(s)
Interferón gamma , Trastornos Relacionados con Opioides , Síndrome de Abstinencia a Sustancias , Linfocitos T Reguladores , Animales , Ratones , Analgésicos Opioides/administración & dosificación , Interferón gamma/metabolismo , Trastornos Relacionados con Opioides/metabolismo , Trastornos Relacionados con Opioides/patología
2.
Brief Bioinform ; 24(4)2023 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-37204192

RESUMEN

Accurately predicting the antigen-binding specificity of adaptive immune receptors (AIRs), such as T-cell receptors (TCRs) and B-cell receptors (BCRs), is essential for discovering new immune therapies. However, the diversity of AIR chain sequences limits the accuracy of current prediction methods. This study introduces SC-AIR-BERT, a pre-trained model that learns comprehensive sequence representations of paired AIR chains to improve binding specificity prediction. SC-AIR-BERT first learns the 'language' of AIR sequences through self-supervised pre-training on a large cohort of paired AIR chains from multiple single-cell resources. The model is then fine-tuned with a multilayer perceptron head for binding specificity prediction, employing the K-mer strategy to enhance sequence representation learning. Extensive experiments demonstrate the superior AUC performance of SC-AIR-BERT compared with current methods for TCR- and BCR-binding specificity prediction.


Asunto(s)
Receptores de Antígenos de Linfocitos B , Receptores de Antígenos de Linfocitos T , Humanos , Receptores de Antígenos de Linfocitos T/genética , Receptores de Antígenos de Linfocitos B/genética , Redes Neurales de la Computación , Especificidad de Anticuerpos
3.
Brief Bioinform ; 24(3)2023 05 19.
Artículo en Inglés | MEDLINE | ID: mdl-36946415

RESUMEN

Colorectal cancer (CRC) is one of the most common gastrointestinal malignancies. There are few recurrence risk signatures for CRC patients. Single-cell RNA-sequencing (scRNA-seq) provides a high-resolution platform for prognostic signature detection. However, scRNA-seq is not practical in large cohorts due to its high cost and most single-cell experiments lack clinical phenotype information. Few studies have been reported to use external bulk transcriptome with survival time to guide the detection of key cell subtypes in scRNA-seq data. We proposed scRankXMBD, a computational framework to prioritize prognostic-associated cell subpopulations based on within-cell relative expression orderings of gene pairs from single-cell transcriptomes. scRankXMBD achieves higher precision and concordance compared with five existing methods. Moreover, we developed single-cell gene pair signatures to predict recurrence risk for patients individually. Our work facilitates the application of the rank-based method in scRNA-seq data for prognostic biomarker discovery and precision oncology. scRankXMBD is available at https://github.com/xmuyulab/scRank-XMBD. (XMBD:Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.).


Asunto(s)
Neoplasias Colorrectales , Transcriptoma , Humanos , Perfilación de la Expresión Génica/métodos , Pronóstico , Medicina de Precisión , Programas Informáticos , Neoplasias Colorrectales/genética , Análisis de Secuencia de ARN
4.
Brief Bioinform ; 23(3)2022 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-35368072

RESUMEN

Liquid chromatography-mass spectrometry-based quantitative proteomics can measure the expression of thousands of proteins from biological samples and has been increasingly applied in cancer research. Identifying differentially expressed proteins (DEPs) between tumors and normal controls is commonly used to investigate carcinogenesis mechanisms. While differential expression analysis (DEA) at an individual level is desired to identify patient-specific molecular defects for better patient stratification, most statistical DEP analysis methods only identify deregulated proteins at the population level. To date, robust individualized DEA algorithms have been proposed for ribonucleic acid data, but their performance on proteomics data is underexplored. Herein, we performed a systematic evaluation on five individualized DEA algorithms for proteins on cancer proteomic datasets from seven cancer types. Results show that the within-sample relative expression orderings (REOs) of protein pairs in normal tissues were highly stable, providing the basis for individualized DEA for proteins using REOs. Moreover, individualized DEA algorithms achieve higher precision in detecting sample-specific deregulated proteins than population-level methods. To facilitate the utilization of individualized DEA algorithms in proteomics for prognostic biomarker discovery and personalized medicine, we provide Individualized DEP Analysis IDEPAXMBD (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (https://github.com/xmuyulab/IDEPA-XMBD), which is a user-friendly and open-source Python toolkit that integrates individualized DEA algorithms for DEP-associated deregulation pattern recognition.


Asunto(s)
Neoplasias , Proteoma , Humanos , Espectrometría de Masas/métodos , Neoplasias/genética , Proteoma/análisis , Proteómica/métodos , Programas Informáticos
5.
BMC Bioinformatics ; 24(1): 387, 2023 Oct 12.
Artículo en Inglés | MEDLINE | ID: mdl-37821827

RESUMEN

BACKGROUND: Metagenomic sequencing is an unbiased approach that can potentially detect all the known and unidentified strains in pathogen detection. Recently, nanopore sequencing has been emerging as a highly potential tool for rapid pathogen detection due to its fast turnaround time. However, identifying pathogen within species is nontrivial for nanopore sequencing data due to the high sequencing error rate. RESULTS: We developed the core gene alleles metagenome strain identification (cgMSI) tool, which uses a two-stage maximum a posteriori probability estimation method to detect pathogens at strain level from nanopore metagenomic sequencing data at low computational cost. The cgMSI tool can accurately identify strains and estimate relative abundance at 1× coverage. CONCLUSIONS: We developed cgMSI for nanopore metagenomic pathogen detection within species. cgMSI is available at https://github.com/ZHU-XU-xmu/cgMSI .


Asunto(s)
Secuenciación de Nanoporos , Nanoporos , Metagenoma , Alelos , Metagenómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
6.
Brief Bioinform ; 22(5)2021 09 02.
Artículo en Inglés | MEDLINE | ID: mdl-33822895

RESUMEN

Metagenomics data provide rich information for the detection of foodborne pathogens from food and environmental samples that are mixed with complex background bacteria strains. While pathogen detection from metagenomic sequencing data has become an activity of increasing interest, shotgun sequencing of uncultured food samples typically produces data that contain reads from many different organisms, making accurate strain typing a challenging task. Particularly, as many pathogens may contain a common set of genes that are highly similar to those from normal bacteria in food samples, traditional strain-level abundance profiling approaches do not perform well at detecting pathogens of very low abundance levels. To overcome this limitation, we propose an abundance correction method based on species-specific genomic regions to achieve high sensitivity and high specificity in target pathogen detection at low abundance.


Asunto(s)
Bacterias/genética , Bacterias/patogenicidad , Infecciones Bacterianas/diagnóstico , Enfermedades Transmitidas por los Alimentos/microbiología , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Metagenoma , Metagenómica/métodos , Secuenciación Completa del Genoma/métodos , Infecciones Bacterianas/microbiología , Exactitud de los Datos , Genoma Bacteriano , Humanos , Sensibilidad y Especificidad , Especificidad de la Especie
7.
Brief Bioinform ; 22(3)2021 05 20.
Artículo en Inglés | MEDLINE | ID: mdl-34020539

RESUMEN

With the development of single-cell RNA sequencing (scRNA-seq) technology, it has become possible to perform large-scale transcript profiling for tens of thousands of cells in a single experiment. Many analysis pipelines have been developed for data generated from different high-throughput scRNA-seq platforms, bringing a new challenge to users to choose a proper workflow that is efficient, robust and reliable for a specific sequencing platform. Moreover, as the amount of public scRNA-seq data has increased rapidly, integrated analysis of scRNA-seq data from different sources has become increasingly popular. However, it remains unclear whether such integrated analysis would be biassed if the data were processed by different upstream pipelines. In this study, we encapsulated seven existing high-throughput scRNA-seq data processing pipelines with Nextflow, a general integrative workflow management framework, and evaluated their performance in terms of running time, computational resource consumption and data analysis consistency using eight public datasets generated from five different high-throughput scRNA-seq platforms. Our work provides a useful guideline for the selection of scRNA-seq data processing pipelines based on their performance on different real datasets. In addition, these guidelines can serve as a performance evaluation framework for future developments in high-throughput scRNA-seq data processing.


Asunto(s)
Algoritmos , Bases de Datos de Ácidos Nucleicos , RNA-Seq , ARN , Análisis de la Célula Individual , Células 3T3 , Animales , Células HEK293 , Humanos , Ratones , ARN/biosíntesis , ARN/genética
8.
Bioinformatics ; 37(2): 265-267, 2021 Apr 19.
Artículo en Inglés | MEDLINE | ID: mdl-33416868

RESUMEN

SUMMARY: Currently, various software tools are used to support two mainstream workflows for data-independent acquisition (DIA) mass spectrometry (MS) data processing, namely, spectrum-centric scoring (SCS) and peptide-centric scoring (PCS). However, a fully automatic, easily reproducible and freely accessible pipeline that simultaneously integrates SCS and PCS strategies and supports both library-free and library-based modes is absent. We developed Diamond, a Nextflow-based, containerized, multi-modal DIA-MS data processing pipeline for peptide identification and quantification. Diamond integrated two mainstream workflows for DIA data analysis, namely, SCS and PCS, for use cases both with and without assay libraries. This multi-modal pipeline serves as a versatile, easy-to-use and easily extendable toolbox for large-scale DIA data processing. AVAILABILITY: Diamond is hosted on GitHub (https://github.com/xmuyulab/Diamond) and is released under the highly permissive MIT license to encourage further customization and modification. The Docker image for Diamond is freely accessible at https://hub.docker.com/r/zeroli/diamond.

9.
J Biomed Inform ; 130: 104093, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35537690

RESUMEN

The random noises, sampling biases, and batch effects often confound true biological variations in single-cell RNA-sequencing (scRNA-seq) data. Adjusting such biases is key to the robust discoveries in downstream analyses, such as cell clustering, gene selection and data integration. Here we propose a model-based downsampling algorithm based on minimal unbiased representative points (MURPXMBD). MURPXMBD is designed to retrieve a set of representative points by reducing gene-wise random independent errors, while retaining the covariance structure of biological origin hence provide an unbiased representation of the cell population. Subsequent validation using benchmark datasets shows that MURPXMBD can improve the quality and accuracy of clustering algorithms, and thus facilitate the discovery of new cell types. Besides, MURPXMBD also improves the performance of dataset integration algorithms. In summary, MURPXMBD serves as a useful noise-reduction method for single-cell sequencing analysis in biomedical studies.


Asunto(s)
Análisis de la Célula Individual , Transcriptoma , Algoritmos , Análisis por Conglomerados , Perfilación de la Expresión Génica/métodos , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos
10.
Bioinformatics ; 36(17): 4551-4559, 2020 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-32458976

RESUMEN

MOTIVATION: Per-base quality values in Next Generation Sequencing data take a significant portion of storage even after compression. Lossy compression technologies could further reduce the space used by quality values. However, in many applications, lossless compression is still desired. Hence, sequencing data in multiple file formats have to be prepared for different applications. RESULTS: We developed a scalable lossy to lossless compression solution for quality values named ScaleQC (Scalable Quality value Compression). ScaleQC is able to provide the so-called bit-stream level scalability that the losslessly compressed bit-stream by ScaleQC can be further truncated to lower data rates without incurring an expensive transcoding operation. Despite its scalability, ScaleQC still achieves comparable compression performance at both lossless and lossy data rates compared to the existing lossless or lossy compressors. AVAILABILITY AND IMPLEMENTATION: ScaleQC has been integrated with SAMtools as a special quality value encoding mode for CRAM. Its source codes can be obtained from our integrated SAMtools (https://github.com/xmuyulab/samtools) with dependency on integrated HTSlib (https://github.com/xmuyulab/htslib). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Compresión de Datos , Algoritmos , Secuenciación de Nucleótidos de Alto Rendimiento , Programas Informáticos
11.
BMC Bioinformatics ; 21(1): 321, 2020 Jul 20.
Artículo en Inglés | MEDLINE | ID: mdl-32689929

RESUMEN

BACKGROUND: Recent advancements in high-throughput sequencing technologies have generated an unprecedented amount of genomic data that must be stored, processed, and transmitted over the network for sharing. Lossy genomic data compression, especially of the base quality values of sequencing data, is emerging as an efficient way to handle this challenge due to its superior compression performance compared to lossless compression methods. Many lossy compression algorithms have been developed for and evaluated using DNA sequencing data. However, whether these algorithms can be used on RNA sequencing (RNA-seq) data remains unclear. RESULTS: In this study, we evaluated the impacts of lossy quality value compression on common RNA-seq data analysis pipelines including expression quantification, transcriptome assembly, and short variants detection using RNA-seq data from different species and sequencing platforms. Our study shows that lossy quality value compression could effectively improve RNA-seq data compression. In some cases, lossy algorithms achieved up to 1.2-3 times further reduction on the overall RNA-seq data size compared to existing lossless algorithms. However, lossy quality value compression could affect the results of some RNA-seq data processing pipelines, and hence its impacts to RNA-seq studies cannot be ignored in some cases. Pipelines using HISAT2 for alignment were most significantly affected by lossy quality value compression, while the effects of lossy compression on pipelines that do not depend on quality values, e.g., STAR-based expression quantification and transcriptome assembly pipelines, were not observed. Moreover, regardless of using either STAR or HISAT2 as the aligner, variant detection results were affected by lossy quality value compression, albeit to a lesser extent when STAR-based pipeline was used. Our results also show that the impacts of lossy quality value compression depend on the compression algorithms being used and the compression levels if the algorithm supports setting of multiple compression levels. CONCLUSIONS: Lossy quality value compression can be incorporated into existing RNA-seq analysis pipelines to alleviate the data storage and transmission burdens. However, care should be taken on the selection of compression tools and levels based on the requirements of the downstream analysis pipelines to avoid introducing undesirable adverse effects on the analysis results.


Asunto(s)
Algoritmos , Compresión de Datos/métodos , Compresión de Datos/normas , Genómica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ARN/métodos , Secuencia de Bases , Perfilación de la Expresión Génica , Genoma Humano , Humanos
12.
Neural Netw ; 172: 106151, 2024 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-38301339

RESUMEN

Representation learning on temporal interaction graphs (TIG) aims to model complex networks with the dynamic evolution of interactions on a wide range of web and social graph applications. However, most existing works on TIG either (a) rely on discretely updated node embeddings merely when an interaction occurs that fail to capture the continuous evolution of embedding trajectories of nodes, or (b) overlook the rich temporal patterns hidden in the ever-changing graph data that presumably lead to sub-optimal models. In this paper, we propose a two-module framework named ConTIG, a novel representation learning method on TIG that captures the continuous dynamic evolution of node embedding trajectories. With two essential modules, our model exploits three-fold factors in dynamic networks including latest interaction, neighbor features, and inherent characteristics. In the first update module, we employ a continuous inference block to learn the nodes' state trajectories from time-adjacent interaction patterns using ordinary differential equations. In the second transform module, we introduce a self-attention mechanism to predict future node embeddings by aggregating historical temporal interaction information. Experiment results demonstrate the superiority of ConTIG on temporal link prediction, temporal node recommendation, and dynamic node classification tasks of four datasets compared with a range of state-of-the-art baselines, especially for long-interval interaction prediction.


Asunto(s)
Aprendizaje Automático
13.
Cell Rep Methods ; 4(6): 100797, 2024 Jun 17.
Artículo en Inglés | MEDLINE | ID: mdl-38889685

RESUMEN

Cancer of unknown primary (CUP) represents metastatic cancer where the primary site remains unidentified despite standard diagnostic procedures. To determine the tumor origin in such cases, we developed BPformer, a deep learning method integrating the transformer model with prior knowledge of biological pathways. Trained on transcriptomes from 10,410 primary tumors across 32 cancer types, BPformer achieved remarkable accuracy rates of 94%, 92%, and 89% in primary tumors and primary and metastatic sites of metastatic tumors, respectively, surpassing existing methods. Additionally, BPformer was validated in a retrospective study, demonstrating consistency with tumor sites diagnosed through immunohistochemistry and histopathology. Furthermore, BPformer was able to rank pathways based on their contribution to tumor origin identification, which helped to classify oncogenic signaling pathways into those that are highly conservative among different cancers versus those that are highly variable depending on their origins.


Asunto(s)
Neoplasias Primarias Desconocidas , Humanos , Neoplasias Primarias Desconocidas/genética , Neoplasias Primarias Desconocidas/patología , Neoplasias Primarias Desconocidas/metabolismo , Neoplasias Primarias Desconocidas/diagnóstico , Transducción de Señal/genética , Transcriptoma , Aprendizaje Profundo , Estudios Retrospectivos
14.
IEEE Trans Med Imaging ; 42(5): 1337-1348, 2023 05.
Artículo en Inglés | MEDLINE | ID: mdl-37015475

RESUMEN

Multi-instance learning (MIL) is widely adop- ted for automatic whole slide image (WSI) analysis and it usually consists of two stages, i.e., instance feature extraction and feature aggregation. However, due to the "weak supervision" of slide-level labels, the feature aggregation stage would suffer from severe over-fitting in training an effective MIL model. In this case, mining more information from limited slide-level data is pivotal to WSI analysis. Different from previous works on improving instance feature extraction, this paper investigates how to exploit the latent relationship of different instances (patches) to combat overfitting in MIL for more generalizable WSI classification. In particular, we propose a novel Multi-instance Rein- forcement Contrastive Learning framework (MuRCL) to deeply mine the inherent semantic relationships of different patches to advance WSI classification. Specifically, the proposed framework is first trained in a self-supervised manner and then finetuned with WSI slide-level labels. We formulate the first stage as a contrastive learning (CL) process, where positive/negative discriminative feature sets are constructed from the same patch-level feature bags of WSIs. To facilitate the CL training, we design a novel reinforcement learning-based agent to progressively update the selection of discriminative feature sets according to an online reward for slide-level feature aggregation. Then, we further update the model with labeled WSI data to regularize the learned features for the final WSI classification. Experimental results on three public WSI classification datasets (Camelyon16, TCGA-Lung and TCGA-Kidney) demonstrate that the proposed MuRCL outperforms state-of-the-art MIL models. In addition, MuRCL can achieve comparable performance to other state-of-the-art MIL models on TCGA-Esca dataset.


Asunto(s)
Procesamiento de Imagen Asistido por Computador , Aprendizaje Automático Supervisado , Humanos , Conjuntos de Datos como Asunto , Pulmón/diagnóstico por imagen , Riñón/diagnóstico por imagen
15.
IEEE Trans Med Imaging ; 42(8): 2462-2473, 2023 08.
Artículo en Inglés | MEDLINE | ID: mdl-37028064

RESUMEN

Cancer survival prediction requires exploiting related multimodal information (e.g., pathological, clinical and genomic features, etc.) and it is even more challenging in clinical practices due to the incompleteness of patient's multimodal data. Furthermore, existing methods lack sufficient intra- and inter-modal interactions, and suffer from significant performance degradation caused by missing modalities. This manuscript proposes a novel hybrid graph convolutional network, entitled HGCN, which is equipped with an online masked autoencoder paradigm for robust multimodal cancer survival prediction. Particularly, we pioneer modeling the patient's multimodal data into flexible and interpretable multimodal graphs with modality-specific preprocessing. HGCN integrates the advantages of graph convolutional networks (GCNs) and a hypergraph convolutional network (HCN) through node message passing and a hyperedge mixing mechanism to facilitate intra-modal and inter-modal interactions between multimodal graphs. With HGCN, the potential for multimodal data to create more reliable predictions of patient's survival risk is dramatically increased compared to prior methods. Most importantly, to compensate for missing patient modalities in clinical scenarios, we incorporated an online masked autoencoder paradigm into HGCN, which can effectively capture intrinsic dependence between modalities and seamlessly generate missing hyperedges for model inference. Extensive experiments and analysis on six cancer cohorts from TCGA show that our method significantly outperforms the state-of-the-arts in both complete and missing modal settings. Our codes are made available at https://github.com/lin-lcx/HGCN.


Asunto(s)
Genómica , Neoplasias , Humanos , Neoplasias/diagnóstico por imagen
16.
Research (Wash D C) ; 6: 0179, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37377457

RESUMEN

Data-independent acquisition (DIA) technology for protein identification from mass spectrometry and related algorithms is developing rapidly. The spectrum-centric analysis of DIA data without the use of spectra library from data-dependent acquisition data represents a promising direction. In this paper, we proposed an untargeted analysis method, Dear-DIAXMBD, for direct analysis of DIA data. Dear-DIAXMBD first integrates the deep variational autoencoder and triplet loss to learn the representations of the extracted fragment ion chromatograms, then uses the k-means clustering algorithm to aggregate fragments with similar representations into the same classes, and finally establishes the inverted index tables to determine the precursors of fragment clusters between precursors and peptides and between fragments and peptides. We show that Dear-DIAXMBD performs superiorly with the highly complicated DIA data of different species obtained by different instrument platforms. Dear-DIAXMBD is publicly available at https://github.com/jianweishuai/Dear-DIA-XMBD.

17.
Patterns (N Y) ; 3(5): 100509, 2022 May 13.
Artículo en Inglés | MEDLINE | ID: mdl-35607625

RESUMEN

There is an increasing risk of people using advanced artificial intelligence, particularly the generative adversarial network (GAN), for scientific image manipulation for the purpose of publications. We demonstrated this possibility by using GAN to fabricate several different types of biomedical images and discuss possible ways for the detection and prevention of such scientific misconducts in research communities.

18.
Commun Med (Lond) ; 2: 131, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-36281356

RESUMEN

Background: Single-cell technologies have enabled extensive analysis of complex immune composition, phenotype and interactions within tumor, which is crucial in understanding the mechanisms behind cancer progression and treatment resistance. Unfortunately, knowledge on cell phenotypes and their spatial interactions has only had limited impact on the pathological stratification of patients in the clinic so far. We explore the relationship between different tumor environments (TMEs) and response to immunotherapy by deciphering the composition and spatial relationships of different cell types. Methods: Here we used imaging mass cytometry to simultaneously quantify 35 proteins in a spatially resolved manner on tumor tissues from 26 melanoma patients receiving anti-programmed cell death-1 (anti-PD-1) therapy. Using unsupervised clustering, we profiled 662,266 single cells to identify lymphocytes, myeloid derived monocytes, stromal and tumor cells, and characterized TME of different melanomas. Results: Combined single-cell and spatial analysis reveals highly dynamic TMEs that are characterized with variable tumor and immune cell phenotypes and their spatial organizations in melanomas, and many of these multicellular features are associated with response to anti-PD-1 therapy. We further identify six distinct TME archetypes based on their multicellular compositions, and find that patients with different TME archetypes responded differently to anti-PD-1 therapy. Finally, we find that classifying patients based on the gene expression signature derived from TME archetypes predicts anti-PD-1 therapy response across multiple validation cohorts. Conclusions: Our results demonstrate the utility of multiplex proteomic imaging technologies in studying complex molecular events in a spatially resolved manner for the development of new strategies for patient stratification and treatment outcome prediction.

19.
Patterns (N Y) ; 3(3): 100440, 2022 Mar 11.
Artículo en Inglés | MEDLINE | ID: mdl-35510186

RESUMEN

Understanding the immune cell abundance of cancer and other disease-related tissues has an important role in guiding disease treatments. Computational cell type proportion estimation methods have been previously developed to derive such information from bulk RNA sequencing data. Unfortunately, our results show that the performance of these methods can be seriously plagued by the mismatch between training data and real-world data. To tackle this issue, we propose the DAISM-DNNXMBD (XMBD: Xiamen Big Data, a biomedical open software initiative in the National Institute for Data Science in Health and Medicine, Xiamen University, China.) (denoted as DAISM-DNN) pipeline that trains a deep neural network (DNN) with dataset-specific training data populated from a certain amount of calibrated samples using DAISM, a novel data augmentation method with an in silico mixing strategy. The evaluation results demonstrate that the DAISM-DNN pipeline outperforms other existing methods consistently and substantially for all the cell types under evaluation in real-world datasets.

20.
STAR Protoc ; 3(3): 101587, 2022 09 16.
Artículo en Inglés | MEDLINE | ID: mdl-35942344

RESUMEN

Computational protocols for cell type deconvolution from bulk RNA-seq data have been used to understand cellular heterogeneity in disease-related samples, but their performance can be impacted by batch effect among datasets. Here, we present a DAISM-DNN protocol to achieve robust cell type proportion estimation on the target dataset. We describe the preparation of calibrated samples from human blood samples. We then detail steps to train a dataset-specific deep neural network (DNN) model and cell type proportion estimation using the trained model. For complete details on the use and execution of this protocol, please refer to Lin et al. (2022).


Asunto(s)
Redes Neurales de la Computación , Humanos , RNA-Seq
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA