Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26.263
Filtrar
1.
BMC Genom Data ; 25(1): 45, 2024 May 07.
Artículo en Inglés | MEDLINE | ID: mdl-38714942

RESUMEN

OBJECTIVES: Cellular deconvolution is a valuable computational process that can infer the cellular composition of heterogeneous tissue samples from bulk RNA-sequencing data. Benchmark testing is a crucial step in the development and evaluation of new cellular deconvolution algorithms, and also plays a key role in the process of building and optimizing deconvolution pipelines for specific experimental applications. However, few in vivo benchmarking datasets exist, particularly for whole blood, which is the single most profiled human tissue. Here, we describe a unique dataset containing whole blood gene expression profiles and matched circulating leukocyte counts from a large cohort of human donors with utility for benchmarking cellular deconvolution pipelines. DATA DESCRIPTION: To produce this dataset, venous whole blood was sampled from 138 total donors recruited at an academic medical center. Genome-wide expression profiling was subsequently performed via next-generation RNA sequencing, and white blood cell differentials were collected in parallel using flow cytometry. The resultant final dataset contains donor-level expression data for over 45,000 protein coding and non-protein coding genes, as well as matched neutrophil, lymphocyte, monocyte, and eosinophil counts.


Asunto(s)
Benchmarking , Humanos , Recuento de Leucocitos , Perfilación de la Expresión Génica/métodos , Transcriptoma , Análisis de Secuencia de ARN/métodos , Leucocitos/metabolismo , Secuenciación de Nucleótidos de Alto Rendimiento , Algoritmos
2.
Front Immunol ; 15: 1310376, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38720887

RESUMEN

Introduction: Hypopharyngeal squamous cell carcinoma (HSCC) is one of the malignant tumors with the worst prognosis in head and neck cancers. The transformation from normal tissue through low-grade and high-grade intraepithelial neoplasia to cancerous tissue in HSCC is typically viewed as a progressive pathological sequence typical of tumorigenesis. Nonetheless, the alterations in diverse cell clusters within the tissue microenvironment (TME) throughout tumorigenesis and their impact on the development of HSCC are yet to be fully understood. Methods: We employed single-cell RNA sequencing and TCR/BCR sequencing to sequence 60,854 cells from nine tissue samples representing different stages during the progression of HSCC. This allowed us to construct dynamic transcriptomic maps of cells in diverse TME across various disease stages, and experimentally validated the key molecules within it. Results: We delineated the heterogeneity among tumor cells, immune cells (including T cells, B cells, and myeloid cells), and stromal cells (such as fibroblasts and endothelial cells) during the tumorigenesis of HSCC. We uncovered the alterations in function and state of distinct cell clusters at different stages of tumor development and identified specific clusters closely associated with the tumorigenesis of HSCC. Consequently, we discovered molecules like MAGEA3 and MMP3, pivotal for the diagnosis and treatment of HSCC. Discussion: Our research sheds light on the dynamic alterations within the TME during the tumorigenesis of HSCC, which will help to understand its mechanism of canceration, identify early diagnostic markers, and discover new therapeutic targets.


Asunto(s)
Neoplasias Hipofaríngeas , Análisis de la Célula Individual , Microambiente Tumoral , Humanos , Neoplasias Hipofaríngeas/genética , Neoplasias Hipofaríngeas/patología , Neoplasias Hipofaríngeas/inmunología , Microambiente Tumoral/inmunología , Microambiente Tumoral/genética , Receptores de Antígenos de Linfocitos T/genética , Receptores de Antígenos de Linfocitos T/metabolismo , Receptores de Antígenos de Linfocitos B/genética , Receptores de Antígenos de Linfocitos B/metabolismo , Carcinogénesis/genética , Análisis de Secuencia de ARN , Transcriptoma , Biomarcadores de Tumor/genética , Carcinoma de Células Escamosas de Cabeza y Cuello/genética , Carcinoma de Células Escamosas de Cabeza y Cuello/inmunología , Carcinoma de Células Escamosas de Cabeza y Cuello/patología , Regulación Neoplásica de la Expresión Génica , Masculino
3.
Methods Mol Biol ; 2808: 121-127, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38743366

RESUMEN

During the infection of a host cell by an infectious agent, a series of gene expression changes occurs as a consequence of host-pathogen interactions. Unraveling this complex interplay is the key for understanding of microbial virulence and host response pathways, thus providing the basis for new molecular insights into the mechanisms of pathogenesis and the corresponding immune response. Dual RNA sequencing (dual RNA-seq) has been developed to simultaneously determine pathogen and host transcriptomes enabling both differential and coexpression analyses between the two partners as well as genome characterization in the case of RNA viruses. Here, we provide a detailed laboratory protocol and bioinformatics analysis guidelines for dual RNA-seq experiments focusing on - but not restricted to - measles virus (MeV) as a pathogen of interest. The application of dual RNA-seq technologies in MeV-infected patients can potentially provide valuable information on the structure of the viral RNA genome and on cellular innate immune responses and drive the discovery of new targets for antiviral therapy.


Asunto(s)
Genoma Viral , Interacciones Huésped-Patógeno , Virus del Sarampión , Sarampión , ARN Viral , Humanos , Sarampión/virología , Sarampión/inmunología , Sarampión/genética , Virus del Sarampión/genética , Virus del Sarampión/patogenicidad , ARN Viral/genética , Interacciones Huésped-Patógeno/genética , Interacciones Huésped-Patógeno/inmunología , Biología Computacional/métodos , Análisis de Secuencia de ARN/métodos , RNA-Seq/métodos , Transcriptoma , Perfilación de la Expresión Génica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
4.
Nat Commun ; 15(1): 4055, 2024 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-38744843

RESUMEN

We introduce GRouNdGAN, a gene regulatory network (GRN)-guided reference-based causal implicit generative model for simulating single-cell RNA-seq data, in silico perturbation experiments, and benchmarking GRN inference methods. Through the imposition of a user-defined GRN in its architecture, GRouNdGAN simulates steady-state and transient-state single-cell datasets where genes are causally expressed under the control of their regulating transcription factors (TFs). Training on six experimental reference datasets, we show that our model captures non-linear TF-gene dependencies and preserves gene identities, cell trajectories, pseudo-time ordering, and technical and biological noise, with no user manipulation and only implicit parameterization. GRouNdGAN can synthesize cells under new conditions to perform in silico TF knockout experiments. Benchmarking various GRN inference algorithms reveals that GRouNdGAN effectively bridges the existing gap between simulated and biological data benchmarks of GRN inference algorithms, providing gold standard ground truth GRNs and realistic cells corresponding to the biological system of interest.


Asunto(s)
Algoritmos , Simulación por Computador , Redes Reguladoras de Genes , RNA-Seq , Análisis de la Célula Individual , Análisis de la Célula Individual/métodos , RNA-Seq/métodos , Humanos , Factores de Transcripción/metabolismo , Factores de Transcripción/genética , Biología Computacional/métodos , Benchmarking , Análisis de Secuencia de ARN/métodos , Análisis de Expresión Génica de una Sola Célula
5.
BMC Genomics ; 25(1): 455, 2024 May 08.
Artículo en Inglés | MEDLINE | ID: mdl-38720252

RESUMEN

BACKGROUND: Standard ChIP-seq and RNA-seq processing pipelines typically disregard sequencing reads whose origin is ambiguous ("multimappers"). This usual practice has potentially important consequences for the functional interpretation of the data: genomic elements belonging to clusters composed of highly similar members are left unexplored. RESULTS: In particular, disregarding multimappers leads to the underrepresentation in epigenetic studies of recently active transposable elements, such as AluYa5, L1HS and SVAs. Furthermore, this common strategy also has implications for transcriptomic analysis: members of repetitive gene families, such the ones including major histocompatibility complex (MHC) class I and II genes, are under-quantified. CONCLUSION: Revealing inherent biases that permeate routine tasks such as functional enrichment analysis, our results underscore the urgency of broadly adopting multimapper-aware bioinformatic pipelines -currently restricted to specific contexts or communities- to ensure the reliability of genomic and transcriptomic studies.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Elementos Transponibles de ADN/genética , Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Genómica/métodos , Análisis de Secuencia de ARN/métodos
6.
BMC Bioinformatics ; 25(1): 181, 2024 May 08.
Artículo en Inglés | MEDLINE | ID: mdl-38720247

RESUMEN

BACKGROUND: RNA sequencing combined with machine learning techniques has provided a modern approach to the molecular classification of cancer. Class predictors, reflecting the disease class, can be constructed for known tissue types using the gene expression measurements extracted from cancer patients. One challenge of current cancer predictors is that they often have suboptimal performance estimates when integrating molecular datasets generated from different labs. Often, the quality of the data is variable, procured differently, and contains unwanted noise hampering the ability of a predictive model to extract useful information. Data preprocessing methods can be applied in attempts to reduce these systematic variations and harmonize the datasets before they are used to build a machine learning model for resolving tissue of origins. RESULTS: We aimed to investigate the impact of data preprocessing steps-focusing on normalization, batch effect correction, and data scaling-through trial and comparison. Our goal was to improve the cross-study predictions of tissue of origin for common cancers on large-scale RNA-Seq datasets derived from thousands of patients and over a dozen tumor types. The results showed that the choice of data preprocessing operations affected the performance of the associated classifier models constructed for tissue of origin predictions in cancer. CONCLUSION: By using TCGA as a training set and applying data preprocessing methods, we demonstrated that batch effect correction improved performance measured by weighted F1-score in resolving tissue of origin against an independent GTEx test dataset. On the other hand, the use of data preprocessing operations worsened classification performance when the independent test dataset was aggregated from separate studies in ICGC and GEO. Therefore, based on our findings with these publicly available large-scale RNA-Seq datasets, the application of data preprocessing techniques to a machine learning pipeline is not always appropriate.


Asunto(s)
Aprendizaje Automático , Neoplasias , RNA-Seq , Humanos , RNA-Seq/métodos , Neoplasias/genética , Transcriptoma/genética , Análisis de Secuencia de ARN/métodos , Perfilación de la Expresión Génica/métodos , Biología Computacional/métodos
7.
Medicine (Baltimore) ; 103(19): e38144, 2024 May 10.
Artículo en Inglés | MEDLINE | ID: mdl-38728457

RESUMEN

Papillary thyroid carcinoma (PTC) prognosis may be deteriorated due to the metastases, and anoikis palys an essential role in the tumor metastasis. However, the potential effect of anoikis-related genes on the prognosis of PTC was unclear. The mRNA and clinical information were obtained from the cancer genome atlas database. Hub genes were identified and risk model was constructed using Cox regression analysis. Kaplan-Meier (K-M) curve was applied for the survival analysis. Immune infiltration and immune therapy response were calculated using CIBERSORT and TIDE. The identification of cell types and cell interaction was performed by Seurat, SingleR and CellChat packages. GO, KEGG, and GSVA were applied for the enrichment analysis. Protein-protein interaction network was constructed in STRING and Cytoscape. Drug sensitivity was assessed in GSCA. Based on bulk RNA data, we identified 4 anoikis-related risk signatures, which were oncogenes, and constructed a risk model. The enrichment analysis found high risk group was enriched in some immune-related pathways. High risk group had higher infiltration of Tregs, higher TIDE score and lower levels of monocytes and CD8 T cells. Based on scRNA data, we found that 4 hub genes were mainly expressed in monocytes and macrophages, and they interacted with T cells. Hub genes were significantly related to immune escape-related genes. Drug sensitivity analysis suggested that cyclin dependent kinase inhibitor 2A may be a better chemotherapy target. We constructed a risk model which could effectively and steadily predict the prognosis of PTC. We inferred that the immune escape may be involved in the development of PTC.


Asunto(s)
Anoicis , Cáncer Papilar Tiroideo , Neoplasias de la Tiroides , Humanos , Cáncer Papilar Tiroideo/genética , Cáncer Papilar Tiroideo/patología , Anoicis/genética , Neoplasias de la Tiroides/genética , Neoplasias de la Tiroides/patología , Pronóstico , Análisis de la Célula Individual/métodos , Análisis de Secuencia de ARN , Mapas de Interacción de Proteínas/genética , Femenino , Masculino , Estimación de Kaplan-Meier , Regulación Neoplásica de la Expresión Génica , Perfilación de la Expresión Génica/métodos
8.
J Pathol Clin Res ; 10(3): e12376, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38738521

RESUMEN

The identification of gene fusions has become an integral part of soft tissue and bone tumour diagnosis. We investigated the added value of targeted RNA-based sequencing (targeted RNA-seq, Archer FusionPlex) to our current molecular diagnostic workflow of these tumours, which is based on fluorescence in situ hybridisation (FISH) for the detection of gene fusions using 25 probes. In a series of 131 diagnostic samples targeted RNA-seq identified a gene fusion, BCOR internal tandem duplication or ALK deletion in 47 cases (35.9%). For 74 cases, encompassing 137 FISH analyses, concordance between FISH and targeted RNA-seq was evaluated. A positive or negative FISH result was confirmed by targeted RNA-seq in 27 out of 49 (55.1%) and 81 out of 88 (92.0%) analyses, respectively. While negative concordance was high, targeted RNA-seq identified a canonical gene fusion in seven cases despite a negative FISH result. The 22 discordant FISH-positive analyses showed a lower percentage of rearrangement-positive nuclei (range 15-41%) compared to the concordant FISH-positive analyses (>41% of nuclei in 88.9% of cases). Six FISH analyses (in four cases) were finally considered false positive based on histological and targeted RNA-seq findings. For the EWSR1 FISH probe, we observed a gene-dependent disparity (p = 0.0020), with 8 out of 35 cases showing a discordance between FISH and targeted RNA-seq (22.9%). This study demonstrates an added value of targeted RNA-seq to our current diagnostic workflow of soft tissue and bone tumours in 19 out of 131 cases (14.5%), which we categorised as altered diagnosis (3 cases), added precision (6 cases), or augmented spectrum (10 cases). In the latter subgroup, four novel fusion transcripts were found for which the clinical relevance remains unclear: NAB2::NCOA2, YAP1::NUTM2B, HSPA8::BRAF, and PDE2A::PLAG1. Overall, targeted RNA-seq has proven extremely valuable in the diagnostic workflow of soft tissue and bone tumours.


Asunto(s)
Neoplasias Óseas , Hibridación Fluorescente in Situ , Neoplasias de los Tejidos Blandos , Flujo de Trabajo , Humanos , Neoplasias Óseas/genética , Neoplasias Óseas/diagnóstico , Neoplasias Óseas/patología , Neoplasias de los Tejidos Blandos/genética , Neoplasias de los Tejidos Blandos/diagnóstico , Neoplasias de los Tejidos Blandos/patología , Femenino , Adulto , Masculino , Persona de Mediana Edad , Adolescente , Anciano , Análisis de Secuencia de ARN , Niño , Adulto Joven , Fusión Génica , Biomarcadores de Tumor/genética , Preescolar , Anciano de 80 o más Años , Proteínas de Fusión Oncogénica/genética
9.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38739758

RESUMEN

The complicated process of neuronal development is initiated early in life, with the genetic mechanisms governing this process yet to be fully elucidated. Single-cell RNA sequencing (scRNA-seq) is a potent instrument for pinpointing biomarkers that exhibit differential expression across various cell types and developmental stages. By employing scRNA-seq on human embryonic stem cells, we aim to identify differentially expressed genes (DEGs) crucial for early-stage neuronal development. Our focus extends beyond simply identifying DEGs. We strive to investigate the functional roles of these genes through enrichment analysis and construct gene regulatory networks to understand their interactions. Ultimately, this comprehensive approach aspires to illuminate the molecular mechanisms and transcriptional dynamics governing early human brain development. By uncovering potential links between these DEGs and intelligence, mental disorders, and neurodevelopmental disorders, we hope to shed light on human neurological health and disease. In this study, we have used scRNA-seq to identify DEGs involved in early-stage neuronal development in hESCs. The scRNA-seq data, collected on days 26 (D26) and 54 (D54), of the in vitro differentiation of hESCs to neurons were analyzed. Our analysis identified 539 DEGs between D26 and D54. Functional enrichment of those DEG biomarkers indicated that the up-regulated DEGs participated in neurogenesis, while the down-regulated DEGs were linked to synapse regulation. The Reactome pathway analysis revealed that down-regulated DEGs were involved in the interactions between proteins located in synapse pathways. We also discovered interactions between DEGs and miRNA, transcriptional factors (TFs) and DEGs, and between TF and miRNA. Our study identified 20 significant transcription factors, shedding light on early brain development genetics. The identified DEGs and gene regulatory networks are valuable resources for future research into human brain development and neurodevelopmental disorders.


Asunto(s)
Biomarcadores , Encéfalo , Redes Reguladoras de Genes , Células Madre Embrionarias Humanas , Análisis de la Célula Individual , Humanos , Análisis de la Célula Individual/métodos , Células Madre Embrionarias Humanas/metabolismo , Células Madre Embrionarias Humanas/citología , Encéfalo/metabolismo , Encéfalo/embriología , Encéfalo/citología , Biomarcadores/metabolismo , Neuronas/metabolismo , Neuronas/citología , Diferenciación Celular/genética , RNA-Seq , Neurogénesis/genética , Regulación del Desarrollo de la Expresión Génica , Perfilación de la Expresión Génica , Análisis de Secuencia de ARN/métodos , Análisis de Expresión Génica de una Sola Célula
10.
Cancer Immunol Immunother ; 73(6): 112, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38693422

RESUMEN

OBJECTIVE: The high mortality rate of gastric cancer, traditionally managed through surgery, underscores the urgent need for advanced therapeutic strategies. Despite advancements in treatment modalities, outcomes remain suboptimal, necessitating the identification of novel biomarkers to predict sensitivity to immunotherapy. This study focuses on utilizing single-cell sequencing for gene identification and developing a random forest model to predict immunotherapy sensitivity in gastric cancer patients. METHODS: Differentially expressed genes were identified using single-cell RNA sequencing (scRNA-seq) and gene set enrichment analysis (GESA). A random forest model was constructed based on these genes, and its effectiveness was validated through prognostic analysis. Further, analyses of immune cell infiltration, immune checkpoints, and the random forest model provided deeper insights. RESULTS: High METTL1 expression was found to correlate with improved survival rates in gastric cancer patients (P = 0.042), and the random forest model, based on METTL1 and associated prognostic genes, achieved a significant predictive performance (AUC = 0.863). It showed associations with various immune cell types and negative correlations with CTLA4 and PDCD1 immune checkpoints. Experiments in vitro and in vivo demonstrated that METTL1 enhances gastric cancer cell activity by suppressing T cell proliferation and upregulating CTLA4 and PDCD1. CONCLUSION: The random forest model, based on scRNA-seq, shows high predictive value for survival and immunotherapy sensitivity in gastric cancer patients. This study underscores the potential of METTL1 as a biomarker in enhancing the efficacy of gastric cancer immunotherapy.


Asunto(s)
Inmunoterapia , Análisis de la Célula Individual , Neoplasias Gástricas , Neoplasias Gástricas/genética , Neoplasias Gástricas/terapia , Neoplasias Gástricas/inmunología , Neoplasias Gástricas/mortalidad , Humanos , Análisis de la Célula Individual/métodos , Inmunoterapia/métodos , Animales , Ratones , Pronóstico , Biomarcadores de Tumor/genética , Análisis de Secuencia de ARN/métodos , Femenino , Masculino , Regulación Neoplásica de la Expresión Génica , Ensayos Antitumor por Modelo de Xenoinjerto , Línea Celular Tumoral , Bosques Aleatorios
11.
BMC Genomics ; 25(1): 489, 2024 May 17.
Artículo en Inglés | MEDLINE | ID: mdl-38760729

RESUMEN

BACKGROUND: The cellular origin of hypopharyngeal diseases is crucial for further diagnosis and treatment, and the microenvironment in tissues may also be associated with specific cell types at the same time. Normal adjacent tissues (NATs) of hypopharyngeal carcinoma differ from non-tumor-bearing tissues, and can influenced by the tumor. However, the heterogeneity in kinds of disease samples remains little known, and the transcriptomic profile about biological information associated with disease occurrence and clinical outcome contained in it has yet to be fully evaluated. For these reasons, we should quickly investigate the taxonomic and transcriptomic information of NATs in human hypopharynx. RESULTS: Single-cell suspensions of normal adjacent tissues (NATs) of hypopharyngeal carcinoma were obtained and single-cell RNA sequencing (scRNA-seq) was performed. We present scRNA-seq data from 39,315 high-quality cells in the hypopharyngeal from five human donors, nine clusters of normal adjacent human hypopharyngeal cells were presented, including epithelial cells, endothelial cells (ECs), mononuclear phagocyte system cells (MPs), fibroblasts, T cells, plasma cells, B cells, mural cells and mast cells. Nonimmune components in the microenvironment, including epithelial cells, endothelial cells, fibroblasts and the subpopulations of them were performed. CONCLUSIONS: Our data provide a solid basis for the study of single-cell landscape in human normal adjacent hypopharyngeal tissues biology and related diseases.


Asunto(s)
Neoplasias Hipofaríngeas , Análisis de la Célula Individual , Transcriptoma , Microambiente Tumoral , Humanos , Neoplasias Hipofaríngeas/genética , Neoplasias Hipofaríngeas/patología , Microambiente Tumoral/genética , Hipofaringe/patología , Hipofaringe/metabolismo , Perfilación de la Expresión Génica , Masculino , Análisis de Secuencia de ARN
12.
J Cell Mol Med ; 28(10): e18378, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38760895

RESUMEN

The efficacy of radiotherapy, a cornerstone in the treatment of lung adenocarcinoma (LUAD), is profoundly undermined by radiotolerance. This resistance not only poses a significant clinical challenge but also compromises patient survival rates. Therefore, it is important to explore this mechanism for the treatment of LUAD. Multiple public databases were used for single-cell RNA sequencing (scRNA-seq) data. We filtered, normalized and downscaled scRNA-seq data based on the Seurat package to obtain different cell subpopulations. Subsequently, the ssGSEA algorithm was used to assess the enrichment scores of the different cell subpopulations, and thus screen the cell subpopulations that are most relevant to radiotherapy tolerance based on the Pearson method. Finally, pseudotime analysis was performed, and a preliminary exploration of gene mutations in different cell subpopulations was performed. We identified HIST1H1D+ A549 and PIF1+ A549 as the cell subpopulations related to radiotolerance. The expression levels of cell cycle-related genes and pathway enrichment scores of these two cell subpopulations increased gradually with the extension of radiation treatment time. Finally, we found that the proportion of TP53 mutations in patients who had received radiotherapy was significantly higher than that in patients who had not received radiotherapy. We identified two cellular subpopulations associated with radiotherapy tolerance, which may shed light on the molecular mechanisms of radiotherapy tolerance in LUAD and provide new clinical perspectives.


Asunto(s)
Adenocarcinoma del Pulmón , Regulación Neoplásica de la Expresión Génica , Neoplasias Pulmonares , Mutación , Tolerancia a Radiación , Análisis de la Célula Individual , Humanos , Análisis de la Célula Individual/métodos , Adenocarcinoma del Pulmón/genética , Adenocarcinoma del Pulmón/radioterapia , Adenocarcinoma del Pulmón/patología , Tolerancia a Radiación/genética , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/radioterapia , Neoplasias Pulmonares/patología , Regulación Neoplásica de la Expresión Génica/efectos de la radiación , Análisis de Secuencia de ARN/métodos , Proteína p53 Supresora de Tumor/genética , Proteína p53 Supresora de Tumor/metabolismo , Células A549 , Perfilación de la Expresión Génica , Línea Celular Tumoral
13.
J Vis Exp ; (207)2024 May 03.
Artículo en Inglés | MEDLINE | ID: mdl-38767365

RESUMEN

Intermuscular adipose tissue (IMAT) is a relatively understudied adipose depot located between muscle fibers. IMAT content increases with age and BMI and is associated with metabolic and muscle degenerative diseases; however, an understanding of the biological properties of IMAT and its interplay with the surrounding muscle fibers is severely lacking. In recent years, single-cell and nuclei RNA sequencing have provided us with cell type-specific atlases of several human tissues. However, the cellular composition of human IMAT remains largely unexplored due to the inherent challenges of its accessibility from biopsy collection in humans. In addition to the limited amount of tissue collected, the processing of human IMAT is complicated due to its proximity to skeletal muscle tissue and fascia. The lipid-laden nature of the adipocytes makes it incompatible with single-cell isolation. Hence, single nuclei RNA sequencing is optimal for obtaining high-dimensional transcriptomics at single-cell resolution and provides the potential to uncover the biology of this depot, including the exact cellular composition of IMAT. Here, we present a detailed protocol for nuclei isolation and library preparation of frozen human IMAT for single nuclei RNA sequencing. This protocol allows for the profiling of thousands of nuclei using a droplet-based approach, thus providing the capacity to detect rare and low-abundant cell types.


Asunto(s)
Tejido Adiposo , Núcleo Celular , Análisis de Secuencia de ARN , Humanos , Tejido Adiposo/citología , Análisis de Secuencia de ARN/métodos , Núcleo Celular/química , Núcleo Celular/genética , Análisis de la Célula Individual/métodos , Músculo Esquelético/citología , Músculo Esquelético/química
14.
BMC Cancer ; 24(1): 607, 2024 May 20.
Artículo en Inglés | MEDLINE | ID: mdl-38769480

RESUMEN

BACKGROUND: Cancerous cells' identity is determined via a mixture of multiple factors such as genomic variations, epigenetics, and the regulatory variations that are involved in transcription. The differences in transcriptome expression as well as abnormal structures in peptides determine phenotypical differences. Thus, bulk RNA-seq and more recent single-cell RNA-seq data (scRNA-seq) are important to identify pathogenic differences. In this case, we rely on k-mer decomposition of sequences to identify pathogenic variations in detail which does not need a reference, so it outperforms more traditional Next-Generation Sequencing (NGS) analysis techniques depending on the alignment of the sequences to a reference. RESULTS: Via our alignment-free analysis, over esophageal and glioblastoma cancer patients, high-frequency variations over multiple different locations (repeats, intergenic regions, exons, introns) as well as multiple different forms (fusion, polyadenylation, splicing, etc.) could be discovered. Additionally, we have analyzed the importance of less-focused events systematically in a classic transcriptome analysis pipeline where these events are considered as indicators for tumor prognosis, tumor prediction, tumor neoantigen inference, as well as their connection with respect to the immune microenvironment. CONCLUSIONS: Our results suggest that esophageal cancer (ESCA) and glioblastoma processes can be explained via pathogenic microbial RNA, repeated sequences, novel splicing variants, and long intergenic non-coding RNAs (lincRNAs). We expect our application of reference-free process and analysis to be helpful in tumor and normal samples differential scRNA-seq analysis, which in turn offers a more comprehensive scheme for major cancer-associated events.


Asunto(s)
Glioblastoma , Análisis de la Célula Individual , Transcriptoma , Humanos , Análisis de la Célula Individual/métodos , Glioblastoma/genética , Glioblastoma/patología , Perfilación de la Expresión Génica/métodos , Neoplasias Esofágicas/genética , Neoplasias Esofágicas/patología , Secuenciación de Nucleótidos de Alto Rendimiento , RNA-Seq/métodos , Análisis de Secuencia de ARN/métodos , Regulación Neoplásica de la Expresión Génica , Neoplasias/genética , Neoplasias/patología
15.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38770716

RESUMEN

Temporal RNA-sequencing (RNA-seq) studies of bulk samples provide an opportunity for improved understanding of gene regulation during dynamic phenomena such as development, tumor progression or response to an incremental dose of a pharmacotherapeutic. Moreover, single-cell RNA-seq (scRNA-seq) data implicitly exhibit temporal characteristics because gene expression values recapitulate dynamic processes such as cellular transitions. Unfortunately, temporal RNA-seq data continue to be analyzed by methods that ignore this ordinal structure and yield results that are often difficult to interpret. Here, we present Error Modelled Gene Expression Analysis (EMOGEA), a framework for analyzing RNA-seq data that incorporates measurement uncertainty, while introducing a special formulation for those acquired to monitor dynamic phenomena. This method is specifically suited for RNA-seq studies in which low-count transcripts with small-fold changes lead to significant biological effects. Such transcripts include genes involved in signaling and non-coding RNAs that inherently exhibit low levels of expression. Using simulation studies, we show that this framework down-weights samples that exhibit extreme responses such as batch effects allowing them to be modeled with the rest of the samples and maintain the degrees of freedom originally envisioned for a study. Using temporal experimental data, we demonstrate the framework by extracting a cascade of gene expression waves from a well-designed RNA-seq study of zebrafish embryogenesis and an scRNA-seq study of mouse pre-implantation and provide unique biological insights into the regulation of genes in each wave. For non-ordinal measurements, we show that EMOGEA has a much higher rate of true positive calls and a vanishingly small rate of false negative discoveries compared to common approaches. Finally, we provide two packages in Python and R that are self-contained and easy to use, including test data.


Asunto(s)
RNA-Seq , Pez Cebra , Animales , Pez Cebra/genética , RNA-Seq/métodos , Perfilación de la Expresión Génica/métodos , Análisis de la Célula Individual/métodos , Ratones , Análisis de Secuencia de ARN/métodos , Programas Informáticos
16.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38770720

RESUMEN

The normalization of RNA sequencing data is a primary step for downstream analysis. The most popular method used for the normalization is the trimmed mean of M values (TMM) and DESeq. The TMM tries to trim away extreme log fold changes of the data to normalize the raw read counts based on the remaining non-deferentially expressed genes. However, the major problem with the TMM is that the values of trimming factor M are heuristic. This paper tries to estimate the adaptive value of M in TMM based on Jaeckel's Estimator, and each sample acts as a reference to find the scale factor of each sample. The presented approach is validated on SEQC, MAQC2, MAQC3, PICKRELL and two simulated datasets with two-group and three-group conditions by varying the percentage of differential expression and the number of replicates. The performance of the present approach is compared with various state-of-the-art methods, and it is better in terms of area under the receiver operating characteristic curve and differential expression.


Asunto(s)
RNA-Seq , RNA-Seq/métodos , Humanos , Algoritmos , Análisis de Secuencia de ARN/métodos , Biología Computacional/métodos , Perfilación de la Expresión Génica/métodos , Curva ROC , Programas Informáticos
17.
Int J Mol Sci ; 25(9)2024 Apr 30.
Artículo en Inglés | MEDLINE | ID: mdl-38732140

RESUMEN

Glioblastoma Multiforme is a brain tumor distinguished by its aggressiveness. We suggested that this aggressiveness leads single-cell RNA-sequence data (scRNA-seq) to span a representative portion of the cancer attractors domain. This conjecture allowed us to interpret the scRNA-seq heterogeneity as reflecting a representative trajectory within the attractor's domain. We considered factors such as genomic instability to characterize the cancer dynamics through stochastic fixed points. The fixed points were derived from centroids obtained through various clustering methods to verify our method sensitivity. This methodological foundation is based upon sample and time average equivalence, assigning an interpretative value to the data cluster centroids and supporting parameters estimation. We used stochastic simulations to reproduce the dynamics, and our results showed an alignment between experimental and simulated dataset centroids. We also computed the Waddington landscape, which provided a visual framework for validating the centroids and standard deviations as characterizations of cancer attractors. Additionally, we examined the stability and transitions between attractors and revealed a potential interplay between subtypes. These transitions might be related to cancer recurrence and progression, connecting the molecular mechanisms of cancer heterogeneity with statistical properties of gene expression dynamics. Our work advances the modeling of gene expression dynamics and paves the way for personalized therapeutic interventions.


Asunto(s)
Neoplasias Encefálicas , Glioblastoma , Análisis de la Célula Individual , Glioblastoma/genética , Glioblastoma/patología , Glioblastoma/metabolismo , Humanos , Análisis de la Célula Individual/métodos , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Neoplasias Encefálicas/metabolismo , Regulación Neoplásica de la Expresión Génica , Heterogeneidad Genética , Perfilación de la Expresión Génica/métodos , Inestabilidad Genómica , Análisis de Secuencia de ARN/métodos , Análisis por Conglomerados
18.
CNS Neurosci Ther ; 30(5): e14741, 2024 05.
Artículo en Inglés | MEDLINE | ID: mdl-38702940

RESUMEN

AIMS: Despite the success of single-cell RNA sequencing in identifying cellular heterogeneity in ischemic stroke, clarifying the mechanisms underlying these associations of differently expressed genes remains challenging. Several studies that integrate gene expression and gene expression quantitative trait loci (eQTLs) with genome wide-association study (GWAS) data to determine their causal role have been proposed. METHODS: Here, we combined Mendelian randomization (MR) framework and single cell (sc) RNA sequencing to study how differently expressed genes (DEGs) mediating the effect of gene expression on ischemic stroke. The hub gene was further validated in the in vitro model. RESULTS: We identified 2339 DEGs in 10 cell clusters. Among these DEGs, 58 genes were associated with the risk of ischemic stroke. After external validation with eQTL dataset, lactate dehydrogenase B (LDHB) is identified to be positively associated with ischemic stroke. The expression of LDHB has also been validated in sc RNA-seq with dominant expression in microglia and astrocytes, and melatonin is able to reduce the LDHB expression and activity in vitro ischemic models. CONCLUSION: Our study identifies LDHB as a novel biomarker for ischemic stroke via combining the sc RNA-seq and MR analysis.


Asunto(s)
Accidente Cerebrovascular Isquémico , L-Lactato Deshidrogenasa , Melatonina , Análisis de la Aleatorización Mendeliana , Análisis de Secuencia de ARN , Animales , Humanos , Estudio de Asociación del Genoma Completo/métodos , Accidente Cerebrovascular Isquémico/genética , Accidente Cerebrovascular Isquémico/metabolismo , Isoenzimas/genética , Isoenzimas/metabolismo , L-Lactato Deshidrogenasa/metabolismo , L-Lactato Deshidrogenasa/genética , Análisis de la Aleatorización Mendeliana/métodos , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Ratones
19.
Nat Commun ; 15(1): 3899, 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38724548

RESUMEN

The epitranscriptome embodies many new and largely unexplored functions of RNA. A significant roadblock hindering progress in epitranscriptomics is the identification of more than one modification in individual transcript molecules. We address this with CHEUI (CH3 (methylation) Estimation Using Ionic current). CHEUI predicts N6-methyladenosine (m6A) and 5-methylcytosine (m5C) in individual molecules from the same sample, the stoichiometry at transcript reference sites, and differential methylation between any two conditions. CHEUI processes observed and expected nanopore direct RNA sequencing signals to achieve high single-molecule, transcript-site, and stoichiometry accuracies in multiple tests using synthetic RNA standards and cell line data. CHEUI's capability to identify two modification types in the same sample reveals a co-occurrence of m6A and m5C in individual mRNAs in cell line and tissue transcriptomes. CHEUI provides new avenues to discover and study the function of the epitranscriptome.


Asunto(s)
5-Metilcitosina , Adenosina , Análisis de Secuencia de ARN , Transcriptoma , Adenosina/análogos & derivados , Adenosina/metabolismo , 5-Metilcitosina/metabolismo , 5-Metilcitosina/análogos & derivados , Humanos , Metilación , Análisis de Secuencia de ARN/métodos , Procesamiento Postranscripcional del ARN , ARN Mensajero/metabolismo , ARN Mensajero/genética , ARN/metabolismo , ARN/genética
20.
Nat Commun ; 15(1): 3946, 2024 May 10.
Artículo en Inglés | MEDLINE | ID: mdl-38729950

RESUMEN

Disease modeling with isogenic Induced Pluripotent Stem Cell (iPSC)-differentiated organoids serves as a powerful technique for studying disease mechanisms. Multiplexed coculture is crucial to mitigate batch effects when studying the genetic effects of disease-causing variants in differentiated iPSCs or organoids, and demultiplexing at the single-cell level can be conveniently achieved by assessing natural genetic barcodes. Here, to enable cost-efficient time-series experimental designs via multiplexed bulk and single-cell RNA-seq of hybrids, we introduce a computational method in our Vireo Suite, Vireo-bulk, to effectively deconvolve pooled bulk RNA-seq data by genotype reference, and thereby quantify donor abundance over the course of differentiation and identify differentially expressed genes among donors. Furthermore, with multiplexed scRNA-seq and bulk RNA-seq, we demonstrate the usefulness and necessity of a pooled design to reveal donor iPSC line heterogeneity during macrophage cell differentiation and to model rare WT1 mutation-driven kidney disease with chimeric organoids. Our work provides an experimental and analytic pipeline for dissecting disease mechanisms with chimeric organoids.


Asunto(s)
Diferenciación Celular , Células Madre Pluripotentes Inducidas , Organoides , RNA-Seq , Análisis de la Célula Individual , Organoides/metabolismo , Análisis de la Célula Individual/métodos , Células Madre Pluripotentes Inducidas/metabolismo , Células Madre Pluripotentes Inducidas/citología , Humanos , Diferenciación Celular/genética , RNA-Seq/métodos , Análisis de Secuencia de ARN/métodos , Macrófagos/metabolismo , Macrófagos/citología , Animales , Análisis de Expresión Génica de una Sola Célula
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA