Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 179(3): 736-749.e15, 2019 10 17.
Artículo en Inglés | MEDLINE | ID: mdl-31626772

RESUMEN

Underrepresentation of Asian genomes has hindered population and medical genetics research on Asians, leading to population disparities in precision medicine. By whole-genome sequencing of 4,810 Singapore Chinese, Malays, and Indians, we found 98.3 million SNPs and small insertions or deletions, over half of which are novel. Population structure analysis demonstrated great representation of Asian genetic diversity by three ethnicities in Singapore and revealed a Malay-related novel ancestry component. Furthermore, demographic inference suggested that Malays split from Chinese ∼24,800 years ago and experienced significant admixture with East Asians ∼1,700 years ago, coinciding with the Austronesian expansion. Additionally, we identified 20 candidate loci for natural selection, 14 of which harbored robust associations with complex traits and diseases. Finally, we show that our data can substantially improve genotype imputation in diverse Asian and Oceanian populations. These results highlight the value of our data as a resource to empower human genetics discovery across broad geographic regions.


Asunto(s)
Genética de Población , Genoma Humano/genética , Selección Genética , Secuenciación Completa del Genoma , Pueblo Asiatico/genética , Femenino , Genotipo , Humanos , Malasia/epidemiología , Masculino , Polimorfismo de Nucleótido Simple/genética , Singapur/epidemiología
2.
Genes Dev ; 2022 Aug 25.
Artículo en Inglés | MEDLINE | ID: mdl-36008138

RESUMEN

Stem cells are fundamental units of tissue remodeling whose functions are dictated by lineage-specific transcription factors. Home to epidermal stem cells and their upward-stratifying progenies, skin relies on its secretory functions to form the outermost protective barrier, of which a transcriptional orchestrator has been elusive. KLF5 is a Krüppel-like transcription factor broadly involved in development and regeneration whose lineage specificity, if any, remains unclear. Here we report KLF5 specifically marks the epidermis, and its deletion leads to skin barrier dysfunction in vivo. Lipid envelopes and secretory lamellar bodies are defective in KLF5-deficient skin, accompanied by preferential loss of complex sphingolipids. KLF5 binds to and transcriptionally regulates genes encoding rate-limiting sphingolipid metabolism enzymes. Remarkably, skin barrier defects elicited by KLF5 ablation can be rescued by dietary interventions. Finally, we found that KLF5 is widely suppressed in human diseases with disrupted epidermal secretion, and its regulation of sphingolipid metabolism is conserved in human skin. Altogether, we established KLF5 as a disease-relevant transcription factor governing sphingolipid metabolism and barrier function in the skin, likely representing a long-sought secretory lineage-defining factor across tissue types.

3.
J Biomed Inform ; 154: 104648, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38692464

RESUMEN

BACKGROUND: Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive. OBJECTIVE: This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database. METHODS: We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT. RESULTS: Our multimodal model achieved a lead time of at least 12 h ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT. CONCLUSION: Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.


Asunto(s)
Lesión Renal Aguda , Registros Electrónicos de Salud , Unidades de Cuidados Intensivos , Lesión Renal Aguda/terapia , Humanos , Estudios Longitudinales , Terapia de Reemplazo Renal , Inteligencia Artificial , Predicción , Tiempo de Internación , Masculino , Bases de Datos Factuales , Femenino
4.
Brief Bioinform ; 22(3)2021 05 20.
Artículo en Inglés | MEDLINE | ID: mdl-32591784

RESUMEN

Whole-exome sequencing (WES) has been widely used to study the role of protein-coding variants in genetic diseases. Non-coding regions, typically covered by sparse off-target data, are often discarded by conventional WES analyses. Here, we develop a genotype calling pipeline named WEScall to analyse both target and off-target data. We leverage linkage disequilibrium shared within study samples and from an external reference panel to improve genotyping accuracy. In an application to WES of 2527 Chinese and Malays, WEScall can reduce the genotype discordance rate from 0.26% (SE= 6.4 × 10-6) to 0.08% (SE = 3.6 × 10-6) across 1.1 million single nucleotide polymorphisms (SNPs) in the deeply sequenced target regions. Furthermore, we obtain genotypes at 0.70% (SE = 3.0 × 10-6) discordance rate across 5.2 million off-target SNPs, which had ~1.2× mean sequencing depth. Using this dataset, we perform genome-wide association studies of 10 metabolic traits. Despite of our small sample size, we identify 10 loci at genome-wide significance (P < 5 × 10-8), including eight well-established loci. The two novel loci, both associated with glycated haemoglobin levels, are GPATCH8-SLC4A1 (rs369762319, P = 2.56 × 10-12) and ROR2 (rs1201042, P = 3.24 × 10-8). Finally, using summary statistics from UK Biobank and Biobank Japan, we show that polygenic risk prediction can be significantly improved for six out of nine traits by incorporating off-target data (P < 0.01). These results demonstrate WEScall as a useful tool to facilitate WES studies with decent amounts of off-target data.


Asunto(s)
Secuenciación del Exoma/métodos , Predisposición Genética a la Enfermedad , Genotipo , Proteína 1 de Intercambio de Anión de Eritrocito/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Desequilibrio de Ligamiento , Proteínas Musculares/genética , Polimorfismo de Nucleótido Simple
5.
BMC Bioinformatics ; 23(1): 2, 2022 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-34983369

RESUMEN

Cellular heterogeneity underlies cancer evolution and metastasis. Advances in single-cell technologies such as single-cell RNA sequencing and mass cytometry have enabled interrogation of cell type-specific expression profiles and abundance across heterogeneous cancer samples obtained from clinical trials and preclinical studies. However, challenges remain in determining sample sizes needed for ascertaining changes in cell type abundances in a controlled study. To address this statistical challenge, we have developed a new approach, named Sensei, to determine the number of samples and the number of cells that are required to ascertain such changes between two groups of samples in single-cell studies. Sensei expands the t-test and models the cell abundances using a beta-binomial distribution. We evaluate the mathematical accuracy of Sensei and provide practical guidelines on over 20 cell types in over 30 cancer types based on knowledge acquired from the cancer cell atlas (TCGA) and prior single-cell studies. We provide a web application to enable user-friendly study design via https://kchen-lab.github.io/sensei/table_beta.html .


Asunto(s)
Neoplasias , Programas Informáticos , Distribución Binomial , Humanos , Neoplasias/genética , Proyectos de Investigación , Tamaño de la Muestra
6.
Mol Biol Evol ; 38(10): 4463-4474, 2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34152401

RESUMEN

The Peranakan Chinese are culturally unique descendants of immigrants from China who settled in the Malay Archipelago ∼300-500 years ago. Today, among large communities in Southeast Asia, the Peranakans have preserved Chinese traditions with strong influence from the local indigenous Malays. Yet, whether or to what extent genetic admixture co-occurred with the cultural mixture has been a topic of ongoing debate. We performed whole-genome sequencing (WGS) on 177 Singapore (SG) Peranakans and analyzed the data jointly with WGS data of Asian and European populations. We estimated that Peranakan Chinese inherited ∼5.62% (95% confidence interval [CI]: 4.76-6.49%) Malay ancestry, much higher than that in SG Chinese (1.08%, 0.65-1.51%), southern Chinese (0.86%, 0.50-1.23%), and northern Chinese (0.25%, 0.18-0.32%). A sex-biased admixture history, in which the Malay ancestry was contributed primarily by females, was supported by X chromosomal variants, and mitochondrial (MT) and Y haplogroups. Finally, we identified an ancient admixture event shared by Peranakan Chinese and SG Chinese ∼1,612 (95% CI: 1,345-1,923) years ago, coinciding with the settlement history of Han Chinese in southern China, apart from the recent admixture event with Malays unique to Peranakan Chinese ∼190 (159-213) years ago. These findings greatly advance our understanding of the dispersal history of Chinese and their interaction with indigenous populations in Southeast Asia.


Asunto(s)
Pueblo Asiatico , Genética de Población , Asia Sudoriental , Pueblo Asiatico/genética , China , Femenino , Humanos , Secuenciación Completa del Genoma
7.
Cytometry A ; 99(9): 899-909, 2021 09.
Artículo en Inglés | MEDLINE | ID: mdl-33342071

RESUMEN

Signal intensity measured in a mass cytometry (CyTOF) channel can often be affected by the neighboring channels due to technological limitations. Such signal artifacts are known as spillover effects and can substantially limit the accuracy of cell population clustering. Current approaches reduce these effects by using additional beads for normalization purposes known as single-stained controls. While effective in compensating for spillover effects, incorporating single-stained controls can be costly and require customized panel design. This is especially evident when executing large-scale immune profiling studies. We present a novel statistical method, named CytoSpill that independently quantifies and compensates the spillover effects in CyTOF data without requiring the use of single-stained controls. Our method utilizes knowledge-guided modeling and statistical techniques, such as finite mixture modeling and sequential quadratic programming, to achieve optimal error correction. We evaluated our method using five publicly available CyTOF datasets obtained from human peripheral blood mononuclear cells (PBMCs), C57BL/6J mouse bone marrow, healthy human bone marrow, chronic lymphocytic leukemia patient, and healthy human cord blood samples. In the PBMCs with known ground truth, our method achieved comparable results to experiments that incorporated single-stained controls. In datasets without ground-truth, our method not only reduced spillover on likely affected markers, but also led to the discovery of potentially novel subpopulations expressing functionally meaningful, cluster-specific markers. CytoSpill (developed in R) will greatly enhance the execution of large-scale cellular profiling of tumor immune microenvironment, development of novel immunotherapy, and the discovery of immune-specific biomarkers. The implementation of our method can be found at https://github.com/KChen-lab/CytoSpill.git.


Asunto(s)
Leucocitos Mononucleares , Animales , Biomarcadores , Análisis por Conglomerados , Citometría de Flujo , Humanos , Ratones , Ratones Endogámicos C57BL
8.
PLoS Genet ; 13(9): e1007021, 2017 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-28961250

RESUMEN

Knowledge of biological relatedness between samples is important for many genetic studies. In large-scale human genetic association studies, the estimated kinship is used to remove cryptic relatedness, control for family structure, and estimate trait heritability. However, estimation of kinship is challenging for sparse sequencing data, such as those from off-target regions in target sequencing studies, where genotypes are largely uncertain or missing. Existing methods often assume accurate genotypes at a large number of markers across the genome. We show that these methods, without accounting for the genotype uncertainty in sparse sequencing data, can yield a strong downward bias in kinship estimation. We develop a computationally efficient method called SEEKIN to estimate kinship for both homogeneous samples and heterogeneous samples with population structure and admixture. Our method models genotype uncertainty and leverages linkage disequilibrium through imputation. We test SEEKIN on a whole exome sequencing dataset (WES) of Singapore Chinese and Malays, which involves substantial population structure and admixture. We show that SEEKIN can accurately estimate kinship coefficient and classify genetic relatedness using off-target sequencing data down sampled to ~0.15X depth. In application to the full WES dataset without down sampling, SEEKIN also outperforms existing methods by properly analyzing shallow off-target data (~0.75X). Using both simulated and real phenotypes, we further illustrate how our method improves estimation of trait heritability for WES studies.


Asunto(s)
Bases de Datos Genéticas , Genética de Población/métodos , Genoma Humano , Análisis de Secuencia de ADN , Pueblo Asiatico/genética , Biología Computacional , Exoma , Estudios de Asociación Genética , Genotipo , Técnicas de Genotipaje , Humanos , Desequilibrio de Ligamiento , Modelos Genéticos , Programas Informáticos
9.
Commun Biol ; 7(1): 326, 2024 Mar 14.
Artículo en Inglés | MEDLINE | ID: mdl-38486077

RESUMEN

Clustering and visualization are essential parts of single-cell gene expression data analysis. The Euclidean distance used in most distance-based methods is not optimal. The batch effect, i.e., the variability among samples gathered from different times, tissues, and patients, introduces large between-group distance and obscures the true identities of cells. To solve this problem, we introduce Label-Aware Distance (LAD), a metric using temporal/spatial locality of the batch effect to control for such factors. We validate LAD on simulated data as well as apply it to a mouse retina development dataset and a lung dataset. We also found the utility of our approach in understanding the progression of the Coronavirus Disease 2019 (COVID-19). LAD provides better cell embedding than state-of-the-art batch correction methods on longitudinal datasets. It can be used in distance-based clustering and visualization methods to combine the power of multiple samples to help make biological findings.


Asunto(s)
Análisis por Conglomerados , Animales , Ratones , Expresión Génica
10.
medRxiv ; 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38559064

RESUMEN

Background: Advances in artificial intelligence (AI) have realized the potential of revolutionizing healthcare, such as predicting disease progression via longitudinal inspection of Electronic Health Records (EHRs) and lab tests from patients admitted to Intensive Care Units (ICU). Although substantial literature exists addressing broad subjects, including the prediction of mortality, length-of-stay, and readmission, studies focusing on forecasting Acute Kidney Injury (AKI), specifically dialysis anticipation like Continuous Renal Replacement Therapy (CRRT) are scarce. The technicality of how to implement AI remains elusive. Objective: This study aims to elucidate the important factors and methods that are required to develop effective predictive models of AKI and CRRT for patients admitted to ICU, using EHRs in the Medical Information Mart for Intensive Care (MIMIC) database. Methods: We conducted a comprehensive comparative analysis of established predictive models, considering both time-series measurements and clinical notes from MIMIC-IV databases. Subsequently, we proposed a novel multi-modal model which integrates embeddings of top-performing unimodal models, including Long Short-Term Memory (LSTM) and BioMedBERT, and leverages both unstructured clinical notes and structured time series measurements derived from EHRs to enable the early prediction of AKI and CRRT. Results: Our multimodal model achieved a lead time of at least 12 hours ahead of clinical manifestation, with an Area Under the Receiver Operating Characteristic Curve (AUROC) of 0.888 for AKI and 0.997 for CRRT, as well as an Area Under the Precision Recall Curve (AUPRC) of 0.727 for AKI and 0.840 for CRRT, respectively, which significantly outperformed the baseline models. Additionally, we performed a SHapley Additive exPlanation (SHAP) analysis using the expected gradients algorithm, which highlighted important, previously underappreciated predictive features for AKI and CRRT. Conclusion: Our study revealed the importance and the technicality of applying longitudinal, multimodal modeling to improve early prediction of AKI and CRRT, offering insights for timely interventions. The performance and interpretability of our model indicate its potential for further assessment towards clinical applications, to ultimately optimize AKI management and enhance patient outcomes.

11.
Cancer Cell ; 42(8): 1450-1466.e11, 2024 Aug 12.
Artículo en Inglés | MEDLINE | ID: mdl-39137729

RESUMEN

Glioblastoma (GBM) is an aggressive brain cancer with limited therapeutic options. Natural killer (NK) cells are innate immune cells with strong anti-tumor activity and may offer a promising treatment strategy for GBM. We compared the anti-GBM activity of NK cells engineered to express interleukin (IL)-15 or IL-21. Using multiple in vivo models, IL-21 NK cells were superior to IL-15 NK cells both in terms of safety and long-term anti-tumor activity, with locoregionally administered IL-15 NK cells proving toxic and ineffective at tumor control. IL-21 NK cells displayed a unique chromatin accessibility signature, with CCAAT/enhancer-binding proteins (C/EBP), especially CEBPD, serving as key transcription factors regulating their enhanced function. Deletion of CEBPD resulted in loss of IL-21 NK cell potency while its overexpression increased NK cell long-term cytotoxicity and metabolic fitness. These results suggest that IL-21, through C/EBP transcription factors, drives epigenetic reprogramming of NK cells, enhancing their anti-tumor efficacy against GBM.


Asunto(s)
Neoplasias Encefálicas , Proteína delta de Unión al Potenciador CCAAT , Glioblastoma , Interleucinas , Células Asesinas Naturales , Células Asesinas Naturales/inmunología , Células Asesinas Naturales/metabolismo , Glioblastoma/inmunología , Glioblastoma/genética , Glioblastoma/patología , Glioblastoma/terapia , Interleucinas/genética , Interleucinas/metabolismo , Interleucinas/inmunología , Humanos , Animales , Ratones , Proteína delta de Unión al Potenciador CCAAT/metabolismo , Proteína delta de Unión al Potenciador CCAAT/genética , Neoplasias Encefálicas/inmunología , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patología , Neoplasias Encefálicas/terapia , Línea Celular Tumoral , Interleucina-15/genética , Interleucina-15/metabolismo , Interleucina-15/inmunología , Ensayos Antitumor por Modelo de Xenoinjerto
12.
Nat Med ; 30(3): 772-784, 2024 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-38238616

RESUMEN

There is a pressing need for allogeneic chimeric antigen receptor (CAR)-immune cell therapies that are safe, effective and affordable. We conducted a phase 1/2 trial of cord blood-derived natural killer (NK) cells expressing anti-CD19 chimeric antigen receptor and interleukin-15 (CAR19/IL-15) in 37 patients with CD19+ B cell malignancies. The primary objectives were safety and efficacy, defined as day 30 overall response (OR). Secondary objectives included day 100 response, progression-free survival, overall survival and CAR19/IL-15 NK cell persistence. No notable toxicities such as cytokine release syndrome, neurotoxicity or graft-versus-host disease were observed. The day 30 and day 100 OR rates were 48.6% for both. The 1-year overall survival and progression-free survival were 68% and 32%, respectively. Patients who achieved OR had higher levels and longer persistence of CAR-NK cells. Receiving CAR-NK cells from a cord blood unit (CBU) with nucleated red blood cells ≤ 8 × 107 and a collection-to-cryopreservation time ≤ 24 h was the most significant predictor for superior outcome. NK cells from these optimal CBUs were highly functional and enriched in effector-related genes. In contrast, NK cells from suboptimal CBUs had upregulation of inflammation, hypoxia and cellular stress programs. Finally, using multiple mouse models, we confirmed the superior antitumor activity of CAR/IL-15 NK cells from optimal CBUs in vivo. These findings uncover new features of CAR-NK cell biology and underscore the importance of donor selection for allogeneic cell therapies. ClinicalTrials.gov identifier: NCT03056339 .


Asunto(s)
Trasplante de Células Madre Hematopoyéticas , Neoplasias , Receptores Quiméricos de Antígenos , Animales , Ratones , Humanos , Receptores Quiméricos de Antígenos/genética , Interleucina-15 , Células Asesinas Naturales , Inmunoterapia Adoptiva/efectos adversos , Antígenos CD19 , Proteínas Adaptadoras Transductoras de Señales
13.
Sci Transl Med ; 16(764): eadp0004, 2024 Sep 11.
Artículo en Inglés | MEDLINE | ID: mdl-39259809

RESUMEN

Myelodysplastic syndrome and acute myeloid leukemia (AML) belong to a continuous disease spectrum of myeloid malignancies with poor prognosis in the relapsed/refractory setting necessitating novel therapies. Natural killer (NK) cells from patients with myeloid malignancies display global dysfunction with impaired killing capacity, altered metabolism, and an exhausted phenotype at the single-cell transcriptomic and proteomic levels. In this study, we identified that this dysfunction was mediated through a cross-talk between NK cells and myeloid blasts necessitating cell-cell contact. NK cell dysfunction could be prevented by targeting the αvß-integrin/TGF-ß/SMAD pathway but, once established, was persistent because of profound epigenetic reprogramming. We identified BATF as a core transcription factor and the main mediator of this NK cell dysfunction in AML. Mechanistically, we found that BATF was directly regulated and induced by SMAD2/3 and, in turn, bound to key genes related to NK cell exhaustion, such as HAVCR2, LAG3, TIGIT, and CTLA4. BATF deletion enhanced NK cell function against AML in vitro and in vivo. Collectively, our findings reveal a previously unidentified mechanism of NK immune evasion in AML manifested by epigenetic rewiring and inactivation of NK cells by myeloid blasts. This work highlights the importance of using healthy allogeneic NK cells as an adoptive cell therapy to treat patients with myeloid malignancies combined with strategies aimed at preventing the dysfunction by targeting the TGF-ß pathway or BATF.


Asunto(s)
Factores de Transcripción con Cremalleras de Leucina de Carácter Básico , Epigénesis Genética , Células Asesinas Naturales , Leucemia Mieloide Aguda , Leucemia Mieloide Aguda/genética , Leucemia Mieloide Aguda/patología , Leucemia Mieloide Aguda/inmunología , Humanos , Factores de Transcripción con Cremalleras de Leucina de Carácter Básico/metabolismo , Factores de Transcripción con Cremalleras de Leucina de Carácter Básico/genética , Células Asesinas Naturales/metabolismo , Células Asesinas Naturales/inmunología , Animales , Factor de Crecimiento Transformador beta/metabolismo , Transducción de Señal , Ratones , Reprogramación Celular , Proteína smad3/metabolismo , Proteína Smad2/metabolismo
14.
Res Sq ; 2023 Jul 26.
Artículo en Inglés | MEDLINE | ID: mdl-37547002

RESUMEN

Clustering and visualization are essential parts of single-cell gene expression data analysis. The Euclidean distance used in most distance-based methods is not optimal. The batch effect, i.e., the variability among samples gathered from different times, tissues, and patients, introduces large between-group distance and obscures the true identities of cells. To solve this problem, we introduce Batch-Corrected Distance (BCD), a metric using temporal/spatial locality of the batch effect to control for such factors. We validate BCD on simulated data as well as applied it to a mouse retina development dataset and a lung dataset. We also found the utility of our approach in understanding the progression of the Coronavirus Disease 2019 (COVID-19). BCD achieves more accurate clusters and better visualizations than state-of-the-art batch correction methods on longitudinal datasets. BCD can be directly integrated with most clustering and visualization methods to enable more scientific findings.

15.
J Bioinform Syst Biol ; 6(2): 74-81, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-39301431

RESUMEN

We present novoRNABreak, a unified framework for cancer specific novel splice junction and fusion transcript detection in RNA-seq data obtained from human cancer samples. novoRNABreak is based on a local assembly model, which offers a tradeoff between the alignment-based and de novo whole transcriptome assembly (WTA) methods. This approach is accurate and sensitive in assembling novel junctions that are difficult to directly align or have multiple alignments. Additionally, it is more efficient due to the strategy that focuses on junctions rather than full length transcripts. The performance of novoRNABreak is demonstrated by a comprehensive set of experiments using synthetic data generated based on genome reference, as well as real RNA-seq data from breast cancer and prostate cancer samples. The results show that our tool has a better performance by fully utilizing unmapped reads and precisely identifying the junctions where short reads or small exons have multiple alignments. novoRNABreak is a fully-fledged program available on GitHub (https://github.com/KChen-lab/novoRNABreak).

16.
Nat Biotechnol ; 2023 Aug 17.
Artículo en Inglés | MEDLINE | ID: mdl-37592035

RESUMEN

Single-cell omics technologies enable molecular characterization of diverse cell types and states, but how the resulting transcriptional and epigenetic profiles depend on the cell's genetic background remains understudied. We describe Monopogen, a computational tool to detect single-nucleotide variants (SNVs) from single-cell sequencing data. Monopogen leverages linkage disequilibrium from external reference panels to identify germline SNVs and detects putative somatic SNVs using allele cosegregating patterns at the cell population level. It can identify 100 K to 3 M germline SNVs achieving a genotyping accuracy of 95%, together with hundreds of putative somatic SNVs. Monopogen-derived genotypes enable global and local ancestry inference and identification of admixed samples. It identifies variants associated with cardiomyocyte metabolic levels and epigenomic programs. It also improves putative somatic SNV detection that enables clonal lineage tracing in primary human clonal hematopoiesis. Monopogen brings together population genetics, cell lineage tracing and single-cell omics to uncover genetic determinants of cellular processes.

17.
bioRxiv ; 2023 Dec 19.
Artículo en Inglés | MEDLINE | ID: mdl-38187699

RESUMEN

Key to understanding many biological phenomena is knowing the temporal ordering of cellular events, which often require continuous direct observations [1, 2]. An alternative solution involves the utilization of irreversible genetic changes, such as naturally occurring mutations, to create indelible markers that enables retrospective temporal ordering [3-8]. Using NSC-seq, a newly designed and validated multi-purpose single-cell CRISPR platform, we developed a molecular clock approach to record the timing of cellular events and clonality in vivo , while incorporating assigned cell state and lineage information. Using this approach, we uncovered precise timing of tissue-specific cell expansion during murine embryonic development and identified new intestinal epithelial progenitor states by their unique genetic histories. NSC-seq analysis of murine adenomas and single-cell multi-omic profiling of human precancers as part of the Human Tumor Atlas Network (HTAN), including 116 scRNA-seq datasets and clonal analysis of 418 human polyps, demonstrated the occurrence of polyancestral initiation in 15-30% of colonic precancers, revealing their origins from multiple normal founders. Thus, our multimodal framework augments existing single-cell analyses and lays the foundation for in vivo multimodal recording, enabling the tracking of lineage and temporal events during development and tumorigenesis.

18.
Sci Adv ; 9(30): eadd6997, 2023 07 28.
Artículo en Inglés | MEDLINE | ID: mdl-37494448

RESUMEN

Chimeric antigen receptor (CAR) engineering of natural killer (NK) cells is promising, with early-phase clinical studies showing encouraging responses. However, the transcriptional signatures that control the fate of CAR-NK cells after infusion and factors that influence tumor control remain poorly understood. We performed single-cell RNA sequencing and mass cytometry to study the heterogeneity of CAR-NK cells and their in vivo evolution after adoptive transfer, from the phase of tumor control to relapse. Using a preclinical model of noncurative lymphoma and samples from a responder and a nonresponder patient treated with CAR19/IL-15 NK cells, we observed the emergence of NK cell clusters with distinct patterns of activation, function, and metabolic signature associated with different phases of in vivo evolution and tumor control. Interaction with the highly metabolically active tumor resulted in loss of metabolic fitness in NK cells that could be partly overcome by incorporation of IL-15 in the CAR construct.


Asunto(s)
Receptores Quiméricos de Antígenos , Humanos , Receptores Quiméricos de Antígenos/genética , Receptores Quiméricos de Antígenos/metabolismo , Interleucina-15/genética , Interleucina-15/metabolismo , Citocinas/metabolismo , Línea Celular Tumoral , Células Asesinas Naturales , Tratamiento Basado en Trasplante de Células y Tejidos
19.
Genome Biol ; 23(1): 112, 2022 05 09.
Artículo en Inglés | MEDLINE | ID: mdl-35534898

RESUMEN

Integration of single-cell multiomics profiles generated by different single-cell technologies from the same biological sample is still challenging. Previous approaches based on shared features have only provided approximate solutions. Here, we present a novel mathematical solution named bi-order canonical correlation analysis (bi-CCA), which extends the widely used CCA approach to iteratively align the rows and the columns between data matrices. Bi-CCA is generally applicable to combinations of any two single-cell modalities. Validations using co-assayed ground truth data and application to a CAR-NK study and a fetal muscle atlas demonstrate its capability in generating accurate multimodal co-embeddings and discovering cellular identity.

20.
Nat Commun ; 13(1): 474, 2022 01 25.
Artículo en Inglés | MEDLINE | ID: mdl-35078987

RESUMEN

The specificity of CRISPR/Cas9 genome editing is largely determined by the sequences of guide RNA (gRNA) and the targeted DNA, yet the sequence-dependent rules underlying off-target effects are not fully understood. To systematically explore the sequence determinants governing CRISPR/Cas9 specificity, here we describe a dual-target system to measure the relative cleavage rate between off- and on-target sequences (off-on ratios) of 1902 gRNAs on 13,314 synthetic target sequences, and reveal a set of sequence rules involving 2 factors in off-targeting: 1) a guide-intrinsic mismatch tolerance (GMT) independent of the mismatch context; 2) an "epistasis-like" combinatorial effect of multiple mismatches, which are associated with the free-energy landscape in R-loop formation and are explainable by a multi-state kinetic model. These sequence rules lead to the development of MOFF, a model-based predictor of Cas9-mediated off-target effects. Moreover, the "epistasis-like" combinatorial effect suggests a strategy of allele-specific genome editing using mismatched guides. With the aid of MOFF prediction, this strategy significantly improves the selectivity and expands the application domain of Cas9-based allele-specific editing, as tested in a high-throughput allele-editing screen on 18 cancer hotspot mutations.


Asunto(s)
Secuencia de Bases/genética , Sistemas CRISPR-Cas , Edición Génica/métodos , Mutación , Neoplasias/terapia , ARN Guía de Kinetoplastida/química , Línea Celular , Humanos , Neoplasias/genética , Neoplasias/patología , ARN Guía de Kinetoplastida/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA