Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 34
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
Cell ; 181(5): 1112-1130.e16, 2020 05 28.
Artículo en Inglés | MEDLINE | ID: mdl-32470399

RESUMEN

Acute physical activity leads to several changes in metabolic, cardiovascular, and immune pathways. Although studies have examined selected changes in these pathways, the system-wide molecular response to an acute bout of exercise has not been fully characterized. We performed longitudinal multi-omic profiling of plasma and peripheral blood mononuclear cells including metabolome, lipidome, immunome, proteome, and transcriptome from 36 well-characterized volunteers, before and after a controlled bout of symptom-limited exercise. Time-series analysis revealed thousands of molecular changes and an orchestrated choreography of biological processes involving energy metabolism, oxidative stress, inflammation, tissue repair, and growth factor response, as well as regulatory pathways. Most of these processes were dampened and some were reversed in insulin-resistant participants. Finally, we discovered biological pathways involved in cardiopulmonary exercise response and developed prediction models revealing potential resting blood-based biomarkers of peak oxygen consumption.


Asunto(s)
Metabolismo Energético/fisiología , Ejercicio Físico/fisiología , Anciano , Biomarcadores/metabolismo , Femenino , Humanos , Insulina/metabolismo , Resistencia a la Insulina , Leucocitos Mononucleares/metabolismo , Estudios Longitudinales , Masculino , Metaboloma , Persona de Mediana Edad , Oxígeno/metabolismo , Consumo de Oxígeno , Proteoma , Transcriptoma
2.
Am J Hum Genet ; 108(10): 1866-1879, 2021 10 07.
Artículo en Inglés | MEDLINE | ID: mdl-34582792

RESUMEN

Complex traits and diseases can be influenced by both genetics and environment. However, given the large number of environmental stimuli and power challenges for gene-by-environment testing, it remains a critical challenge to identify and prioritize specific disease-relevant environmental exposures. We propose a framework for leveraging signals from transcriptional responses to environmental perturbations to identify disease-relevant perturbations that can modulate genetic risk for complex traits and inform the functions of genetic variants associated with complex traits. We perturbed human skeletal-muscle-, fat-, and liver-relevant cell lines with 21 perturbations affecting insulin resistance, glucose homeostasis, and metabolic regulation in humans and identified thousands of environmentally responsive genes. By combining these data with GWASs from 31 distinct polygenic traits, we show that the heritability of multiple traits is enriched in regions surrounding genes responsive to specific perturbations and, further, that environmentally responsive genes are enriched for associations with specific diseases and phenotypes from the GWAS Catalog. Overall, we demonstrate the advantages of large-scale characterization of transcriptional changes in diversely stimulated and pathologically relevant cells to identify disease-relevant perturbations.


Asunto(s)
Interacción Gen-Ambiente , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Enfermedades Autoinmunes/etiología , Enfermedades Autoinmunes/patología , Humanos , Trastornos Mentales/etiología , Trastornos Mentales/patología , Enfermedades Metabólicas/etiología , Enfermedades Metabólicas/patología , Fenotipo
3.
PLoS Comput Biol ; 18(2): e1009838, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-35130266

RESUMEN

The ability to predict human phenotypes and identify biomarkers of disease from metagenomic data is crucial for the development of therapeutics for microbiome-associated diseases. However, metagenomic data is commonly affected by technical variables unrelated to the phenotype of interest, such as sequencing protocol, which can make it difficult to predict phenotype and find biomarkers of disease. Supervised methods to correct for background noise, originally designed for gene expression and RNA-seq data, are commonly applied to microbiome data but may be limited because they cannot account for unmeasured sources of variation. Unsupervised approaches address this issue, but current methods are limited because they are ill-equipped to deal with the unique aspects of microbiome data, which is compositional, highly skewed, and sparse. We perform a comparative analysis of the ability of different denoising transformations in combination with supervised correction methods as well as an unsupervised principal component correction approach that is presently used in other domains but has not been applied to microbiome data to date. We find that the unsupervised principal component correction approach has comparable ability in reducing false discovery of biomarkers as the supervised approaches, with the added benefit of not needing to know the sources of variation apriori. However, in prediction tasks, it appears to only improve prediction when technical variables contribute to the majority of variance in the data. As new and larger metagenomic datasets become increasingly available, background noise correction will become essential for generating reproducible microbiome analyses.


Asunto(s)
Microbioma Gastrointestinal , Humanos
4.
Genome Res ; 26(6): 768-77, 2016 06.
Artículo en Inglés | MEDLINE | ID: mdl-27197214

RESUMEN

The X Chromosome, with its unique mode of inheritance, contributes to differences between the sexes at a molecular level, including sex-specific gene expression and sex-specific impact of genetic variation. Improving our understanding of these differences offers to elucidate the molecular mechanisms underlying sex-specific traits and diseases. However, to date, most studies have either ignored the X Chromosome or had insufficient power to test for the sex-specific impact of genetic variation. By analyzing whole blood transcriptomes of 922 individuals, we have conducted the first large-scale, genome-wide analysis of the impact of both sex and genetic variation on patterns of gene expression, including comparison between the X Chromosome and autosomes. We identified a depletion of expression quantitative trait loci (eQTL) on the X Chromosome, especially among genes under high selective constraint. In contrast, we discovered an enrichment of sex-specific regulatory variants on the X Chromosome. To resolve the molecular mechanisms underlying such effects, we generated chromatin accessibility data through ATAC-sequencing to connect sex-specific chromatin accessibility to sex-specific patterns of expression and regulatory variation. As sex-specific regulatory variants discovered in our study can inform sex differences in heritable disease prevalence, we integrated our data with genome-wide association study data for multiple immune traits identifying several traits with significant sex biases in genetic susceptibilities. Together, our study provides genome-wide insight into how genetic variation, the X Chromosome, and sex shape human gene regulation and disease.


Asunto(s)
Cromosomas Humanos X/genética , Transcriptoma , Femenino , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad , Genoma Humano , Humanos , Masculino , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Caracteres Sexuales
5.
Biom J ; 61(3): 747-768, 2019 05.
Artículo en Inglés | MEDLINE | ID: mdl-30693553

RESUMEN

Marginal tests based on individual SNPs are routinely used in genetic association studies. Studies have shown that haplotype-based methods may provide more power in disease mapping than methods based on single markers when, for example, multiple disease-susceptibility variants occur within the same gene. A limitation of haplotype-based methods is that the number of parameters increases exponentially with the number of SNPs, inducing a commensurate increase in the degrees of freedom and weakening the power to detect associations. To address this limitation, we introduce a hierarchical linkage disequilibrium model for disease mapping, based on a reparametrization of the multinomial haplotype distribution, where every parameter corresponds to the cumulant of each possible subset of a set of loci. This hierarchy present in the parameters enables us to employ flexible testing strategies over a range of parameter sets: from standard single SNP analyses through the full haplotype distribution tests, reducing degrees of freedom and increasing the power to detect associations. We show via extensive simulations that our approach maintains the type I error at nominal level and has increased power under many realistic scenarios, as compared to single SNP and standard haplotype-based studies. To evaluate the performance of our proposed methodology in real data, we analyze genome-wide data from the Wellcome Trust Case-Control Consortium.


Asunto(s)
Biometría/métodos , Haplotipos , Desequilibrio de Ligamiento , Artritis Reumatoide/genética , Sitios Genéticos/genética , Estudio de Asociación del Genoma Completo , Humanos , Cirrosis Hepática Biliar/genética , Polimorfismo de Nucleótido Simple
6.
Genet Epidemiol ; 39(3): 156-65, 2015 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-25620726

RESUMEN

Integrative omics, the joint analysis of outcome and multiple types of omics data, such as genomics, epigenomics, and transcriptomics data, constitute a promising approach for powerful and biologically relevant association studies. These studies often employ a case-control design, and often include nonomics covariates, such as age and gender, that may modify the underlying omics risk factors. An open question is how to best integrate multiple omics and nonomics information to maximize statistical power in case-control studies that ascertain individuals based on the phenotype. Recent work on integrative omics have used prospective approaches, modeling case-control status conditional on omics, and nonomics risk factors. Compared to univariate approaches, jointly analyzing multiple risk factors with a prospective approach increases power in nonascertained cohorts. However, these prospective approaches often lose power in case-control studies. In this article, we propose a novel statistical method for integrating multiple omics and nonomics factors in case-control association studies. Our method is based on a retrospective likelihood function that models the joint distribution of omics and nonomics factors conditional on case-control status. The new method provides accurate control of Type I error rate and has increased efficiency over prospective approaches in both simulated and real data.


Asunto(s)
Estudios de Casos y Controles , Epigenómica/métodos , Estudio de Asociación del Genoma Completo , Genómica/métodos , Funciones de Verosimilitud , Modelos Genéticos , Esclerosis Múltiple/genética , Humanos , Fenotipo , Estudios Prospectivos , Estudios Retrospectivos
7.
Genet Epidemiol ; 38 Suppl 1: S37-43, 2014 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-25112186

RESUMEN

With the advance of next-generation sequencing technologies in recent years, rare genetic variant data have now become available for genetic epidemiology studies. For family samples, however, only a few statistical methods for association analysis of rare genetic variants have been developed. Rare variant approaches are of great interest, particularly for family data, because samples enriched for trait-relevant variants can be ascertained and rare variants are putatively enriched through segregation. To facilitate the evaluation of existing and new rare variant testing approaches for analyzing family data, Genetic Analysis Workshop 18 (GAW18) provided genotype and next-generation sequencing data and longitudinal blood pressure traits from extended pedigrees of Mexican American families from the San Antonio Family Study. Our GAW18 group members analyzed real and simulated phenotype data from GAW18 by using generalized linear mixed-effects models or principal components to adjust for familial correlation or by testing binary traits using a correction factor for familial effects. With one exception, approaches dealt with the extended pedigrees in their original state using information based on the kinship matrix or alternative genetic similarity measures. For simulated data our group demonstrated that the family-based kernel machine score test is superior in power to family-based single-marker or burden tests, except in a few specific scenarios. For real data three contributions identified significant associations. They substantially reduced the number of tests before performing the association analysis. We conclude from our real data analyses that further development of strategies for targeted testing or more focused screening of genetic variants is strongly desirable.


Asunto(s)
Estudios de Asociación Genética , Variación Genética , Linaje , Presión Sanguínea/genética , Pruebas Genéticas , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Hipertensión/genética , Hipertensión/patología , Modelos Lineales , Fenotipo , Polimorfismo de Nucleótido Simple , Análisis de Componente Principal , Análisis de Secuencia de ADN
8.
NPJ Digit Med ; 7(1): 49, 2024 Feb 28.
Artículo en Inglés | MEDLINE | ID: mdl-38418551

RESUMEN

Over the last ten years, there has been considerable progress in using digital behavioral phenotypes, captured passively and continuously from smartphones and wearable devices, to infer depressive mood. However, most digital phenotype studies suffer from poor replicability, often fail to detect clinically relevant events, and use measures of depression that are not validated or suitable for collecting large and longitudinal data. Here, we report high-quality longitudinal validated assessments of depressive mood from computerized adaptive testing paired with continuous digital assessments of behavior from smartphone sensors for up to 40 weeks on 183 individuals experiencing mild to severe symptoms of depression. We apply a combination of cubic spline interpolation and idiographic models to generate individualized predictions of future mood from the digital behavioral phenotypes, achieving high prediction accuracy of depression severity up to three weeks in advance (R2 ≥ 80%) and a 65.7% reduction in the prediction error over a baseline model which predicts future mood based on past depression severity alone. Finally, our study verified the feasibility of obtaining high-quality longitudinal assessments of mood from a clinical population and predicting symptom severity weeks in advance using passively collected digital behavioral data. Our results indicate the possibility of expanding the repertoire of patient-specific behavioral measures to enable future psychiatric research.

9.
Cell Genom ; 4(1): 100460, 2024 Jan 10.
Artículo en Inglés | MEDLINE | ID: mdl-38190099

RESUMEN

Single-nucleotide polymorphisms (SNPs) near the ERAP2 gene are associated with various autoimmune conditions, as well as protection against lethal infections. Due to high linkage disequilibrium, numerous trait-associated SNPs are correlated with ERAP2 expression; however, their functional mechanisms remain unidentified. We show by reciprocal allelic replacement that ERAP2 expression is directly controlled by the splice region variant rs2248374. However, disease-associated variants in the downstream LNPEP gene promoter are independently associated with ERAP2 expression. Allele-specific conformation capture assays revealed long-range chromatin contacts between the gene promoters of LNPEP and ERAP2 and showed that interactions were stronger in patients carrying the alleles that increase susceptibility to autoimmune diseases. Replacing the SNPs in the LNPEP promoter by reference sequences lowered ERAP2 expression. These findings show that multiple SNPs act in concert to regulate ERAP2 expression and that disease-associated variants can convert a gene promoter region into a potent enhancer of a distal gene.


Asunto(s)
Enfermedades Autoinmunes , Polimorfismo de Nucleótido Simple , Humanos , Polimorfismo de Nucleótido Simple/genética , Predisposición Genética a la Enfermedad/genética , Enfermedades Autoinmunes/genética , Regiones Promotoras Genéticas/genética , Aminopeptidasas/genética
10.
Genet Epidemiol ; 36(8): 811-9, 2012 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-22851506

RESUMEN

It is hypothesized that certain alleles can have a protective effect not only when inherited by the offspring but also as noninherited maternal antigens (NIMA). To estimate the NIMA effect, large samples of families are needed. When large samples are not available, we propose a combined approach to estimate the NIMA effect from ascertained nuclear families and twin pairs. We develop a likelihood-based approach allowing for several ascertainment schemes, to accommodate for the outcome-dependent sampling scheme, and a family-specific random term, to take into account the correlation between family members. We estimate the parameters using maximum likelihood based on the combined joint likelihood (CJL) approach. Simulations show that the CJL is more efficient for estimating the NIMA odds ratios as compared to a families-only approach. To illustrate our approach, we used data from a family and a twin study from the United Kingdom on rheumatoid arthritis, and confirmed the protective NIMA effect, with an odds ratio of 0.477 (95% CI 0.264-0.864).


Asunto(s)
Antígenos/inmunología , Artritis Reumatoide/genética , Familia , Estudios de Asociación Genética/métodos , Modelos Genéticos , Gemelos/genética , Alelos , Antígenos/genética , Artritis Reumatoide/inmunología , Femenino , Genotipo , Humanos , Funciones de Verosimilitud , Madres , Oportunidad Relativa , Penetrancia , Reino Unido
11.
Nat Commun ; 14(1): 4214, 2023 07 14.
Artículo en Inglés | MEDLINE | ID: mdl-37452040

RESUMEN

Obesity-induced adipose tissue dysfunction can cause low-grade inflammation and downstream obesity comorbidities. Although preadipocytes may contribute to this pro-inflammatory environment, the underlying mechanisms are unclear. We used human primary preadipocytes from body mass index (BMI) -discordant monozygotic (MZ) twin pairs to generate epigenetic (ATAC-sequence) and transcriptomic (RNA-sequence) data for testing whether increased BMI alters the subnuclear compartmentalization of open chromatin in the twins' preadipocytes, causing downstream inflammation. Here we show that the co-accessibility of open chromatin, i.e. compartmentalization of chromatin activity, is altered in the higher vs lower BMI MZ siblings for a large subset ( ~ 88.5 Mb) of the active subnuclear compartments. Using the UK Biobank we show that variants within these regions contribute to systemic inflammation through interactions with BMI on C-reactive protein. In summary, open chromatin co-accessibility in human preadipocytes is disrupted among the higher BMI siblings, suggesting a mechanism how obesity may lead to inflammation via gene-environment interactions.


Asunto(s)
Inflamación , Obesidad , Humanos , Índice de Masa Corporal , Cromatina , Inflamación/genética , Obesidad/metabolismo , Gemelos Monocigóticos
12.
Front Genet ; 14: 997383, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-36999049

RESUMEN

RNA sequencing (RNA-seq) has become an exemplary technology in modern biology and clinical science. Its immense popularity is due in large part to the continuous efforts of the bioinformatics community to develop accurate and scalable computational tools to analyze the enormous amounts of transcriptomic data that it produces. RNA-seq analysis enables genes and their corresponding transcripts to be probed for a variety of purposes, such as detecting novel exons or whole transcripts, assessing expression of genes and alternative transcripts, and studying alternative splicing structure. It can be a challenge, however, to obtain meaningful biological signals from raw RNA-seq data because of the enormous scale of the data as well as the inherent limitations of different sequencing technologies, such as amplification bias or biases of library preparation. The need to overcome these technical challenges has pushed the rapid development of novel computational tools, which have evolved and diversified in accordance with technological advancements, leading to the current myriad of RNA-seq tools. These tools, combined with the diverse computational skill sets of biomedical researchers, help to unlock the full potential of RNA-seq. The purpose of this review is to explain basic concepts in the computational analysis of RNA-seq data and define discipline-specific jargon.

13.
Nat Med ; 29(7): 1845-1856, 2023 07.
Artículo en Inglés | MEDLINE | ID: mdl-37464048

RESUMEN

An individual's disease risk is affected by the populations that they belong to, due to shared genetics and environmental factors. The study of fine-scale populations in clinical care is important for identifying and reducing health disparities and for developing personalized interventions. To assess patterns of clinical diagnoses and healthcare utilization by fine-scale populations, we leveraged genetic data and electronic medical records from 35,968 patients as part of the UCLA ATLAS Community Health Initiative. We defined clusters of individuals using identity by descent, a form of genetic relatedness that utilizes shared genomic segments arising due to a common ancestor. In total, we identified 376 clusters, including clusters with patients of Afro-Caribbean, Puerto Rican, Lebanese Christian, Iranian Jewish and Gujarati ancestry. Our analysis uncovered 1,218 significant associations between disease diagnoses and clusters and 124 significant associations with specialty visits. We also examined the distribution of pathogenic alleles and found 189 significant alleles at elevated frequency in particular clusters, including many that are not regularly included in population screening efforts. Overall, this work progresses the understanding of health in understudied communities and can provide the foundation for further study into health inequities.


Asunto(s)
Atención a la Salud , Aceptación de la Atención de Salud , Humanos , Los Angeles , Irán , Etnicidad
14.
bioRxiv ; 2023 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-37986808

RESUMEN

Mapping the functional human genome and impact of genetic variants is often limited to European-descendent population samples. To aid in overcoming this limitation, we measured gene expression using RNA sequencing in lymphoblastoid cell lines (LCLs) from 599 individuals from six African populations to identify novel transcripts including those not represented in the hg38 reference genome. We used whole genomes from the 1000 Genomes Project and 164 Maasai individuals to identify 8,881 expression and 6,949 splicing quantitative trait loci (eQTLs/sQTLs), and 2,611 structural variants associated with gene expression (SV-eQTLs). We further profiled chromatin accessibility using ATAC-Seq in a subset of 100 representative individuals, to identity chromatin accessibility quantitative trait loci (caQTLs) and allele-specific chromatin accessibility, and provide predictions for the functional effect of 78.9 million variants on chromatin accessibility. Using this map of eQTLs and caQTLs we fine-mapped GWAS signals for a range of complex diseases. Combined, this work expands global functional genomic data to identify novel transcripts, functional elements and variants, understand population genetic history of molecular quantitative trait loci, and further resolve the genetic basis of multiple human traits and disease.

15.
Nat Commun ; 13(1): 5704, 2022 09 28.
Artículo en Inglés | MEDLINE | ID: mdl-36171194

RESUMEN

A majority of the variants identified in genome-wide association studies fall in non-coding regions of the genome, indicating their mechanism of impact is mediated via gene expression. Leveraging this hypothesis, transcriptome-wide association studies (TWAS) have assisted in both the interpretation and discovery of additional genes associated with complex traits. However, existing methods for conducting TWAS do not take full advantage of the intra-individual correlation inherently present in multi-context expression studies and do not properly adjust for multiple testing across contexts. We introduce CONTENT-a computationally efficient method with proper cross-context false discovery correction that leverages correlation structure across contexts to improve power and generate context-specific and context-shared components of expression. We apply CONTENT to bulk multi-tissue and single-cell RNA-seq data sets and show that CONTENT leads to a 42% (bulk) and 110% (single cell) increase in the number of genetically predicted genes relative to previous approaches. We find the context-specific component of expression comprises 30% of heritability in tissue-level bulk data and 75% in single-cell data, consistent with cell-type heterogeneity in bulk tissue. In the context of TWAS, CONTENT increases the number of locus-phenotype associations discovered by over 51% relative to previous methods across 22 complex traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Sitios de Carácter Cuantitativo , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo/genética , Transcriptoma/genética
16.
Genome Med ; 14(1): 31, 2022 03 15.
Artículo en Inglés | MEDLINE | ID: mdl-35292083

RESUMEN

BACKGROUND: Identification of causal genes for polygenic human diseases has been extremely challenging, and our understanding of how physiological and pharmacological stimuli modulate genetic risk at disease-associated loci is limited. Specifically, insulin resistance (IR), a common feature of cardiometabolic disease, including type 2 diabetes, obesity, and dyslipidemia, lacks well-powered genome-wide association studies (GWAS), and therefore, few associated loci and causal genes have been identified. METHODS: Here, we perform and integrate linkage disequilibrium (LD)-adjusted colocalization analyses across nine cardiometabolic traits (fasting insulin, fasting glucose, insulin sensitivity, insulin sensitivity index, type 2 diabetes, triglycerides, high-density lipoprotein, body mass index, and waist-hip ratio) combined with expression and splicing quantitative trait loci (eQTLs and sQTLs) from five metabolically relevant human tissues (subcutaneous and visceral adipose, skeletal muscle, liver, and pancreas). To elucidate the upstream regulators and functional mechanisms for these genes, we integrate their transcriptional responses to 21 relevant physiological and pharmacological perturbations in human adipocytes, hepatocytes, and skeletal muscle cells and map their protein-protein interactions. RESULTS: We identify 470 colocalized loci and prioritize 207 loci with a single colocalized gene. Patterns of shared colocalizations across traits and tissues highlight different potential roles for colocalized genes in cardiometabolic disease and distinguish several genes involved in pancreatic ß-cell function from others with a more direct role in skeletal muscle, liver, and adipose tissues. At the loci with a single colocalized gene, 42 of these genes were regulated by insulin and 35 by glucose in perturbation experiments, including 17 regulated by both. Other metabolic perturbations regulated the expression of 30 more genes not regulated by glucose or insulin, pointing to other potential upstream regulators of candidate causal genes. CONCLUSIONS: Our use of transcriptional responses under metabolic perturbations to contextualize genetic associations from our custom colocalization approach provides a list of likely causal genes and their upstream regulators in the context of IR-associated cardiometabolic risk.


Asunto(s)
Enfermedades Cardiovasculares , Diabetes Mellitus Tipo 2 , Resistencia a la Insulina , Enfermedades Cardiovasculares/genética , Diabetes Mellitus Tipo 2/genética , Estudio de Asociación del Genoma Completo , Humanos , Resistencia a la Insulina/genética , Sitios de Carácter Cuantitativo
17.
Genome Med ; 14(1): 104, 2022 Sep 09.
Artículo en Inglés | MEDLINE | ID: mdl-36085083

RESUMEN

BACKGROUND: Large medical centers in urban areas, like Los Angeles, care for a diverse patient population and offer the potential to study the interplay between genetic ancestry and social determinants of health. Here, we explore the implications of genetic ancestry within the University of California, Los Angeles (UCLA) ATLAS Community Health Initiative-an ancestrally diverse biobank of genomic data linked with de-identified electronic health records (EHRs) of UCLA Health patients (N=36,736). METHODS: We quantify the extensive continental and subcontinental genetic diversity within the ATLAS data through principal component analysis, identity-by-descent, and genetic admixture. We assess the relationship between genetically inferred ancestry (GIA) and >1500 EHR-derived phenotypes (phecodes). Finally, we demonstrate the utility of genetic data linked with EHR to perform ancestry-specific and multi-ancestry genome and phenome-wide scans across a broad set of disease phenotypes. RESULTS: We identify 5 continental-scale GIA clusters including European American (EA), African American (AA), Hispanic Latino American (HL), South Asian American (SAA) and East Asian American (EAA) individuals and 7 subcontinental GIA clusters within the EAA GIA corresponding to Chinese American, Vietnamese American, and Japanese American individuals. Although we broadly find that self-identified race/ethnicity (SIRE) is highly correlated with GIA, we still observe marked differences between the two, emphasizing that the populations defined by these two criteria are not analogous. We find a total of 259 significant associations between continental GIA and phecodes even after accounting for individuals' SIRE, demonstrating that for some phenotypes, GIA provides information not already captured by SIRE. GWAS identifies significant associations for liver disease in the 22q13.31 locus across the HL and EAA GIA groups (HL p-value=2.32×10-16, EAA p-value=6.73×10-11). A subsequent PheWAS at the top SNP reveals significant associations with neurologic and neoplastic phenotypes specifically within the HL GIA group. CONCLUSIONS: Overall, our results explore the interplay between SIRE and GIA within a disease context and underscore the utility of studying the genomes of diverse individuals through biobank-scale genotyping linked with EHR-based phenotyping.


Asunto(s)
Registros Electrónicos de Salud , Salud Pública , Pueblo Asiatico , Bancos de Muestras Biológicas , Genómica , Humanos
18.
Science ; 376(6589): eabf1970, 2022 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-35389781

RESUMEN

Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease. Knowledge of circulating immune cell types and states associated with SLE remains incomplete. We profiled more than 1.2 million peripheral blood mononuclear cells (162 cases, 99 controls) with multiplexed single-cell RNA sequencing (mux-seq). Cases exhibited elevated expression of type 1 interferon-stimulated genes (ISGs) in monocytes, reduction of naïve CD4+ T cells that correlated with monocyte ISG expression, and expansion of repertoire-restricted cytotoxic GZMH+ CD8+ T cells. Cell type-specific expression features predicted case-control status and stratified patients into two molecular subtypes. We integrated dense genotyping data to map cell type-specific cis-expression quantitative trait loci and to link SLE-associated variants to cell type-specific expression. These results demonstrate mux-seq as a systematic approach to characterize cellular composition, identify transcriptional signatures, and annotate genetic variants associated with SLE.


Asunto(s)
Interferón Tipo I , Lupus Eritematoso Sistémico , Linfocitos T CD8-positivos/metabolismo , Estudios de Casos y Controles , Humanos , Interferón Tipo I/metabolismo , Leucocitos Mononucleares , Lupus Eritematoso Sistémico/genética , RNA-Seq , Transcripción Genética
19.
Cancer Med ; 10(7): 2232-2241, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33314708

RESUMEN

BACKGROUND: Clinical, molecular, and histopathologic features guide treatment for neuroblastoma, but obtaining tumor tissue may cause complications and is subject to sampling error due to tumor heterogeneity. We hypothesized that image-defined risk factors (IDRFs) would reflect molecular features, histopathology, and clinical outcomes in neuroblastoma. METHODS: We performed a retrospective cohort study of 76 patients with neuroblastoma or ganglioneuroblastoma. Diagnostic CT scans were reviewed for 20 IDRFs, which were consolidated into five IDRF groups (involvement of multiple body compartments, vascular encasement, tumor infiltration of adjacent organs/structures, airway compression, or intraspinal extension). IDRF groups were analyzed for association with clinical, molecular, and histopathologic features of neuroblastoma. RESULTS: Patients with more IDRF groups had a higher risk of surgical complications (OR = 3.1, p = 0.001). Tumor vascular encasement was associated with increased risk of surgical complications (OR = 5.40, p = 0.009) and increased risk of undifferentiated/poorly differentiated histologic grade (OR = 11.11, p = 0.013). Tumor infiltration of adjacent organs and structures was associated with decreased survival (HR = 8.90, p = 0.007), MYCN amplification (OR = 9.91, p = 0.001), high MKI (OR = 6.20, p = 0.003), and increased risk of International Neuroblastoma Staging System stage 4 disease (OR = 8.96, p < 0.001). CONCLUSIONS: The presence of IDRFs at diagnosis was associated with high-risk clinical, molecular, and histopathologic features of neuroblastoma. The IDRF group tumor infiltration into adjacent organs and structures was associated with decreased survival. Collectively, these findings may assist surgical planning and medical management for neuroblastoma patients.


Asunto(s)
Neuroblastoma , Complicaciones Posoperatorias , Preescolar , Femenino , Ganglioneuroblastoma/diagnóstico por imagen , Ganglioneuroblastoma/genética , Ganglioneuroblastoma/patología , Ganglioneuroblastoma/cirugía , Genes myc , Humanos , Lactante , Estimación de Kaplan-Meier , Masculino , Clasificación del Tumor , Invasividad Neoplásica , Neuroblastoma/diagnóstico por imagen , Neuroblastoma/genética , Neuroblastoma/patología , Neuroblastoma/cirugía , Oportunidad Relativa , Complicaciones Posoperatorias/clasificación , Modelos de Riesgos Proporcionales , Estudios Retrospectivos , Factores de Riesgo , Estadísticas no Paramétricas , Tomografía Computarizada por Rayos X
20.
Genome Biol ; 22(1): 249, 2021 08 26.
Artículo en Inglés | MEDLINE | ID: mdl-34446078

RESUMEN

Aligning sequencing reads onto a reference is an essential step of the majority of genomic analysis pipelines. Computational algorithms for read alignment have evolved in accordance with technological advances, leading to today's diverse array of alignment methods. We provide a systematic survey of algorithmic foundations and methodologies across 107 alignment methods, for both short and long reads. We provide a rigorous experimental evaluation of 11 read aligners to demonstrate the effect of these underlying algorithms on speed and efficiency of read alignment. We discuss how general alignment algorithms have been tailored to the specific needs of various domains in biology.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Alineación de Secuencia , Genoma Humano , VIH/fisiología , Humanos , Metagenómica , Sulfitos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA