RESUMO
GPR17 is a G-protein-coupled receptor (GPCR) implicated in the regulation of glucose metabolism and energy homeostasis. Such evidence is primarily drawn from mouse knockout studies and suggests GPR17 as a potential novel therapeutic target for the treatment of metabolic diseases. However, links between human GPR17 genetic variants, downstream cellular signaling, and metabolic diseases have yet to be reported. Here, we analyzed GPR17 coding sequences from control and disease cohorts consisting of individuals with adverse clinical metabolic deficits including severe insulin resistance, hypercholesterolemia, and obesity. We identified 18 nonsynonymous GPR17 variants, including eight variants that were exclusive to the disease cohort. We characterized the protein expression levels, membrane localization, and downstream signaling profiles of nine GPR17 variants (F43L, V96M, V103M, D105N, A131T, G136S, R248Q, R301H, and G354V). These nine GPR17 variants had similar protein expression and subcellular localization as wild-type GPR17; however, they showed diverse downstream signaling profiles. GPR17-G136S lost the capacity for agonist-mediated cAMP, Ca2+, and ß-arrestin signaling. GPR17-V96M retained cAMP inhibition similar to GPR17-WT, but showed impaired Ca2+ and ß-arrestin signaling. GPR17-D105N displayed impaired cAMP and Ca2+ signaling, but unaffected agonist-stimulated ß-arrestin recruitment. The identification and functional profiling of naturally occurring human GPR17 variants from individuals with metabolic diseases revealed receptor variants with diverse signaling profiles, including differential signaling perturbations that resulted in GPCR signaling bias. Our findings provide a framework for structure-function relationship studies of GPR17 signaling and metabolic disease.
Assuntos
Síndrome Metabólica/genética , Mutação de Sentido Incorreto , Receptores Acoplados a Proteínas G/genética , Transdução de Sinais , Cálcio/metabolismo , AMP Cíclico/metabolismo , Células HEK293 , Humanos , Transporte Proteico , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo , beta-Arrestinas/metabolismoRESUMO
INTRODUCTION: Historically, high rates of actionable driver mutations have been reported in never-smokers with lung adenocarcinoma (ADC). In the era of modern, comprehensive cancer mutation sequencing, this relationship necessitates a more detailed analysis. METHODS: All Mount Sinai patients between January 1, 2015, and June 1, 2020, with a diagnosis of ADC of any stage with known smoking status who received genomic testing were included. Most patients were analyzed using the Sema4 hotspot panel or the Oncomine Comprehensive Assay version 3 next-generation sequencing (NGS) panel conducted at Sema4. Patients were considered fully genotyped if they were comprehensively analyzed for alterations in EGFR, KRAS, MET, ALK, RET, ROS1, BRAF, NTRK1-3, and ERBB2, otherwise they were considered partially genotyped. RESULTS: Two hundred and thirty-six never-smokers and 671 smokers met the above criteria. Of the never-smokers, 201 (85%) had a driver mutation with 167 (71%) considered actionable (ie, those with US Food and Drug Administration-approved agents). Among smokers, 439 (65%) had an identified driver mutation with 258 (38%) actionable (P < .0001). When comprehensively sequenced, 95% (70/74) of never-smokers had a driver mutation with 78% (58/74) actionable; whereas, for smokers, 75% (135/180) had a driver with only 47% (74/180) actionable (P < .0001). Within mutations groups, EGFR G719X and KRAS G12Cs were more common to smokers. For stage IV patients harboring EGFR-mutant tumors treated with EGFR-directed therapies, never-smokers had significantly improved OS compared to smokers (hazard ratio = 2.71; P = .025). In multivariable analysis, Asian ancestry and female sex remained significant predictors of (1) OS in stage IV patients and (2) likelihood of harboring a receptor of fusion-based driver. CONCLUSION: Comprehensive NGS revealed driver alterations in 95% of never-smokers, with the majority having an associated therapy available. All efforts should be exhausted to identify or rule out the presence of an actionable driver mutation in all metastatic lung ADC.
Assuntos
Adenocarcinoma de Pulmão , Neoplasias Pulmonares , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/patologia , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Neoplasias Pulmonares/tratamento farmacológico , Mutação , Proteínas Proto-Oncogênicas/genética , Proteínas Proto-Oncogênicas p21(ras)/genética , FumantesRESUMO
BACKGROUND: Racial disparities among clinical trial participants present a challenge to assess whether trial results can be generalized into patients representing diverse races and ethnicities. The objective of this study was to evaluate the impact of race and ethnicity on treatment response in patients with advanced non-small cell lung cancer (aNSCLC) treated with programmed cell death-1 (PD-1) or programmed cell death-ligand 1 (PD-L1) inhibitors through analysis of real-world data (RWD). MATERIALS AND METHODS: A retrospective cohort study of 11,138 patients with lung cancer treated at hospitals within the Mount Sinai Health System was performed. Patients with confirmed aNSCLC who received anti-PD-1/PD-L1 treatment were analyzed for clinical outcomes. Our cohort included 249 patients with aNSCLC who began nivolumab, pembrolizumab, or atezolizumab treatment between November 2014 and December 2018. Time-to-treatment discontinuation (TTD) and overall survival (OS) were the analyzed clinical endpoints. RESULTS: After a median follow-up of 14.8 months, median TTD was 7.8 months (95% confidence interval, 5.4-not estimable [NE]) in 75 African American patients versus 4.6 (2.4-7.2) in 110 White patients (hazard ratio [HR], 0.63). Median OS was not reached (18.4-NE) in African American patients versus 11.6 months (9.7-NE) in White patients (HR, 0.58). Multivariable Cox regression conducted with potential confounders confirmed longer TTD (adjusted HR, 0.65) and OS (adjusted HR, 0.60) in African American versus White patients. Similar real-world response rate (42.6% vs. 43.5%) and disease control rate (59.6% vs. 56.5%) were observed in the African American and White patient populations. Further investigation revealed the African American patient group had lower incidence (14.7%) of putative hyperprogressive diseases (HPD) upon anti-PD-1/PD-L1 treatment than the White patient group (24.5%). CONCLUSION: Analysis of RWD showed longer TTD and OS in African American patients with aNSCLC treated with anti-PD-1/PD-L1 inhibitors. Lower incidence of putative HPD is a possible reason for the favorable outcomes in this patient population. IMPLICATIONS FOR PRACTICE: There is a significant underrepresentation of minority patients in randomized clinical trials, and this study demonstrates that real-world data can be used to investigate the impact of race and ethnicity on treatment response. In retrospective analysis of patients with advanced non-small cell lung cancer treated with programmed cell death-1 or programmed cell death-ligand 1 inhibitors, African American patients had significantly longer time-to-treatment discontinuation and longer overall survival. Analysis of real-world data can yield clinical insights and establish a more complete picture of medical interventions in routine clinical practice.
Assuntos
Carcinoma Pulmonar de Células não Pequenas , Neoplasias Pulmonares , Apoptose , Antígeno B7-H1 , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Etnicidade , Humanos , Inibidores de Checkpoint Imunológico , Ligantes , Neoplasias Pulmonares/tratamento farmacológico , Estudos RetrospectivosRESUMO
BACKGROUND: Immune checkpoint inhibitors (ICIs) have been incorporated into various clinical oncology guidelines for systemic treatment of advanced non-small cell lung cancers (aNSCLC). However, less than 50% (and 20%) of the patients responded to the therapy as a first (or second) line of therapy. PD-L1 immunohistochemistry (IHC) is an extensively studied biomarker of response to ICI, but results from this test have equivocal predictive power. In order to identify other biomarkers that support clinical decision-making around whether to treat with ICIs or not, we performed a retrospective study of patients with aNSCLC who underwent ICI-based therapy in the Mount Sinai Health System between 2014 and 2019. METHODS: We analyzed data from standard laboratory tests performed in patients as a part of the routine clinical workup during treatment, including complete blood counts (CBC) and a comprehensive metabolic panel (CMP), to correlate test results with clinical response and survival. RESULTS: Of 11,138 NSCLC patients identified, 249 had been treated with ICIs. We found associations between high neutrophil-to-lymphocyte ratio (NLR ≥ 5) and poor survival in ICI-treated NSCLC. We further observed that sustained high NLR after initiation of treatment had a more profound impact on survival than baseline NLR, regardless of PD-L1 status. Hazard ratios when comparing patients with NLR ≥ 5 vs. NLR < 5 are 1.7 (p = 0.02), 3.4 (p = 4.2 × 10- 8), and 3.9 (p = 1.4 × 10- 6) at baseline, 2-8 weeks, and 8-14 weeks after treatment start, respectively. Mild anemia, defined as hemoglobin (HGB) less than 12 g/dL was correlated with survival independently of NLR. Finally, we developed a composite NLR and HGB biomarker. Patients with pretreatment NLR ≥ 5 and HGB < 12 g/dL had a median overall survival (OS) of 8.0 months (95% CI 4.5-11.5) compared to the rest of the cohort with a median OS not reached (95% CI 15.9-NE, p = 1.8 × 10- 5), and a hazard ratio of 2.6 (95% CI 1.7-4.1, p = 3.5 × 10- 5). CONCLUSIONS: We developed a novel composite biomarker for ICI-based therapy in NSCLC based on routine CBC tests, which may provide meaningful clinical utility to guide treatment decision. The results suggest that treatment of anemia to elevate HGB before initiation of ICI therapy may improve patient outcomes or the use of alternative non-chemotherapy containing regimens.
Assuntos
Antineoplásicos Imunológicos/uso terapêutico , Carcinoma Pulmonar de Células não Pequenas/sangue , Carcinoma Pulmonar de Células não Pequenas/tratamento farmacológico , Índices de Eritrócitos , Contagem de Leucócitos , Neoplasias Pulmonares/sangue , Neoplasias Pulmonares/tratamento farmacológico , Antineoplásicos Imunológicos/administração & dosagem , Antineoplásicos Imunológicos/efeitos adversos , Antígeno B7-H1/antagonistas & inibidores , Biomarcadores Tumorais , Carcinoma Pulmonar de Células não Pequenas/etiologia , Carcinoma Pulmonar de Células não Pequenas/patologia , Feminino , Humanos , Estimativa de Kaplan-Meier , Neoplasias Pulmonares/etiologia , Neoplasias Pulmonares/patologia , Linfócitos , Masculino , Metástase Neoplásica , Estadiamento de Neoplasias , Neutrófilos , Razão de Chances , Prognóstico , Receptor de Morte Celular Programada 1/antagonistas & inibidores , Resultado do TratamentoRESUMO
Primary Biliary Cholangitis (PBC) is a chronic autoimmune liver disease characterised by progressive destruction of intrahepatic bile ducts. The strongest genetic association is with HLA-DQA1*04:01, but at least three additional independent HLA haplotypes contribute to susceptibility. We used dense single nucleotide polymorphism (SNP) data in 2861 PBC cases and 8514 controls to impute classical HLA alleles and amino acid polymorphisms using state-of-the-art methodologies. We then demonstrated through stepwise regression that association in the HLA region can be largely explained by variation at five separate amino acid positions. Three-dimensional modelling of protein structures and calculation of electrostatic potentials for the implicated HLA alleles/amino acid substitutions demonstrated a correlation between the electrostatic potential of pocket P6 in HLA-DP molecules and the HLA-DPB1 alleles/amino acid substitutions conferring PBC susceptibility/protection, highlighting potential new avenues for future functional investigation.
Assuntos
Antígenos HLA/genética , Cirrose Hepática Biliar/genética , Cirrose Hepática Biliar/imunologia , Complexo Principal de Histocompatibilidade , Sequência de Aminoácidos , Substituição de Aminoácidos , Genes MHC da Classe II , Estudos de Associação Genética , Predisposição Genética para Doença , Antígenos HLA/química , Antígenos HLA-C/genética , Cadeias beta de HLA-DP/química , Cadeias beta de HLA-DP/genética , Cadeias alfa de HLA-DQ/genética , Cadeias beta de HLA-DQ/genética , Cadeias HLA-DRB1/genética , Humanos , Modelos Genéticos , Modelos Moleculares , Polimorfismo de Nucleotídeo Único , Conformação Proteica , Análise de Regressão , Eletricidade EstáticaRESUMO
MOTIVATION: Underrepresentation of racial groups represents an important challenge and major gap in phenomics research. Most of the current human phenomics research is based primarily on European populations; hence it is an important challenge to expand it to consider other population groups. One approach is to utilize data from EMR databases that contain patient data from diverse demographics and ancestries. The implications of this racial underrepresentation of data can be profound regarding effects on the healthcare delivery and actionability. To the best of our knowledge, our work is the first attempt to perform comparative, population-scale analyses of disease networks across three different populations, namely Caucasian (EA), African American (AA) and Hispanic/Latino (HL). RESULTS: We compared susceptibility profiles and temporal connectivity patterns for 1988 diseases and 37 282 disease pairs represented in a clinical population of 1 025 573 patients. Accordingly, we revealed appreciable differences in disease susceptibility, temporal patterns, network structure and underlying disease connections between EA, AA and HL populations. We found 2158 significantly comorbid diseases for the EA cohort, 3265 for AA and 672 for HL. We further outlined key disease pair associations unique to each population as well as categorical enrichments of these pairs. Finally, we identified 51 key 'hub' diseases that are the focal points in the race-centric networks and of particular clinical importance. Incorporating race-specific disease comorbidity patterns will produce a more accurate and complete picture of the disease landscape overall and could support more precise understanding of disease relationships and patient management towards improved clinical outcomes. CONTACTS: rong.chen@mssm.edu or joel.dudley@mssm.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Registros Eletrônicos de Saúde , Negro ou Afro-Americano , Bases de Dados Factuais , Hispânico ou Latino , Humanos , População BrancaRESUMO
BACKGROUND: Alzheimer's disease (AD) represents the most common form of dementia in elder populations with approximately 30 million cases worldwide. Genome wide genotyping and sequencing studies have identified many genetic variants associated with late-onset Alzheimer's disease (LOAD). While most of these variants are associated with increased risk of developing LOAD, only limited number of reports focused on variants that are protective against the disease. METHODS: Here we applied a novel approach to uncover protective alleles against AD by analyzing genetic and phenotypic data in Mount Sinai Biobank and Electronic Medical Record (EMR) databases. RESULTS: We discovered a likely loss-of-function small deletion variant in the caspase 7 (CASP7) gene associated with significantly reduced incidence of LOAD in carriers of the high-risk APOE ε4 allele. Further investigation of four independent cohorts of European ancestry revealed the protective effect of the CASP7 variant against AD is most significant in homozygous APOE ε4 allele carriers. Meta analysis of multiple datasets shows overall odds ratio = 0.45 (p = 0.004). Analysis of RNA sequencing derived gene expression data indicated the variant correlates with reduced caspase 7 expression in multiple brain tissues we examined. CONCLUSIONS: Taken together, these results are consistent with the notion that caspase 7 plays a key role in microglial activation driving neuro-degeneration during AD pathogenesis, and may explain the underlying genetic mechanisms that anti-inflammatory interventions in AD show greater benefit in APOE ε4 carriers than non-carriers. Our findings inform potential novel therapeutic opportunities for AD and warrant further investigations.
Assuntos
Doença de Alzheimer/genética , Apolipoproteínas E/genética , Caspase 7/genética , Perfilação da Expressão Gênica/métodos , Análise de Sequência de RNA/métodos , Idade de Início , Alelos , Regulação para Baixo , Registros Eletrônicos de Saúde , Predisposição Genética para Doença , Homozigoto , Humanos , Razão de Chances , Deleção de SequênciaRESUMO
The genetic contribution to the variation in human lifespan is â¼ 25%. Despite the large number of identified disease-susceptibility loci, it is not known which loci influence population mortality. We performed a genome-wide association meta-analysis of 7729 long-lived individuals of European descent (≥ 85 years) and 16 121 younger controls (<65 years) followed by replication in an additional set of 13 060 long-lived individuals and 61 156 controls. In addition, we performed a subset analysis in cases aged ≥ 90 years. We observed genome-wide significant association with longevity, as reflected by survival to ages beyond 90 years, at a novel locus, rs2149954, on chromosome 5q33.3 (OR = 1.10, P = 1.74 × 10(-8)). We also confirmed association of rs4420638 on chromosome 19q13.32 (OR = 0.72, P = 3.40 × 10(-36)), representing the TOMM40/APOE/APOC1 locus. In a prospective meta-analysis (n = 34 103), the minor allele of rs2149954 (T) on chromosome 5q33.3 associates with increased survival (HR = 0.95, P = 0.003). This allele has previously been reported to associate with low blood pressure in middle age. Interestingly, the minor allele (T) associates with decreased cardiovascular mortality risk, independent of blood pressure. We report on the first GWAS-identified longevity locus on chromosome 5q33.3 influencing survival in the general European population. The minor allele of this locus associates with low blood pressure in middle age, although the contribution of this allele to survival may be less dependent on blood pressure. Hence, the pleiotropic mechanisms by which this intragenic variation contributes to lifespan regulation have to be elucidated.
Assuntos
Loci Gênicos/fisiologia , Longevidade/genética , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , Doenças Cardiovasculares/genética , Mapeamento Cromossômico , Cromossomos Humanos Par 19 , Cromossomos Humanos Par 5 , Feminino , Estudo de Associação Genômica Ampla , Humanos , Hipertensão/genética , Masculino , Fenótipo , Estudos Prospectivos , População BrancaRESUMO
We conducted a genome-wide association study to search for risk alleles associated with Tetralogy of Fallot (TOF), using a northern European discovery set of 835 cases and 5159 controls. A region on chromosome 12q24 was associated (P = 1.4 × 10(-7)) and replicated convincingly (P = 3.9 × 10(-5)) in 798 cases and 2931 controls [per allele odds ratio (OR) = 1.27 in replication cohort, P = 7.7 × 10(-11) in combined populations]. Single nucleotide polymorphisms in the glypican 5 gene on chromosome 13q32 were also associated (P = 1.7 × 10(-7)) and replicated convincingly (P = 1.2 × 10(-5)) in 789 cases and 2927 controls (per allele OR = 1.31 in replication cohort, P = 3.03 × 10(-11) in combined populations). Four additional regions on chromosomes 10, 15 and 16 showed suggestive association accompanied by nominal replication. This study, the first genome-wide association study of a congenital heart malformation phenotype, provides evidence that common genetic variation influences the risk of TOF.
Assuntos
Cromossomos Humanos Par 12/genética , Cromossomos Humanos Par 13/genética , Estudo de Associação Genômica Ampla , Tetralogia de Fallot/genética , Estudos de Casos e Controles , Feminino , Frequência do Gene , Loci Gênicos , Humanos , Desequilíbrio de Ligação , Masculino , Polimorfismo de Nucleotídeo ÚnicoRESUMO
In spite of the success of genome-wide association studies in finding many common variants associated with disease, these variants seem to explain only a small proportion of the estimated heritability. Data collection has turned toward exome and whole genome sequencing, but it is well known that single marker methods frequently used for common variants have low power to detect rare variants associated with disease, even with very large sample sizes. In response, a variety of methods have been developed that attempt to cluster rare variants so that they may gather strength from one another under the premise that there may be multiple causal variants within a gene. Most of these methods group variants by gene or proximity, and test one gene or marker window at a time. We propose a penalized regression method (PeRC) that analyzes all genes at once, allowing grouping of all (rare and common) variants within a gene, along with subgrouping of the rare variants, thus borrowing strength from both rare and common variants within the same gene. The method can incorporate either a burden-based weighting of the rare variants or one in which the weights are data driven. In simulations, our method performs favorably when compared to many previously proposed approaches, including its predecessor, the sparse group lasso [Friedman et al., 2010].
Assuntos
Variação Genética , Modelos Logísticos , Modelos Genéticos , Estudos de Casos e Controles , Simulação por Computador , Frequência do Gene , Genética Populacional , Humanos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
PURPOSE: Data quality and standardization remain a challenge when analyzing real-world clinical data. We built a clinical research database, using machine learning and natural learning processing, and investigated factors influencing testosterone recovery (T-recovery) in patients with localized prostate cancer (LPC) after initial androgen deprivation therapy (ADT). METHODS: Medication and treatment-associated dates missing in structured tables were extracted from patient notes using ConceptMapper, an automated data extraction tool, standardized and curated in Sema4 clinical research database. ADT usage duration was evaluated, and T-recovery in patients with LPC was analyzed by the Kaplan-Meier method and multivariable Cox proportional hazards models. We assessed the prognostic value of post-ADT T-recovery with prostate-specific antigen progression-free survival and failure-free survival. RESULTS: In total, 4,125 of 30,832 (13.4%) patients with prostate cancer had medication exclusively from notes with high precision and recall, F1 score ≥ 0.95. Association of dates with medication usage had a F1 score of 0.76. ADT duration estimation had higher accuracy combining information from notes to tables from electronic medical record (70% v 45%). Baseline testosterone was the strongest predictor of T-recovery in these patients. Patients with a baseline testosterone ≥ 300 ng/dL recovered in 9.79 versus 38 months for patients with baseline testosterone < 300 ng/dL (P < .0001). Shorter prostate-specific antigen progression-free interval was observed for patients with T-recovery (≥ 300 ng/dL) at 6 months after ADT cessation compared with patients without T-recovery (< 300 ng/dL; 13.7 v 25.1 months; P = .055). CONCLUSION: We augmented structured electronic medical record data with data extracted from notes and improved the accuracy of medication information for patients. ADT exposure and T-recovery in patients with LPC produced results consistent with the literature and clinical experience and illustrates the power of applying machine learning methods to enhance the quality of real-world evidence in answering clinically relevant questions.
Assuntos
Antígeno Prostático Específico , Neoplasias da Próstata , Antagonistas de Androgênios/efeitos adversos , Antagonistas de Androgênios/uso terapêutico , Registros Eletrônicos de Saúde , Humanos , Masculino , Antígeno Prostático Específico/uso terapêutico , Neoplasias da Próstata/tratamento farmacológico , Testosterona/uso terapêuticoRESUMO
Penalized regression methods offer an attractive alternative to single marker testing in genetic association analysis. Penalized regression methods shrink down to zero the coefficient of markers that have little apparent effect on the trait of interest, resulting in a parsimonious subset of what we hope are true pertinent predictors. Here we explore the performance of penalization in selecting SNPs as predictors in genetic association studies. The strength of the penalty can be chosen either to select a good predictive model (via methods such as computationally expensive cross validation), through maximum likelihood-based model selection criterion (such as the BIC), or to select a model that controls for type I error, as done here. We have investigated the performance of several penalized logistic regression approaches, simulating data under a variety of disease locus effect size and linkage disequilibrium patterns. We compared several penalties, including the elastic net, ridge, Lasso, MCP and the normal-exponential-γ shrinkage prior implemented in the hyperlasso software, to standard single locus analysis and simple forward stepwise regression. We examined how markers enter the model as penalties and P-value thresholds are varied, and report the sensitivity and specificity of each of the methods. Results show that penalized methods outperform single marker analysis, with the main difference being that penalized methods allow the simultaneous inclusion of a number of markers, and generally do not allow correlated variables to enter the model, producing a sparse model in which most of the identified explanatory markers are accounted for.
Assuntos
Antígenos CD/genética , Regulador de Condutância Transmembrana em Fibrose Cística/genética , Citocromo P-450 CYP2D6/genética , Estudo de Associação Genômica Ampla/métodos , Modelos Logísticos , Polimorfismo de Nucleotídeo Único , Algoritmos , Alelos , Antígeno CTLA-4 , Simulação por Computador , Marcadores Genéticos , Humanos , Desequilíbrio de Ligação , Modelos Estatísticos , Sensibilidade e Especificidade , SoftwareRESUMO
MOTIVATION: Low haplotype diversity and linkage disequilibrium are the rule in short genomic segments. This fact suggests that parsimony should be enforced in estimation of haplotype frequencies. The current article introduces a diversity penalty that automatically discards potential haplotypes with low explanatory power. The standard EM algorithm for haplotype frequency estimation can accommodate the penalty if one passes over to a more general minorize-maximize (MM) scheme for estimation. RESULTS: Our new MM algorithm converges in fewer iterations, eliminates marginal haplotypes from further consideration and reduces the computational complexity of each iteration. Estimation by the MM algorithm also improves haplotyping and genotype imputation compared to naive application of the EM algorithm. Thus, the MM algorithm is a useful substitute for the EM algorithm. Compared to the most sophisticated current methods of haplotyping and genotype imputation, the MM algorithm is slightly less accurate but at least an order of magnitude faster. AVAILABILITY: Our software will be made available in the next release the program Mendel at http://www.genetics.ucla.edu/software/.
Assuntos
Biologia Computacional/métodos , Frequência do Gene , Haplótipos , Algoritmos , Cromossomos Humanos X/genética , Interpretação Estatística de Dados , Bases de Dados Genéticas , Genótipo , Humanos , Desequilíbrio de Ligação , Masculino , Modelos Estatísticos , Modelos Teóricos , Reprodutibilidade dos Testes , SoftwareRESUMO
BACKGROUND: Genetic loss-of-function variants (LoFs) associated with disease traits are increasingly recognized as critical evidence for the selection of therapeutic targets. We integrated the analysis of genetic and clinical data from 10,511 individuals in the Mount Sinai BioMe Biobank to identify genes with loss-of-function variants (LoFs) significantly associated with cardiovascular disease (CVD) traits, and used RNA-sequence data of seven metabolic and vascular tissues isolated from 600 CVD patients in the Stockholm-Tartu Atherosclerosis Reverse Network Engineering Task (STARNET) study for validation. We also carried out in vitro functional studies of several candidate genes, and in vivo studies of one gene. RESULTS: We identified LoFs in 433 genes significantly associated with at least one of 10 major CVD traits. Next, we used RNA-sequence data from the STARNET study to validate 115 of the 433 LoF harboring-genes in that their expression levels were concordantly associated with corresponding CVD traits. Together with the documented hepatic lipid-lowering gene, APOC3, the expression levels of six additional liver LoF-genes were positively associated with levels of plasma lipids in STARNET. Candidate LoF-genes were subjected to gene silencing in HepG2 cells with marked overall effects on cellular LDLR, levels of triglycerides and on secreted APOB100 and PCSK9. In addition, we identified novel LoFs in DGAT2 associated with lower plasma cholesterol and glucose levels in BioMe that were also confirmed in STARNET, and showed a selective DGAT2-inhibitor in C57BL/6 mice not only significantly lowered fasting glucose levels but also affected body weight. CONCLUSION: In sum, by integrating genetic and electronic medical record data, and leveraging one of the world's largest human RNA-sequence datasets (STARNET), we identified known and novel CVD-trait related genes that may serve as targets for CVD therapeutics and as such merit further investigation.
Assuntos
Doenças Cardiovasculares/genética , Genômica , Mutação , Doenças Cardiovasculares/sangue , Colesterol/sangue , Genótipo , Humanos , Triglicerídeos/sangueRESUMO
Human longevity is heritable, but genome-wide association (GWA) studies have had limited success. Here, we perform two meta-analyses of GWA studies of a rigorous longevity phenotype definition including 11,262/3484 cases surviving at or beyond the age corresponding to the 90th/99th survival percentile, respectively, and 25,483 controls whose age at death or at last contact was at or below the age corresponding to the 60th survival percentile. Consistent with previous reports, rs429358 (apolipoprotein E (ApoE) ε4) is associated with lower odds of surviving to the 90th and 99th percentile age, while rs7412 (ApoE ε2) shows the opposite. Moreover, rs7676745, located near GPR78, associates with lower odds of surviving to the 90th percentile age. Gene-level association analysis reveals a role for tissue-specific expression of multiple genes in longevity. Finally, genetic correlation of the longevity GWA results with that of several disease-related phenotypes points to a shared genetic architecture between health and longevity.
Assuntos
Apolipoproteína E2/genética , Apolipoproteína E4/genética , Proteínas de Choque Térmico/genética , Longevidade/genética , Chaperona BiP do Retículo Endoplasmático , Estudo de Associação Genômica Ampla , HumanosRESUMO
Context: The hypothalamic melanocortin 4 receptor (MC4R) pathway serves a critical role in regulating body weight. Loss of function (LoF) mutations in the MC4R pathway, including mutations in the pro-opiomelanocortin (POMC), prohormone convertase 1 (PCSK1), leptin receptor (LEPR), or MC4R genes, have been shown to cause early-onset severe obesity. Methods: Through a comprehensive epidemiological analysis of known and predicted LoF variants in the POMC, PCSK1, and LEPR genes, we sought to estimate the number of US individuals with biallelic MC4R pathway LoF variants. Results: We predict ~650 α-melanocyte-stimulating hormone (MSH)/POMC, 8500 PCSK1, and 3600 LEPR homozygous and compound heterozygous individuals in the United States, cumulatively enumerating >12,800 MC4R pathway-deficient obese patients. Few of these variants have been genetically diagnosed to date. These estimates increase when we include a small subset of less rare variants: ß-MSH/POMC,PCSK1 N221D, and a PCSK1 LoF variant (T640A). To further define the MC4R pathway and its potential impact on obesity, we tested associations between body mass index (BMI) and LoF mutation burden in the POMC, PCSK1, and LEPR genes in various populations. We show that the cumulative allele burden in individuals with two or more LoF alleles in one or more genes in the MC4R pathway are predisposed to a higher BMI than noncarriers or heterozygous LoF carriers with a defect in only one gene. Conclusions: Our analysis represents a genetically rationalized study of the hypothalamic MC4R pathway aimed at genetic patient stratification to determine which obese subpopulations should be studied to elucidate MC4R agonist (e.g., setmelanotide) treatment responsiveness.
Assuntos
Mutação com Perda de Função/genética , Obesidade/epidemiologia , Obesidade/genética , Receptor Tipo 4 de Melanocortina/genética , Transdução de Sinais/genética , Alelos , Fármacos Antiobesidade/farmacologia , Índice de Massa Corporal , Feminino , Heterozigoto , Homozigoto , Humanos , Masculino , Obesidade/tratamento farmacológico , Pró-Opiomelanocortina/genética , Pró-Proteína Convertase 1/genética , Receptor Tipo 4 de Melanocortina/agonistas , Receptores para Leptina/genética , Estados Unidos/epidemiologia , alfa-MSH/análogos & derivados , alfa-MSH/farmacologiaRESUMO
Elevated blood pressure is a major risk factor for cardiovascular disease and has a substantial genetic contribution. Genetic variation influencing blood pressure has the potential to identify new pharmacological targets for the treatment of hypertension. To discover additional novel blood pressure loci, we used 1000 Genomes Project-based imputation in 150 134 European ancestry individuals and sought significant evidence for independent replication in a further 228 245 individuals. We report 6 new signals of association in or near HSPB7, TNXB, LRP12, LOC283335, SEPT9, and AKT2, and provide new replication evidence for a further 2 signals in EBF2 and NFKBIA. Combining large whole-blood gene expression resources totaling 12 607 individuals, we investigated all novel and previously reported signals and identified 48 genes with evidence for involvement in blood pressure regulation that are significant in multiple resources. Three novel kidney-specific signals were also detected. These robustly implicated genes may provide new leads for therapeutic innovation.
RESUMO
We propose a dictionary model for haplotypes. According to the model, a haplotype is constructed by randomly concatenating haplotype segments from a given dictionary of segments. A haplotype block is defined as a set of haplotype segments that begin and end with the same pair of markers. In this framework, haplotype blocks can overlap, and the model provides a setting for testing the accuracy of simpler models invoking only nonoverlapping blocks. Each haplotype segment in a dictionary has an assigned probability and alternate spellings that account for genotyping errors and mutation. The model also allows for missing data, unphased genotypes, and prior distribution of parameters. Likelihood evaluations rely on forward and backward recurrences similar to the ones encountered in hidden Markov models. Parameter estimation is carried out with an EM algorithm. The search for the optimal dictionary is particularly difficult because of the variable dimension of the model space. We define a minimum description length criteria to evaluate each dictionary and use a combination of greedy search and careful initialization to select a best dictionary for a given dataset. Application of the model to simulated data gives encouraging results. In a real dataset, we are able to reconstruct a parsimonious dictionary that captures patterns of linkage disequilibrium well.