RESUMO
To address the challenge of translating genetic discoveries for type 1 diabetes (T1D) into mechanistic insight, we have developed the T1D Knowledge Portal (T1DKP), an open-access resource for hypothesis development and target discovery in T1D.
Assuntos
Diabetes Mellitus Tipo 1 , Humanos , Diabetes Mellitus Tipo 1/genética , Genômica , Genética HumanaRESUMO
Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency < 1%) predicted damaging coding variation by using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples. We prioritize 32 genes in array-based genome-wide association study (GWAS) loci based on aggregations of rare coding variants; three (EVI5, SH2B3, and PLIN1) had no prior association of rare coding variants with lipid levels. Most of our associated genes showed evidence of association among multiple ancestries. Finally, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes and for genes closest to GWAS index single-nucleotide polymorphisms (SNPs). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.
Assuntos
Exoma , Variação Genética , Estudo de Associação Genômica Ampla , Lipídeos/sangue , Fases de Leitura Aberta , Alelos , Glicemia/genética , Estudos de Casos e Controles , Biologia Computacional/métodos , Bases de Dados Genéticas , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Predisposição Genética para Doença , Genética Populacional , Estudo de Associação Genômica Ampla/métodos , Humanos , Metabolismo dos Lipídeos/genética , Fígado/metabolismo , Fígado/patologia , Anotação de Sequência Molecular , Herança Multifatorial , Fenótipo , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Apolipoprotein L1 (APOL1)-associated focal segmental glomerulosclerosis (FSGS) is the dominant form of FSGS in Black individuals. There are no targeted therapies for this condition, in part because the molecular mechanisms underlying APOL1's pathogenic contribution to FSGS are incompletely understood. Studying the transcriptomic landscape of APOL1 FSGS in patient kidneys is an important way to discover genes and molecular behaviors that are unique or most relevant to the human disease. With the hypothesis that the pathology driven by the high-risk APOL1 genotype is reflected in alteration of gene expression across the glomerular transcriptome, we compared expression and co-expression profiles of 15,703 genes in 16 Black patients with FSGS at high-risk vs 14 Black patients with a low-risk APOL1 genotype. Expression data from APOL1-inducible HEK293 cells and normal human glomeruli were used to pursue genes and molecular pathways uncovered in these studies. We discovered increased expression of APOL1 and nine other significant differentially expressed genes in high-risk patients. This included stanniocalcin, which has a role in mitochondrial and calcium-related processes along with differential correlations between high- and low-risk APOL1 and metabolism pathway genes. There were similar correlations with extracellular matrix- and immune-related genes, but significant loss of co-expression of mitochondrial genes in high-risk FSGS, and an NF-κB-down regulating gene, NKIRAS1, as the most significant hub gene with strong differential correlations with NDUF family (mitochondrial respiratory genes) and immune-related (JAK-STAT) genes. Thus, differences in mitochondrial gene regulation appear to underlie many differences observed between high- and low-risk Black patients with FSGS.
Assuntos
Apolipoproteína L1 , Glomerulosclerose Segmentar e Focal , Apolipoproteína L1/genética , Glomerulosclerose Segmentar e Focal/genética , Glomerulosclerose Segmentar e Focal/patologia , Células HEK293 , Humanos , Glomérulos Renais/patologia , TranscriptomaRESUMO
Performing genetic studies in multiple human populations can identify disease risk alleles that are common in one population but rare in others, with the potential to illuminate pathophysiology, health disparities, and the population genetic origins of disease alleles. Here we analysed 9.2 million single nucleotide polymorphisms (SNPs) in each of 8,214 Mexicans and other Latin Americans: 3,848 with type 2 diabetes and 4,366 non-diabetic controls. In addition to replicating previous findings, we identified a novel locus associated with type 2 diabetes at genome-wide significance spanning the solute carriers SLC16A11 and SLC16A13 (P = 3.9 × 10(-13); odds ratio (OR) = 1.29). The association was stronger in younger, leaner people with type 2 diabetes, and replicated in independent samples (P = 1.1 × 10(-4); OR = 1.20). The risk haplotype carries four amino acid substitutions, all in SLC16A11; it is present at ~50% frequency in Native American samples and ~10% in east Asian, but is rare in European and African samples. Analysis of an archaic genome sequence indicated that the risk haplotype introgressed into modern humans via admixture with Neanderthals. The SLC16A11 messenger RNA is expressed in liver, and V5-tagged SLC16A11 protein localizes to the endoplasmic reticulum. Expression of SLC16A11 in heterologous cells alters lipid metabolism, most notably causing an increase in intracellular triacylglycerol levels. Despite type 2 diabetes having been well studied by genome-wide association studies in other populations, analysis in Mexican and Latin American individuals identified SLC16A11 as a novel candidate gene for type 2 diabetes with a possible role in triacylglycerol metabolism.
Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença/genética , Transportadores de Ácidos Monocarboxílicos/genética , Polimorfismo de Nucleotídeo Único/genética , Alelos , Animais , Povo Asiático/genética , População Negra/genética , Estudos de Coortes , Retículo Endoplasmático/genética , Feminino , Estudo de Associação Genômica Ampla , Haplótipos/genética , Células HeLa , Humanos , Indígenas Norte-Americanos/genética , Metabolismo dos Lipídeos/genética , Fígado/citologia , Fígado/metabolismo , Masculino , México , Homem de Neandertal/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Triglicerídeos/metabolismo , População Branca/genéticaRESUMO
To gain insight into potential regulatory mechanisms through which the effects of variants at four established type 2 diabetes (T2D) susceptibility loci (CDKAL1, CDKN2A-B, IGF2BP2 and KCNQ1) are mediated, we undertook transancestral fine-mapping in 22 086 cases and 42 539 controls of East Asian, European, South Asian, African American and Mexican American descent. Through high-density imputation and conditional analyses, we identified seven distinct association signals at these four loci, each with allelic effects on T2D susceptibility that were homogenous across ancestry groups. By leveraging differences in the structure of linkage disequilibrium between diverse populations, and increased sample size, we localised the variants most likely to drive each distinct association signal. We demonstrated that integration of these genetic fine-mapping data with genomic annotation can highlight potential causal regulatory elements in T2D-relevant tissues. These analyses provide insight into the mechanisms through which T2D association signals are mediated, and suggest future routes to understanding the biology of specific disease susceptibility loci.
Assuntos
Mapeamento Cromossômico , Diabetes Mellitus Tipo 2/genética , Estudos de Associação Genética , Predisposição Genética para Doença , Negro ou Afro-Americano/genética , Alelos , Povo Asiático/genética , Inibidor p16 de Quinase Dependente de Ciclina , Inibidor de Quinase Dependente de Ciclina p18/genética , Diabetes Mellitus Tipo 2/patologia , Feminino , Humanos , Canal de Potássio KCNQ1/genética , Desequilíbrio de Ligação , Masculino , Polimorfismo de Nucleotídeo Único , Proteínas de Ligação a RNA/genética , Elementos Reguladores de Transcrição/genética , População Branca/genética , tRNA Metiltransferases/genéticaRESUMO
Genome wide association studies (GWAS) for fasting glucose (FG) and insulin (FI) have identified common variant signals which explain 4.8% and 1.2% of trait variance, respectively. It is hypothesized that low-frequency and rare variants could contribute substantially to unexplained genetic variance. To test this, we analyzed exome-array data from up to 33,231 non-diabetic individuals of European ancestry. We found exome-wide significant (P<5×10-7) evidence for two loci not previously highlighted by common variant GWAS: GLP1R (p.Ala316Thr, minor allele frequency (MAF)=1.5%) influencing FG levels, and URB2 (p.Glu594Val, MAF = 0.1%) influencing FI levels. Coding variant associations can highlight potential effector genes at (non-coding) GWAS signals. At the G6PC2/ABCB11 locus, we identified multiple coding variants in G6PC2 (p.Val219Leu, p.His177Tyr, and p.Tyr207Ser) influencing FG levels, conditionally independent of each other and the non-coding GWAS signal. In vitro assays demonstrate that these associated coding alleles result in reduced protein abundance via proteasomal degradation, establishing G6PC2 as an effector gene at this locus. Reconciliation of single-variant associations and functional effects was only possible when haplotype phase was considered. In contrast to earlier reports suggesting that, paradoxically, glucose-raising alleles at this locus are protective against type 2 diabetes (T2D), the p.Val219Leu G6PC2 variant displayed a modest but directionally consistent association with T2D risk. Coding variant associations for glycemic traits in GWAS signals highlight PCSK1, RREB1, and ZHX3 as likely effector transcripts. These coding variant association signals do not have a major impact on the trait variance explained, but they do provide valuable biological insights.
Assuntos
Glicemia/genética , Diabetes Mellitus Tipo 2/genética , Glucose-6-Fosfatase/genética , Insulina/sangue , Diabetes Mellitus Tipo 2/sangue , Diabetes Mellitus Tipo 2/patologia , Exoma/genética , Frequência do Gene , Estudo de Associação Genômica Ampla , Receptor do Peptídeo Semelhante ao Glucagon 1 , Índice Glicêmico/genética , Humanos , Insulina/genética , Polimorfismo de Nucleotídeo Único , Receptores de Glucagon/genéticaRESUMO
Systemic lupus erythematosus (SLE) is an autoimmune disease influenced by genetic and environmental factors. We carried out a genome-wide association scan and replication study and found an association between SLE and a variant in TNFAIP3 (rs5029939, meta-analysis P = 2.89 x 10(-12), OR = 2.29). We also found evidence of two independent signals near TNFAIP3 associated with SLE, including one previously associated with rheumatoid arthritis (RA). These results establish that variants near TNFAIP3 contribute to differential risk of SLE and RA.
Assuntos
Cromossomos Humanos Par 6 , Peptídeos e Proteínas de Sinalização Intracelular/genética , Lúpus Eritematoso Sistêmico/genética , Proteínas Nucleares/genética , Artrite Reumatoide/genética , Proteínas de Ligação a DNA , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Polimorfismo de Nucleotídeo Único , Proteína 3 Induzida por Fator de Necrose Tumoral alfaRESUMO
Common diseases such as type 2 diabetes are phenotypically heterogeneous. Obesity is a major risk factor for type 2 diabetes, but patients vary appreciably in body mass index. We hypothesized that the genetic predisposition to the disease may be different in lean (BMI<25 Kg/m²) compared to obese cases (BMI≥30 Kg/m²). We performed two case-control genome-wide studies using two accepted cut-offs for defining individuals as overweight or obese. We used 2,112 lean type 2 diabetes cases (BMI<25 kg/m²) or 4,123 obese cases (BMI≥30 kg/m²), and 54,412 un-stratified controls. Replication was performed in 2,881 lean cases or 8,702 obese cases, and 18,957 un-stratified controls. To assess the effects of known signals, we tested the individual and combined effects of SNPs representing 36 type 2 diabetes loci. After combining data from discovery and replication datasets, we identified two signals not previously reported in Europeans. A variant (rs8090011) in the LAMA1 gene was associated with type 2 diabetes in lean cases (Pâ=â8.4×10â»9, ORâ=â1.13 [95% CI 1.09-1.18]), and this association was stronger than that in obese cases (Pâ=â0.04, ORâ=â1.03 [95% CI 1.00-1.06]). A variant in HMG20A--previously identified in South Asians but not Europeans--was associated with type 2 diabetes in obese cases (Pâ=â1.3×10â»8, ORâ=â1.11 [95% CI 1.07-1.15]), although this association was not significantly stronger than that in lean cases (Pâ=â0.02, ORâ=â1.09 [95% CI 1.02-1.17]). For 36 known type 2 diabetes loci, 29 had a larger odds ratio in the lean compared to obese (binomial Pâ=â0.0002). In the lean analysis, we observed a weighted per-risk allele ORâ=â1.13 [95% CI 1.10-1.17], Pâ=â3.2×10⻹4. This was larger than the same model fitted in the obese analysis where the ORâ=â1.06 [95% CI 1.05-1.08], Pâ=â2.2×10⻹6. This study provides evidence that stratification of type 2 diabetes cases by BMI may help identify additional risk variants and that lean cases may have a stronger genetic predisposition to type 2 diabetes.
Assuntos
Índice de Massa Corporal , Diabetes Mellitus Tipo 2/genética , Proteínas de Grupo de Alta Mobilidade/genética , Laminina/genética , Obesidade/genética , Idoso , Alelos , Povo Asiático/genética , Estudos de Casos e Controles , Diabetes Mellitus Tipo 2/complicações , Feminino , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Obesidade/complicações , Polimorfismo de Nucleotídeo Único , Fatores de Risco , População Branca/genéticaRESUMO
Genome-wide association studies have identified hundreds of loci for type 2 diabetes, coronary artery disease and myocardial infarction, as well as for related traits such as body mass index, glucose and insulin levels, lipid levels, and blood pressure. These studies also have pointed to thousands of loci with promising but not yet compelling association evidence. To establish association at additional loci and to characterize the genome-wide significant loci by fine-mapping, we designed the "Metabochip," a custom genotyping array that assays nearly 200,000 SNP markers. Here, we describe the Metabochip and its component SNP sets, evaluate its performance in capturing variation across the allele-frequency spectrum, describe solutions to methodological challenges commonly encountered in its analysis, and evaluate its performance as a platform for genotype imputation. The metabochip achieves dramatic cost efficiencies compared to designing single-trait follow-up reagents, and provides the opportunity to compare results across a range of related traits. The metabochip and similar custom genotyping arrays offer a powerful and cost-effective approach to follow-up large-scale genotyping and sequencing studies and advance our understanding of the genetic basis of complex human diseases and traits.
Assuntos
Antropometria/instrumentação , Metabolômica/instrumentação , Análise de Sequência com Séries de Oligonucleotídeos/instrumentação , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Alelos , Antropometria/métodos , Doenças Cardiovasculares/diagnóstico , Doenças Cardiovasculares/genética , Doenças Cardiovasculares/metabolismo , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Frequência do Gene , Genoma Humano , Estudo de Associação Genômica Ampla , Genótipo , Técnicas de Genotipagem , Humanos , Metabolômica/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , FenótipoRESUMO
A general question for linkage disequilibrium-based association studies is how power to detect an association is compromised when tag SNPs are chosen from data in one population sample and then deployed in another sample. Specifically, it is important to know how well tags picked from the HapMap DNA samples capture the variation in other samples. To address this, we collected dense data uniformly across the four HapMap population samples and eleven other population samples. We picked tag SNPs using genotype data we collected in the HapMap samples and then evaluated the effective coverage of these tags in comparison to the entire set of common variants observed in the other samples. We simulated case-control association studies in the non-HapMap samples under a disease model of modest risk, and we observed little loss in power. These results demonstrate that the HapMap DNA samples can be used to select tags for genome-wide association studies in many samples around the world.
Assuntos
Mapeamento Cromossômico/métodos , Genética Populacional/métodos , Polimorfismo de Nucleotídeo Único , Sitios de Sequências Rotuladas , Neoplasias da Mama/etnologia , Neoplasias da Mama/genética , Estudos de Casos e Controles , Estudos de Coortes , Simulação por Computador , Feminino , Variação Genética , Genoma Humano , Projeto Genoma Humano , Humanos , Desequilíbrio de Ligação , Masculino , Neoplasias da Próstata/etnologia , Neoplasias da Próstata/genéticaRESUMO
IMPORTANCE: Latino populations have one of the highest prevalences of type 2 diabetes worldwide. OBJECTIVES: To investigate the association between rare protein-coding genetic variants and prevalence of type 2 diabetes in a large Latino population and to explore potential molecular and physiological mechanisms for the observed relationships. DESIGN, SETTING, AND PARTICIPANTS: Whole-exome sequencing was performed on DNA samples from 3756 Mexican and US Latino individuals (1794 with type 2 diabetes and 1962 without diabetes) recruited from 1993 to 2013. One variant was further tested for allele frequency and association with type 2 diabetes in large multiethnic data sets of 14,276 participants and characterized in experimental assays. MAIN OUTCOME AND MEASURES: Prevalence of type 2 diabetes. Secondary outcomes included age of onset, body mass index, and effect on protein function. RESULTS: A single rare missense variant (c.1522G>A [p.E508K]) was associated with type 2 diabetes prevalence (odds ratio [OR], 5.48; 95% CI, 2.83-10.61; P = 4.4 × 10(-7)) in hepatocyte nuclear factor 1-α (HNF1A), the gene responsible for maturity onset diabetes of the young type 3 (MODY3). This variant was observed in 0.36% of participants without type 2 diabetes and 2.1% of participants with it. In multiethnic replication data sets, the p.E508K variant was seen only in Latino patients (n = 1443 with type 2 diabetes and 1673 without it) and was associated with type 2 diabetes (OR, 4.16; 95% CI, 1.75-9.92; P = .0013). In experimental assays, HNF-1A protein encoding the p.E508K mutant demonstrated reduced transactivation activity of its target promoter compared with a wild-type protein. In our data, carriers and noncarriers of the p.E508K mutation with type 2 diabetes had no significant differences in compared clinical characteristics, including age at onset. The mean (SD) age for carriers was 45.3 years (11.2) vs 47.5 years (11.5) for noncarriers (P = .49) and the mean (SD) BMI for carriers was 28.2 (5.5) vs 29.3 (5.3) for noncarriers (P = .19). CONCLUSIONS AND RELEVANCE: Using whole-exome sequencing, we identified a single low-frequency variant in the MODY3-causing gene HNF1A that is associated with type 2 diabetes in Latino populations and may affect protein function. This finding may have implications for screening and therapeutic modification in this population, but additional studies are required.
Assuntos
Diabetes Mellitus Tipo 2/genética , Fator 1-alfa Nuclear de Hepatócito/genética , Adulto , Idade de Início , Idoso , Feminino , Genótipo , Hispânico ou Latino/genética , Humanos , Masculino , México , Pessoa de Meia-Idade , Mutação de Sentido Incorreto , Análise de Sequência de DNA , Estados UnidosRESUMO
Although large-scale genetic association studies have proven opportunistic for the delineation of neurodegenerative disease processes, we still lack a full understanding of the pathological mechanisms of these diseases, resulting in few appropriate treatment options and diagnostic challenges. To mitigate these gaps, the Neurodegenerative Disease Knowledge Portal (NDKP) was created as an open-science initiative with the aim to aggregate, enable analysis, and display all available genomic datasets of neurodegenerative disease, while protecting the integrity and confidentiality of the underlying datasets. The portal contains 218 genomic datasets, including genotyping and sequencing studies, of individuals across ten different phenotypic groups, including neurological conditions such as Alzheimer's disease, amyotrophic lateral sclerosis, Lewy body dementia, and Parkinson's disease. In addition to securely hosting large genomic datasets, the NDKP provides accessible workflows and tools to effectively utilize the datasets and assist in the facilitation of customized genomic analyses. Here, we summarize the genomic datasets currently included within the portal, the bioinformatics processing of the datasets, and the variety of phenotypes captured. We also present example use-cases of the various user interfaces and integrated analytic tools to demonstrate their extensive utility in enabling the extraction of high-quality results at the source, for both genomics experts and those in other disciplines. Overall, the NDKP promotes open-science and collaboration, maximizing the potential for discovery from the large-scale datasets researchers and consortia are expending immense resources to produce and resulting in reproducible conclusions to improve diagnostic and therapeutic care for neurodegenerative disease patients.
RESUMO
Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including UBR3 for cardiometabolic disease and YLPM1 for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (ßDeming = 0.7-1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease.
Assuntos
Bancos de Espécimes Biológicos , Humanos , Variação Genética , Predisposição Genética para Doença , População Branca/genética , Doença/genética , Estudo de Associação Genômica AmplaRESUMO
The human retina is a multilayered tissue that offers a unique window into systemic health. Optical coherence tomography (OCT) is widely used in eye care and allows the noninvasive, rapid capture of retinal anatomy in exquisite detail. We conducted genotypic and phenotypic analyses of retinal layer thicknesses using macular OCT images from 44,823 UK Biobank participants. We performed OCT layer cross-phenotype association analyses (OCT-XWAS), associating retinal thicknesses with 1866 incident conditions (median 10-year follow-up) and 88 quantitative traits and blood biomarkers. We performed genome-wide association studies (GWASs), identifying inherited genetic markers that influence retinal layer thicknesses and replicated our associations among the LIFE-Adult Study (N = 6313). Last, we performed a comparative analysis of phenome- and genome-wide associations to identify putative causal links between retinal layer thicknesses and both ocular and systemic conditions. Independent associations with incident mortality were detected for thinner photoreceptor segments (PSs) and, separately, ganglion cell complex layers. Phenotypic associations were detected between thinner retinal layers and ocular, neuropsychiatric, cardiometabolic, and pulmonary conditions. A GWAS of retinal layer thicknesses yielded 259 unique loci. Consistency between epidemiologic and genetic associations suggested links between a thinner retinal nerve fiber layer with glaucoma, thinner PS with age-related macular degeneration, and poor cardiometabolic and pulmonary function with a thinner PS. In conclusion, we identified multiple inherited genetic loci and acquired systemic cardio-metabolic-pulmonary conditions associated with thinner retinal layers and identify retinal layers wherein thinning is predictive of future ocular and systemic conditions.
Assuntos
Doenças Cardiovasculares , Estudo de Associação Genômica Ampla , Adulto , Humanos , Tomografia de Coerência Óptica , Face , Retina/diagnóstico por imagemRESUMO
Translating genetic discoveries for type 1 diabetes (T1D) into mechanistic insight can reveal novel biology and therapeutic targets but remains a major challenge. We developed the T1D Knowledge Portal (T1DKP), a disease-specific resource of genetic and functional annotation data that enables users to develop hypotheses for T1D-based research and target discovery. The T1DKP can be used to query genes and genomic regions for genetic associations, identify epigenomic features, access results of bioinformatic analyses, and obtain expert-curated resources. The T1DKP is available at http://t1d.hugeamp.org .
RESUMO
Expression quantitative trait locus (eQTL) studies illuminate genomic variants that regulate specific genes and contribute to fine-mapped loci discovered via genome-wide association studies (GWAS). Efforts to maximize their accuracy are ongoing. Using 240 glomerular (GLOM) and 311 tubulointerstitial (TUBE) micro-dissected samples from human kidney biopsies, we discovered 5371 GLOM and 9787 TUBE genes with at least one variant significantly associated with expression (eGene) by incorporating kidney single-nucleus open chromatin data and transcription start site distance as an "integrative prior" for Bayesian statistical fine-mapping. The use of an integrative prior resulted in higher resolution eQTLs illustrated by (1) smaller numbers of variants in credible sets with greater confidence, (2) increased enrichment of partitioned heritability for GWAS of two kidney traits, (3) an increased number of variants colocalized with the GWAS loci, and (4) enrichment of computationally predicted functional regulatory variants. A subset of variants and genes were validated experimentally in vitro and using a Drosophila nephrocyte model. More broadly, this study demonstrates that tissue-specific eQTL maps informed by single-nucleus open chromatin data have enhanced utility for diverse downstream analyses.
Assuntos
Estudo de Associação Genômica Ampla , Nefropatias , Humanos , Estudo de Associação Genômica Ampla/métodos , Teorema de Bayes , Nefropatias/genética , Genômica , Cromatina/genética , Polimorfismo de Nucleotídeo Único , Predisposição Genética para Doença/genéticaRESUMO
Next-generation sequencing technologies are making it possible to study the role of rare variants in human disease. Many studies balance statistical power with cost-effectiveness by (a) sampling from phenotypic extremes and (b) utilizing a two-stage design. Two-stage designs include a broad-based discovery phase and selection of a subset of potential causal genes/variants to be further examined in independent samples. We evaluate three parameters: first, the gain in statistical power due to extreme sampling to discover causal variants; second, the informativeness of initial (Phase I) association statistics to select genes/variants for follow-up; third, the impact of extreme and random sampling in (Phase 2) replication. We present a quantitative method to select individuals from the phenotypic extremes of a binary trait, and simulate disease association studies under a variety of sample sizes and sampling schemes. First, we find that while studies sampling from extremes have excellent power to discover rare variants, they have limited power to associate them to phenotypesuggesting high false-negative rates for upcoming studies. Second, consistent with previous studies, we find that the effect sizes estimated in these studies are expected to be systematically larger compared with the overall population effect size; in a well-cited lipids study, we estimate the reported effect to be twofold larger. Third, replication studies require large samples from the general population to have sufficient power; extreme sampling could reduce the required sample size as much as fourfold. Our observations offer practical guidance for the design and interpretation of studies that utilize extreme sampling.
Assuntos
Estudos de Associação Genética/métodos , Variação Genética , Simulação por Computador , Humanos , Modelos Genéticos , Modelos Estatísticos , Fenótipo , Tamanho da Amostra , Estudos de Amostragem , Análise de Sequência de DNARESUMO
Lipid droplets (LDs) are organelles of cellular lipid storage with fundamental roles in energy metabolism and cell membrane homeostasis. There has been an explosion of research into the biology of LDs, in part due to their relevance in diseases of lipid storage, such as atherosclerosis, obesity, type 2 diabetes, and hepatic steatosis. Consequently, there is an increasing need for a resource that combines datasets from systematic analyses of LD biology. Here, we integrate high-confidence, systematically generated human, mouse, and fly data from studies on LDs in the framework of an online platform named the "Lipid Droplet Knowledge Portal" (https://lipiddroplet.org/). This scalable and interactive portal includes comprehensive datasets, across a variety of cell types, for LD biology, including transcriptional profiles of induced lipid storage, organellar proteomics, genome-wide screen phenotypes, and ties to human genetics. This resource is a powerful platform that can be utilized to identify determinants of lipid storage.
Assuntos
Bases de Dados como Assunto , Gotículas Lipídicas/metabolismo , Animais , Ésteres do Colesterol/metabolismo , Mineração de Dados , Genoma , Humanos , Inflamação/patologia , Metabolismo dos Lipídeos , Fígado/metabolismo , Masculino , Camundongos Endogâmicos C57BL , Fenótipo , Fosforilação , Interferência de RNARESUMO
BACKGROUND: Common single-nucleotide polymorphisms (SNPs) that are associated with blood low-density lipoprotein (LDL) or high-density lipoprotein (HDL) cholesterol modestly affect lipid levels. We tested the hypothesis that a combination of such SNPs contributes to the risk of cardiovascular disease. METHODS: We studied SNPs at nine loci in 5414 subjects from the cardiovascular cohort of the Malmö Diet and Cancer Study. We first validated the association between SNPs and either LDL or HDL cholesterol and subsequently created a genotype score on the basis of the number of unfavorable alleles. We used Cox proportional-hazards models to determine the time to the first cardiovascular event in relation to the genotype score. RESULTS: All nine SNPs showed replication of an association with levels of either LDL or HDL cholesterol. With increasing genotype scores, the level of LDL cholesterol increased from 152 mg to 171 mg per deciliter (3.9 to 4.4 mmol per liter), whereas HDL cholesterol decreased from 60 mg to 51 mg per deciliter (1.6 to 1.3 mmol per liter). During follow-up (median, 10.6 years), 238 subjects had a first cardiovascular event. The genotype score was associated with incident cardiovascular disease in models adjusted for covariates including baseline lipid levels (P<0.001). The use of the genotype score did not improve the clinical risk prediction, as assessed by the C statistic. However, there was a significant improvement in risk classification with the use of models that included the genotype score, as compared with those that did not include the genotype score. CONCLUSIONS: A genotype score of nine validated SNPs that are associated with modulation in levels of LDL or HDL cholesterol was an independent risk factor for incident cardiovascular disease. The score did not improve risk discrimination but did modestly improve clinical risk reclassification for individual subjects beyond standard clinical factors.