RESUMO
Autozygosity is associated with rare Mendelian disorders and clinically relevant quantitative traits. We investigated associations between the fraction of the genome in runs of homozygosity (FROH) and common diseases in Genes & Health (n = 23,978 British South Asians), UK Biobank (n = 397,184), and 23andMe. We show that restricting analysis to offspring of first cousins is an effective way of reducing confounding due to social/environmental correlates of FROH. Within this group in G&H+UK Biobank, we found experiment-wide significant associations between FROH and twelve common diseases. We replicated associations with type 2 diabetes (T2D) and post-traumatic stress disorder via within-sibling analysis in 23andMe (median n = 480,282). We estimated that autozygosity due to consanguinity accounts for 5%-18% of T2D cases among British Pakistanis. Our work highlights the possibility of widespread non-additive genetic effects on common diseases and has important implications for global populations with high rates of consanguinity.
Assuntos
Consanguinidade , Diabetes Mellitus Tipo 2 , Humanos , Diabetes Mellitus Tipo 2/genética , Homozigoto , Fenótipo , Polimorfismo de Nucleotídeo Único , Bancos de Espécimes Biológicos , Genoma Humano , Predisposição Genética para Doença , Reino UnidoRESUMO
Most loci identified by GWASs have been found in populations of European ancestry (EUR). In trans-ethnic meta-analyses for 15 hematological traits in 746,667 participants, including 184,535 non-EUR individuals, we identified 5,552 trait-variant associations at p < 5 × 10-9, including 71 novel associations not found in EUR populations. We also identified 28 additional novel variants in ancestry-specific, non-EUR meta-analyses, including an IL7 missense variant in South Asians associated with lymphocyte count in vivo and IL-7 secretion levels in vitro. Fine-mapping prioritized variants annotated as functional and generated 95% credible sets that were 30% smaller when using the trans-ethnic as opposed to the EUR-only results. We explored the clinical significance and predictive value of trans-ethnic variants in multiple populations and compared genetic architecture and the effect of natural selection on these blood phenotypes between populations. Altogether, our results for hematological traits highlight the value of a more global representation of populations in genetic studies.
Assuntos
Povo Asiático/genética , Mutação de Sentido Incorreto/genética , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética , Genética , Estudo de Associação Genômica Ampla/métodos , Células HEK293 , Humanos , Interleucina-7/genética , FenótipoRESUMO
Human genetic studies of common variants have provided substantial insight into the biological mechanisms that govern ovarian ageing1. Here we report analyses of rare protein-coding variants in 106,973 women from the UK Biobank study, implicating genes with effects around five times larger than previously found for common variants (ETAA1, ZNF518A, PNPLA8, PALB2 and SAMHD1). The SAMHD1 association reinforces the link between ovarian ageing and cancer susceptibility1, with damaging germline variants being associated with extended reproductive lifespan and increased all-cause cancer risk in both men and women. Protein-truncating variants in ZNF518A are associated with shorter reproductive lifespan-that is, earlier age at menopause (by 5.61 years) and later age at menarche (by 0.56 years). Finally, using 8,089 sequenced trios from the 100,000 Genomes Project (100kGP), we observe that common genetic variants associated with earlier ovarian ageing associate with an increased rate of maternally derived de novo mutations. Although we were unable to replicate the finding in independent samples from the deCODE study, it is consistent with the expected role of DNA damage response genes in maintaining the genetic integrity of germ cells. This study provides evidence of genetic links between age of menopause and cancer risk.
Assuntos
Envelhecimento , Predisposição Genética para Doença , Menopausa , Taxa de Mutação , Neoplasias , Ovário , Adulto , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Envelhecimento/genética , Envelhecimento/patologia , Dano ao DNA/genética , Fertilidade/genética , Predisposição Genética para Doença/genética , Variação Genética/genética , Genoma Humano/genética , Mutação em Linhagem Germinativa/genética , Menarca/genética , Menopausa/genética , Neoplasias/genética , Ovário/metabolismo , Ovário/patologia , Fatores de Tempo , Biobanco do Reino Unido , Reino Unido/epidemiologiaRESUMO
BACKGROUND: Type 2 diabetes (T2D) is highly prevalent in British South Asians, yet they are underrepresented in research. Genes & Health (G&H) is a large, population study of British Pakistanis and Bangladeshis (BPB) comprising genomic and routine health data. We assessed the extent to which genetic risk for T2D is shared between BPB and European populations (EUR). We then investigated whether the integration of a polygenic risk score (PRS) for T2D with an existing risk tool (QDiabetes) could improve prediction of incident disease and the characterisation of disease subtypes. METHODS AND FINDINGS: In this observational cohort study, we assessed whether common genetic loci associated with T2D in EUR individuals were replicated in 22,490 BPB individuals in G&H. We replicated fewer loci in G&H (n = 76/338, 22%) than would be expected given power if all EUR-ascertained loci were transferable (n = 101, 30%; p = 0.001). Of the 27 transferable loci that were powered to interrogate this, only 9 showed evidence of shared causal variants. We constructed a T2D PRS and combined it with a clinical risk instrument (QDiabetes) in a novel, integrated risk tool (IRT) to assess risk of incident diabetes. To assess model performance, we compared categorical net reclassification index (NRI) versus QDiabetes alone. In 13,648 patients free from T2D followed up for 10 years, NRI was 3.2% for IRT versus QDiabetes (95% confidence interval (CI): 2.0% to 4.4%). IRT performed best in reclassification of individuals aged less than 40 years deemed low risk by QDiabetes alone (NRI 5.6%, 95% CI 3.6% to 7.6%), who tended to be free from comorbidities and slim. After adjustment for QDiabetes score, PRS was independently associated with progression to T2D after gestational diabetes (hazard ratio (HR) per SD of PRS 1.23, 95% CI 1.05 to 1.42, p = 0.028). Using cluster analysis of clinical features at diabetes diagnosis, we replicated previously reported disease subgroups, including Mild Age-Related, Mild Obesity-related, and Insulin-Resistant Diabetes, and showed that PRS distribution differs between subgroups (p = 0.002). Integrating PRS in this cluster analysis revealed a Probable Severe Insulin Deficient Diabetes (pSIDD) subgroup, despite the absence of clinical measures of insulin secretion or resistance. We also observed differences in rates of progression to micro- and macrovascular complications between subgroups after adjustment for confounders. Study limitations include the absence of an external replication cohort and the potential biases arising from missing or incorrect routine health data. CONCLUSIONS: Our analysis of the transferability of T2D loci between EUR and BPB indicates the need for larger, multiancestry studies to better characterise the genetic contribution to disease and its varied aetiology. We show that a T2D PRS optimised for this high-risk BPB population has potential clinical application in BPB, improving the identification of T2D risk (especially in the young) on top of an established clinical risk algorithm and aiding identification of subgroups at diagnosis, which may help future efforts to stratify care and treatment of the disease.
Assuntos
Diabetes Mellitus Tipo 2 , Povo Asiático , Estudos de Coortes , Diabetes Mellitus Tipo 2/diagnóstico , Diabetes Mellitus Tipo 2/epidemiologia , Diabetes Mellitus Tipo 2/genética , Feminino , Humanos , Insulina , Paquistão/epidemiologia , Fatores de RiscoRESUMO
Cytokines are essential regulatory components of the immune system, and their aberrant levels have been linked to many disease states. Despite increasing evidence that cytokines operate in concert, many of the physiological interactions between cytokines, and the shared genetic architecture that underlies them, remain unknown. Here, we aimed to identify and characterize genetic variants with pleiotropic effects on cytokines. Using three population-based cohorts (n = 9,263), we performed multivariate genome-wide association studies (GWAS) for a correlation network of 11 circulating cytokines, then combined our results in meta-analysis. We identified a total of eight loci significantly associated with the cytokine network, of which two (PDGFRB and ABO) had not been detected previously. In addition, conditional analyses revealed a further four secondary signals at three known cytokine loci. Integration, through the use of Bayesian colocalization analysis, of publicly available GWAS summary statistics with the cytokine network associations revealed shared causal variants between the eight cytokine loci and other traits; in particular, cytokine network variants at the ABO, SERPINE2, and ZFPM2 loci showed pleiotropic effects on the production of immune-related proteins, on metabolic traits such as lipoprotein and lipid levels, on blood-cell-related traits such as platelet count, and on disease traits such as coronary artery disease and type 2 diabetes.
Assuntos
Biomarcadores/análise , Doenças Cardiovasculares/genética , Citocinas/genética , Pleiotropia Genética , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Adolescente , Adulto , Idoso , Proteínas Sanguíneas/genética , Proteínas Sanguíneas/imunologia , Doenças Cardiovasculares/imunologia , Doenças Cardiovasculares/patologia , Criança , Citocinas/imunologia , Feminino , Seguimentos , Redes Reguladoras de Genes , Predisposição Genética para Doença , Genoma Humano , Humanos , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Prognóstico , Estudos Prospectivos , Adulto JovemRESUMO
Investigation of the genetic architecture of gene expression traits has aided interpretation of disease and trait-associated genetic variants; however, key aspects of expression quantitative trait loci (eQTL) study design and analysis remain understudied. We used extensive, empirically driven simulations to explore eQTL study design and the performance of various analysis strategies. Across multiple testing correction methods, false discoveries of genes with eQTLs (eGenes) were substantially inflated when false discovery rate (FDR) control was applied to all tests and only appropriately controlled using hierarchical procedures. All multiple testing correction procedures had low power and inflated FDR for eGenes whose causal SNPs had small allele frequencies using small sample sizes (e.g. frequency <10% in 100 samples), indicating that even moderately low frequency eQTL SNPs (eSNPs) in these studies are enriched for false discoveries. In scenarios with ≥80% power, the top eSNP was the true simulated eSNP 90% of the time, but substantially less frequently for very common eSNPs (minor allele frequencies >25%). Overestimation of eQTL effect sizes, so-called 'Winner's Curse', was common in low and moderate power settings. To address this, we developed a bootstrap method (BootstrapQTL) that led to more accurate effect size estimation. These insights provide a foundation for future eQTL studies, especially those with sampling constraints and subtly different conditions.
Assuntos
Genoma Humano/genética , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas/genética , Algoritmos , Simulação por Computador , Expressão Gênica , Frequência do Gene , Predisposição Genética para Doença/genética , Humanos , Modelos Genéticos , Tamanho da AmostraRESUMO
Traditional genome-wide scans for positive selection have mainly uncovered selective sweeps associated with monogenic traits. While selection on quantitative traits is much more common, very few signals have been detected because of their polygenic nature. We searched for positive selection signals underlying coronary artery disease (CAD) in worldwide populations, using novel approaches to quantify relationships between polygenic selection signals and CAD genetic risk. We identified new candidate adaptive loci that appear to have been directly modified by disease pressures given their significant associations with CAD genetic risk. These candidates were all uniquely and consistently associated with many different male and female reproductive traits suggesting selection may have also targeted these because of their direct effects on fitness. We found that CAD loci are significantly enriched for lifetime reproductive success relative to the rest of the human genome, with evidence that the relationship between CAD and lifetime reproductive success is antagonistic. This supports the presence of antagonistic-pleiotropic tradeoffs on CAD loci and provides a novel explanation for the maintenance and high prevalence of CAD in modern humans. Lastly, we found that positive selection more often targeted CAD gene regulatory variants using HapMap3 lymphoblastoid cell lines, which further highlights the unique biological significance of candidate adaptive loci underlying CAD. Our study provides a novel approach for detecting selection on polygenic traits and evidence that modern human genomes have evolved in response to CAD-induced selection pressures and other early-life traits sharing pleiotropic links with CAD.
Assuntos
Doença da Artéria Coronariana/genética , Loci Gênicos , Pleiotropia Genética , Seleção Genética , Aptidão Genética , Projeto HapMap , Humanos , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Polygenic risk scores (PRSs) are an emerging tool to predict the clinical phenotypes and outcomes of individuals. We propose PRSmix, a framework that leverages the PRS corpus of a target trait to improve prediction accuracy, and PRSmix+, which incorporates genetically correlated traits to better capture the human genetic architecture for 47 and 32 diseases/traits in European and South Asian ancestries, respectively. PRSmix demonstrated a mean prediction accuracy improvement of 1.20-fold (95% confidence interval [CI], [1.10; 1.3]; p = 9.17 × 10-5) and 1.19-fold (95% CI, [1.11; 1.27]; p = 1.92 × 10-6), and PRSmix+ improved the prediction accuracy by 1.72-fold (95% CI, [1.40; 2.04]; p = 7.58 × 10-6) and 1.42-fold (95% CI, [1.25; 1.59]; p = 8.01 × 10-7) in European and South Asian ancestries, respectively. Compared to the previously cross-trait-combination methods with scores from pre-defined correlated traits, we demonstrated that our method improved prediction accuracy for coronary artery disease up to 3.27-fold (95% CI, [2.1; 4.44]; p value after false discovery rate (FDR) correction = 2.6 × 10-4). Our method provides a comprehensive framework to benchmark and leverage the combined power of PRS for maximal performance in a desired target population.
Assuntos
Doença da Artéria Coronariana , Osteopatia , Humanos , Herança Multifatorial/genética , Estratificação de Risco Genético , Benchmarking , Doença da Artéria Coronariana/diagnósticoRESUMO
Most genome-wide association studies (GWAS) of major depression (MD) have been conducted in samples of European ancestry. Here we report a multi-ancestry GWAS of MD, adding data from 21 cohorts with 88,316 MD cases and 902,757 controls to previously reported data. This analysis used a range of measures to define MD and included samples of African (36% of effective sample size), East Asian (26%) and South Asian (6%) ancestry and Hispanic/Latin American participants (32%). The multi-ancestry GWAS identified 53 significantly associated novel loci. For loci from GWAS in European ancestry samples, fewer than expected were transferable to other ancestry groups. Fine mapping benefited from additional sample diversity. A transcriptome-wide association study identified 205 significantly associated novel genes. These findings suggest that, for MD, increasing ancestral and global diversity in genetic studies may be particularly important to ensure discovery of core genes and inform about transferability of findings.
Assuntos
Transtorno Depressivo Maior , Estudo de Associação Genômica Ampla , Humanos , Predisposição Genética para Doença , Transtorno Depressivo Maior/genética , Depressão , Mapeamento Cromossômico , Polimorfismo de Nucleotídeo Único/genéticaRESUMO
Polygenic risk scores (PRS) are an emerging tool to predict the clinical phenotypes and outcomes of individuals. Validation and transferability of existing PRS across independent datasets and diverse ancestries are limited, which hinders the practical utility and exacerbates health disparities. We propose PRSmix, a framework that evaluates and leverages the PRS corpus of a target trait to improve prediction accuracy, and PRSmix+, which incorporates genetically correlated traits to better capture the human genetic architecture. We applied PRSmix to 47 and 32 diseases/traits in European and South Asian ancestries, respectively. PRSmix demonstrated a mean prediction accuracy improvement of 1.20-fold (95% CI: [1.10; 1.3]; P-value = 9.17 × 10-5) and 1.19-fold (95% CI: [1.11; 1.27]; P-value = 1.92 × 10-6), and PRSmix+ improved the prediction accuracy by 1.72-fold (95% CI: [1.40; 2.04]; P-value = 7.58 × 10-6) and 1.42-fold (95% CI: [1.25; 1.59]; P-value = 8.01 × 10-7) in European and South Asian ancestries, respectively. Compared to the previously established cross-trait-combination method with scores from pre-defined correlated traits, we demonstrated that our method can improve prediction accuracy for coronary artery disease up to 3.27-fold (95% CI: [2.1; 4.44]; P-value after FDR correction = 2.6 × 10-4). Our method provides a comprehensive framework to benchmark and leverage the combined power of PRS for maximal performance in a desired target population.
RESUMO
Our understanding of the genetics of the human cerebral cortex is limited both in terms of the diversity and the anatomical granularity of brain structural phenotypes. Here we conducted a genome-wide association meta-analysis of 13 structural and diffusion magnetic resonance imaging-derived cortical phenotypes, measured globally and at 180 bilaterally averaged regions in 36,663 individuals and identified 4,349 experiment-wide significant loci. These phenotypes include cortical thickness, surface area, gray matter volume, measures of folding, neurite density and water diffusion. We identified four genetic latent structures and causal relationships between surface area and some measures of cortical folding. These latent structures partly relate to different underlying gene expression trajectories during development and are enriched for different cell types. We also identified differential enrichment for neurodevelopmental and constrained genes and demonstrate that common genetic variants associated with cortical expansion are associated with cephalic disorders. Finally, we identified complex interphenotype and inter-regional genetic relationships among the 13 phenotypes, reflecting the developmental differences among them. Together, these analyses identify distinct genetic organizational principles of the cortex and their correlates with neurodevelopment.
Assuntos
Córtex Cerebral , Estudo de Associação Genômica Ampla , Humanos , Córtex Cerebral/diagnóstico por imagem , Encéfalo/diagnóstico por imagem , Neuroimagem , FenótipoRESUMO
CONTEXT: The melanocortin 3 receptor (MC3R) has recently emerged as a critical regulator of pubertal timing, linear growth, and the acquisition of lean mass in humans and mice. In population-based studies, heterozygous carriers of deleterious variants in MC3R report a later onset of puberty than noncarriers. However, the frequency of such variants in patients who present with clinical disorders of pubertal development is currently unknown. OBJECTIVE: This work aimed to determine whether deleterious MC3R variants are more frequently found in patients clinically presenting with constitutional delay of growth and puberty (CDGP) or normosmic idiopathic hypogonadotropic hypogonadism (nIHH). METHODS: We examined the sequence of MC3R in 362 adolescents with a clinical diagnosis of CDGP and 657 patients with nIHH, experimentally characterized the signaling properties of all nonsynonymous variants found and compared their frequency to that in 5774 controls from a population-based cohort. Additionally, we established the relative frequency of predicted deleterious variants in individuals with self-reported delayed vs normally timed menarche/voice-breaking in the UK Biobank cohort. RESULTS: MC3R loss-of-function variants were infrequent but overrepresented in patients with CDGP (8/362 [2.2%]; OR = 4.17; P = .001). There was no strong evidence of overrepresentation in patients with nIHH (4/657 [0.6%]; OR = 1.15; P = .779). In 246 328 women from the UK Biobank, predicted deleterious variants were more frequently found in those self-reporting delayed (aged ≥16 years) vs normal age at menarche (OR = 1.66; P = 3.90E-07). CONCLUSION: We have found evidence that functionally damaging variants in MC3R are overrepresented in individuals with CDGP but are not a common cause of this phenotype.
Assuntos
Hipogonadismo , Puberdade Tardia , Adolescente , Humanos , Feminino , Animais , Camundongos , Receptor Tipo 3 de Melanocortina , Prevalência , Hipogonadismo/epidemiologia , Hipogonadismo/genética , Hipogonadismo/complicações , Puberdade Tardia/epidemiologia , Puberdade Tardia/genética , Puberdade Tardia/diagnóstico , Puberdade/genética , Transtornos do Crescimento/genéticaRESUMO
In the title compound, C(13)H(12)N(2)O(4), the dihedral angle between the benzene and pyrimidine rings is 55.57â (13)°. The carbonyl group and the two methoxyl groups are approximately coplanar with the benzene ring and pyrimidine ring; the C-C-C-O, C-O-C-N and C-O-C-C torsion angles being -6.1â (5), -4.8â (4) and 179.9â (3)°, respectively. In the crystal, mol-ecules are linked via C-Hâ¯O inter-actions, forming chains propagating along [110].
RESUMO
Individuals with South Asian ancestry have a higher risk of heart disease than other groups but have been largely excluded from genetic research. Using data from 22,000 British Pakistani and Bangladeshi individuals with linked electronic health records from the Genes & Health cohort, we conducted genome-wide association studies of coronary artery disease and its key risk factors. Using power-adjusted transferability ratios, we found evidence for transferability for the majority of cardiometabolic loci powered to replicate. The performance of polygenic scores was high for lipids and blood pressure, but lower for BMI and coronary artery disease. Adding a polygenic score for coronary artery disease to clinical risk factors showed significant improvement in reclassification. In Mendelian randomisation using transferable loci as instruments, our findings were consistent with results in European-ancestry individuals. Taken together, trait-specific transferability of trait loci between populations is an important consideration with implications for risk prediction and causal inference.
Assuntos
Doença da Artéria Coronariana , Estudo de Associação Genômica Ampla , Povo Asiático/genética , Doença da Artéria Coronariana/epidemiologia , Doença da Artéria Coronariana/genética , Loci Gênicos , Humanos , Paquistão , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Previous genetic and public health research in the Pakistani population has focused on the role of consanguinity in increasing recessive disease risk, but little is known about its recent population history or the effects of endogamy. Here, we investigate fine-scale population structure, history and consanguinity patterns using genotype chip data from 2,200 British Pakistanis. We reveal strong recent population structure driven by the biraderi social stratification system. We find that all subgroups have had low recent effective population sizes (Ne), with some showing a decrease 15â20 generations ago that has resulted in extensive identity-by-descent sharing and homozygosity, increasing the risk of recessive disorders. Our results from two orthogonal methods (one using machine learning and the other coalescent-based) suggest that the detailed reporting of parental relatedness for mothers in the cohort under-represents the true levels of consanguinity. These results demonstrate the impact of cultural practices on population structure and genomic diversity in Pakistanis, and have important implications for medical genetic studies.
Assuntos
Povo Asiático/genética , Consanguinidade , Genética Populacional , População Branca/genética , Estudos de Coortes , Demografia , Genótipo , Homozigoto , Humanos , Casamento , Modelos Genéticos , Paquistão , Pais , Densidade Demográfica , Status SocialRESUMO
Chronic immune-mediated diseases of adulthood often originate in early childhood. To investigate genetic associations between neonatal immunity and disease, we map expression quantitative trait loci (eQTLs) in resting myeloid cells and CD4+ T cells from cord blood samples, as well as in response to lipopolysaccharide (LPS) or phytohemagglutinin (PHA) stimulation, respectively. Cis-eQTLs are largely specific to cell type or stimulation, and 31% and 52% of genes with cis-eQTLs have response eQTLs (reQTLs) in myeloid cells and T cells, respectively. We identified cis regulatory factors acting as mediators of trans effects. There is extensive colocalisation between condition-specific neonatal cis-eQTLs and variants associated with immune-mediated diseases, in particular CTSH had widespread colocalisation across diseases. Mendelian randomisation shows causal neonatal gene expression effects on disease risk for BTN3A2, HLA-C and others. Our study elucidates the genetics of gene expression in neonatal immune cells, and aetiological origins of autoimmune and allergic diseases.
Assuntos
Doenças Autoimunes/genética , Desenvolvimento Infantil/fisiologia , Regulação da Expressão Gênica no Desenvolvimento/imunologia , Hipersensibilidade/genética , Locos de Características Quantitativas/imunologia , Doenças Autoimunes/imunologia , Butirofilinas/genética , Butirofilinas/metabolismo , Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD4-Positivos/metabolismo , Catepsina H/genética , Catepsina H/metabolismo , Criança , Pré-Escolar , Conjuntos de Dados como Assunto , Sangue Fetal/citologia , Perfilação da Expressão Gênica , Redes Reguladoras de Genes/imunologia , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Antígenos HLA-C/genética , Antígenos HLA-C/metabolismo , Humanos , Hipersensibilidade/imunologia , Lactente , Recém-Nascido , Análise da Randomização Mendeliana , Células Mieloides/imunologia , Células Mieloides/metabolismo , Análise de Sequência com Séries de Oligonucleotídeos , Polimorfismo de Nucleotídeo Único , Estudos ProspectivosRESUMO
By sequencing autozygous human populations, we identified a healthy adult woman with lifelong complete knockout of HAO1 (expected ~1 in 30 million outbred people). HAO1 (glycolate oxidase) silencing is the mechanism of lumasiran, an investigational RNA interference therapeutic for primary hyperoxaluria type 1. Her plasma glycolate levels were 12 times, and urinary glycolate 6 times, the upper limit of normal observed in healthy reference individuals (n = 67). Plasma metabolomics and lipidomics (1871 biochemicals) revealed 18 markedly elevated biochemicals (>5 sd outliers versus n = 25 controls) suggesting additional HAO1 effects. Comparison with lumasiran preclinical and clinical trial data suggested she has <2% residual glycolate oxidase activity. Cell line p.Leu333SerfsTer4 expression showed markedly reduced HAO1 protein levels and cellular protein mis-localisation. In this woman, lifelong HAO1 knockout is safe and without clinical phenotype, de-risking a therapeutic approach and informing therapeutic mechanisms. Unlocking evidence from the diversity of human genetic variation can facilitate drug development.