ABSTRACT
INTRODUCTION: This case report presents two patients affected by a very rare association of bilateral retinoblastoma and osteogenesis imperfecta. CASE REPORT: Two Caucasian males with familial history and clinical signs of osteogenesis imperfecta came to our attention for bilateral leukocoria. The ocular fundus examination revealed bilateral retinoblastoma. Proper therapies were dispensed in order to achieve full regression. Genetic counseling was performed. DISCUSSION: The primary role of genetics in retinoblastoma pathogenesis in widely known, and different genes have been identified. Osteogenesis imperfecta is a rare connective tissue disorders, caused by mutated genes encoding for collagen. The single gene defect in osteogenesis imperfecta type VI is Serpin Family F Member 1 (SERPINF1), a neurotrophic factor for the neuronal differentiation in retinoblastoma cells. The association of bilateral retinoblastoma and osteogenesis imperfecta could be the result of the mutation of a single gene playing a role in a hypothetical common pathway.
Subject(s)
Osteogenesis Imperfecta , Retinal Neoplasms , Retinoblastoma , Serpins , Eye Proteins/genetics , Humans , Male , Mutation , Osteogenesis Imperfecta/diagnosis , Osteogenesis Imperfecta/genetics , Retinal Neoplasms/diagnosis , Retinal Neoplasms/genetics , Retinoblastoma/diagnosis , Retinoblastoma/genetics , Serpins/geneticsABSTRACT
PURPOSE: Lung cancer is strongly associated to tobacco smoking. However, global statistics estimate that in females the proportion of lung cancer cases that is unrelated to tobacco smoking reaches fifty percent, making questionable the etiology of the disease. MATERIALS AND METHODS: A never-smoker female with primary EGFR/KRAS/ALK-negative squamous cell carcinoma of the lung and their normal sibswere subjected to a novel integrative “omic” approach using a pedigree-based model for discovering genetic factors leading to cancer in the absence of well-known environmental trigger. A first-stepwhole-exome sequencing on tumor and normal tissue did not identify mutations in known driver genes. Building on the idea of a germline oligogenic origin of lung cancer, we performed whole-exome sequencing of DNA from patients' peripheral blood and their unaffected sibs. Finally, RNA-sequencing analysis in tumoral and matched non-tumoral tissues was carried out in order to investigate the clonal profile and the pathogenic role of the identified variants. RESULTS: Filtering for rare variants with Combined Annotation Dependent Depletion (CADD) > 25 and potentially damaging effect, we identified rare/private germline deleterious variants in 11 cancer-associated genes, none ofwhich, except one, sharedwith the healthy sib, pinpointing to a “private” oligogenic germline signature. Noteworthy, among these, two mutated genes, namely ACACA and DEPTOR, turned to be potential targets for therapy because related to known drivers, such as BRCA1 and EGFR. CONCLUSION: In the era of precision medicine, this report emphasizes the importance of an “omic” approach to uncover oligogenic germline signature underlying cancer development and to identify suitable therapeutic targets as well.
Subject(s)
Female , Humans , Carcinoma, Squamous Cell , Disease Susceptibility , DNA , Epithelial Cells , Exome , High-Throughput Nucleotide Sequencing , Lung Neoplasms , Lung , Multifactorial Inheritance , Precision Medicine , SmokingABSTRACT
BackgroundIncreased vitamin D levels, as reflected by 25OHD measurements, have been proposed to protect against COVID-19 disease based on in-vitro, observational, and ecological studies. However, vitamin D levels are associated with many confounding variables and thus associations described to date may not be causal. Vitamin D Mendelian randomization (MR) studies have provided results that are concordant with large-scale vitamin D randomized trials. Here, we used two-sample MR to assess evidence supporting a causal effect of circulating 25OHD levels on COVID-19 susceptibility and severity. Methods and findingsGenetic variants strongly associated with 25OHD levels in a genome-wide association study (GWAS) of 443,734 participants of European ancestry (including 401,460 from the UK Biobank) were used as instrumental variables. GWASs of COVID-19 susceptibility, hospitalization, and severe disease from the COVID-19 Host Genetics Initiative were used as outcome GWASs. These included up to 14,134 individuals with COVID-19, and 1,284,876 without COVID-19, from 11 countries. SARS-CoV-2 positivity was determined by laboratory testing or medical chart review. Population controls without COVID-19 were also included in the control groups for all outcomes, including hospitalization and severe disease. Analyses were restricted to individuals of European descent when possible. Using inverse-weighted MR, genetically increased 25OHD levels by one standard deviation on the logarithmic scale had no clear association with COVID-19 susceptibility (OR = 0.97; 95% CI: 0.95, 1.10; P=0.61), hospitalization (OR = 1.11; 95% CI: 0.91, 1.35; P=0.30), and severe disease (OR = 0.93; 95% CI: 0.73, 1.17; P=0.53). We used an additional 6 meta-analytic methods, as well as sensitivity analyses after removal of variants at risk of horizontal pleiotropy and obtained similar results. These results may be limited by weak instrument bias in some analyses. Further, our results do not apply to individuals with vitamin D deficiency. ConclusionIn this two-sample MR study, we did not observe evidence to support an association between 25OHD levels and COVID-19 susceptibility, severity, or hospitalization. Hence, vitamin D supplementation as a mean of protecting against worsened COVID-19 outcomes is not supported by genetic evidence. Other therapeutic or preventative avenues should be given higher priority for COVID-19 randomized controlled trials. Author SummaryO_LIWhy was this study done? - Vitamin D levels have been associated with COVID-19 outcomes in multiple observational studies, though confounders are likely to bias these associations. - By using genetic instruments which limit such confounding, Mendelian randomization studies have consistently obtained results concordant with vitamin D supplementation randomized trials. This provides rationale to undertake vitamin D Mendelian randomization studies for COVID-19 outcomes. C_LIO_LIWhat did the researchers do and find? - We used the genetic variants obtained from the largest consortium of COVID-19 cases and controls, and the largest study on genetic determinants of vitamin D levels. We used Mendelian randomization to estimate the effect of increased vitamin D on COVID-19 outcomes, while limiting confounding. - In multiple analyses, our results consistently showed no evidence for an association between genetically predicted vitamin D levels and COVID-19 susceptibility, hospitalization, or severe disease. C_LIO_LIWhat do these findings mean? - Vitamin D is a highly confounded variable, and traditional observational studies are at high risk of biased estimates. - We did not find evidence that vitamin D supplementation would improve COVID-19 outcomes. - Given Mendelian randomizations past track-record of anticipating the results of vitamin D randomized controlled trials, other therapeutic and preventative avenues should be prioritized for COVID-19 trials. C_LI
ABSTRACT
BackgroundCOVID-19 clinical presentation ranges from asymptomatic to fatal outcome. This variability is due in part to host genome specific mutations. Recently, two families in which COVID-19 segregates like an X-linked recessive monogenic disorder environmentally conditioned by SARS-CoV-2 have been reported leading to identification of loss-of-function variants in TLR7. ObjectiveWe sought to determine whether the two families represent the tip of the iceberg of a subset of COVID-19 male patients. MethodsWe compared male subjects with extreme phenotype selected from the Italian GEN-COVID cohort of 1178 SARS-CoV-2-infected subjects (<60y, 79 severe cases versus 77 control cases). We applied the LASSO Logistic Regression analysis, considering only rare variants on the young male subset, picking up TLR7 as the most important susceptibility gene. ResultsRare TLR7 missense variants were predicted to impact on protein function in severely affected males and in none of the asymptomatic subjects. We then investigated a similar white European cohort in Spain, confirming the impact of TRL7 variants. A gene expression profile analysis in peripheral blood mononuclear cells after stimulation with TLR7 agonist demonstrated a reduction of mRNA level of TLR7, IRF7, ISG15, IFN-{square} and IFN-{gamma} in COVID-19 patients compared with unaffected controls demonstrating an impairment in type I and II INF responses. ConclusionYoung males with TLR7 loss-of-function mutations and severe COVID-19 in the two reported families represent only a fraction of a broader and complex host genome situation. Specifically, missense mutations in the X-linked recessive TLR7 disorder may significantly contribute to disease susceptibility in up to 4% of severe COVID-19. Clinical ImplicationIn this new yet complex scenario, our observations provide the basis for a personalized interferon-based therapy in patients with rare TLR7 variants. CAPSULE SUMMARYOur results in large cohorts from Italy and Spain showed that X-linked recessive TLR7 disorder may represent the cause of disease susceptibility to COVID-19 in up to 4% of severely affected young male cases.
ABSTRACT
Within the GEN-COVID Multicenter Study, biospecimens from more than 1,000 SARS-CoV-2-positive individuals have thus far been collected in the GEN-COVID Biobank (GCB). Sample types include whole blood, plasma, serum, leukocytes, and DNA. The GCB links samples to detailed clinical data available in the GEN-COVID Patient Registry (GCPR). It includes hospitalized patients (74.25%), broken down into intubated, treated by CPAP-biPAP, treated with O2 supplementation, and without respiratory support (9.5%, 18.4%, 31.55% and 14.8, respectively); and non-hospitalized subjects (25.75%), either pauci- or asymptomatic. More than 150 clinical patient-level data fields have been collected and binarized for further statistics according to the organs/systems primarily affected by COVID-19: heart, liver, pancreas, kidney, chemosensors, innate or adaptive immunity, and clotting system. Hierarchical Clustering analysis identified five main clinical categories: i) severe multisystemic failure with either thromboembolic or pancreatic variant; ii) cytokine storm type, either severe with liver involvement or moderate; iii) moderate heart type, either with or without liver damage; iv) moderate multisystemic involvement, either with or without liver damage; v) mild, either with or without hyposmia. GCB and GCPR are further linked to the GEN-COVID Genetic Data Repository (GCGDR), which includes data from Whole Exome Sequencing and high-density SNP genotyping. The data are available for sharing through the Network for Italian Genomes, found within the COVID-19 dedicated section. The study objective is to systematize this comprehensive data collection and begin identifying multi-organ involvement in COVID-19, defining genetic parameters for infection susceptibility within the population and mapping genetically COVID-19 severity and clinical complexity among patients.
ABSTRACT
In December 2019, an initial cluster of interstitial bilateral pneumonia emerged in Wuhan, China. A human-to-human transmission was assumed and a previously unrecognized entity, termed coronavirus-disease-19 (COVID-19) due to a novel coronavirus (SARS-CoV-2) was described. The infection has rapidly spread out all over the world and Italy has been the first European country experiencing the endemic wave with unexpected clinical severity in comparison with Asian countries. It has been shown that SARS-CoV-2 utilizes angiotensin converting enzyme 2 (ACE2) as host receptor and host proteases for cell surface binding and internalization. Thus, a predisposing genetic background can give reason for inter-individual disease susceptibility and/or severity. Taking advantage of the Network of Italian Genomes (NIG), here we mined whole-exome-sequencing data of 6930 Italian control individuals from five different centers looking for ACE2 variants. A number of variants with a potential impact on protein stability were identified. Among these, three more common missense changes, p.(Asn720Asp), p.(Lys26Arg), p.(Gly211Arg) were predicted to interfere with protein structure and stabilization. Rare variants likely interfering with the internalization process, namely p.(Leu351Val) and p.(Pro389His), predicted to interfere with SARS-CoV-2 spike protein binding, were also observed. Comparison of ACE2 WES data between a cohort of 131 patients and 258 controls allowed identifying a statistically significant (P value <0,029) higher allelic variability in controls compared to patients. These findings suggest that a predisposing genetic background may contribute to the observed inter-individual clinical variability associated with COVID-19, allowing an evidence-based risk assessment leading to personalized preventive measures and therapeutic options.
ABSTRACT
The determinants of severe COVID-19 in non-elderly adults are poorly understood, which limits opportunities for early intervention and treatment. Here we present novel machine learning frameworks for identifying common and rare disease-associated genetic variation, which outperform conventional approaches. By integrating single-cell multiomics profiling of human lungs to link genetic signals to cell-type-specific functions, we have discovered and validated over 1,000 risk genes underlying severe COVID-19 across 19 cell types. Identified risk genes are overexpressed in healthy lungs but relatively downregulated in severely diseased lungs. Genetic risk for severe COVID-19, within both common and rare variants, is particularly enriched in natural killer (NK) cells, which places these immune cells upstream in the pathogenesis of severe disease. Mendelian randomization indicates that failed NKG2D-mediated activation of NK cells leads to critical illness. Network analysis further links multiple pathways associated with NK cell activation, including type-I-interferon-mediated signalling, to severe COVID-19. Our rare variant model, PULSE, enables sensitive prediction of severe disease in non-elderly patients based on whole-exome sequencing; individualized predictions are accurate independent of age and sex, and are consistent across multiple populations and cohorts. Risk stratification based on exome sequencing has the potential to facilitate post-exposure prophylaxis in at-risk individuals, potentially based around augmentation of NK cell function. Overall, our study characterizes a comprehensive genetic landscape of COVID-19 severity and provides novel insights into the molecular mechanisms of severe disease, leading to new therapeutic targets and sensitive detection of at-risk individuals.
ABSTRACT
Thromboembolism is a frequent cause of severity and mortality in COVID-19. However, the etiology of this phenomenon is not well understood. A cohort of 1,186 subjects, from the GEN-COVID consortium, infected by SARS-CoV-2 with different severity were stratified by sex and adjusted by age. Then, common coding variants from whole exome sequencing were mined by LASSO logistic regression. The homozygosity of the cell adhesion molecule P-selectin gene (SELP) rs6127 (c.1807G>A; p.Asp603Asn) which increases platelet activation is found to be associated with severity in the male subcohort of 513 subjects (Odds Ratio= 2.27, 95% Confidence Interval 1.54-3.36). As the SELP gene is downregulated by testosterone, the odd ratio is increased in males older than 50 (OR 2.42, 95% CI 1.53-3.82). Asn/Asn homozygotes have increased D-dimers values especially when associated with poly Q[≥]23 in the androgen receptor (AR) gene (OR 3.26, 95% CI 1.41-7.52). These results provide a rationale for the repurposing of antibodies against P-selectin as adjuvant therapy in rs6127 male homozygotes especially if older than 50 or with impaired AR gene. Key points{circ} The functional polymorphism rs6127 (p.Asp603Asn) in the testosterone-regulated SELP gene associates with COVID-19 severity and thrombosis. {circ}Conditions with decreased testosterone (old males), or decreased testosterone efficacy (AR gene polyQ [≥] 23) strengthen the association.
ABSTRACT
Clinical and molecular characterization by Whole Exome Sequencing (WES) is reported in 35 COVID-19 patients attending the University Hospital in Siena, Italy, from April 7 to May 7, 2020. Eighty percent of patients required respiratory assistance, half of them being on mechanical ventilation. Fiftyone percent had hepatic involvement and hyposmia was ascertained in 3 patients. Searching for common genes by collapsing methods against 150 WES of controls of the Italian population failed to give straightforward statistically significant results with the exception of two genes. This result is not unexpected since we are facing the most challenging common disorder triggered by environmental factors with a strong underlying heritability (50%). The lesson learned from Autism-Spectrum-Disorders prompted us to re-analyse the cohort treating each patient as an independent case, following a Mendelian-like model. We identified for each patient an average of 2.5 pathogenic mutations involved in virus infection susceptibility and pinpointing to one or more rare disorder(s). To our knowledge, this is the first report on WES and COVID-19. Our results suggest a combined model for COVID-19 susceptibility with a number of common susceptibility genes which represent the favorite background in which additional host private mutations may determine disease progression.
ABSTRACT
The polymorphism L412F in TLR3 has been associated with several infectious diseases. However, the mechanism underlying this association is still unexplored. Here, we show that the L412F polymorphism in TLR3 is a marker of severity in COVID-19. This association increases in the sub-cohort of males. Impaired autophagy and reduced TNF production was demonstrated in HEK293 cells transfected with TLR3-L412F plasmid and stimulated with specific agonist poly(I:C). A statistically significant reduced survival at 28 days was shown in L412F COVID-19 patients treated with the autophagy-inhibitor hydroxychloroquine (P=0.038). An increased frequency of autoimmune disorders as co-morbidity was found in L412F COVID-19 males with specific class II HLA haplotypes prone to autoantigen presentation. Our analyses indicate that L412F polymorphism makes males at risk of severe COVID-19 and provides a rationale for reinterpreting clinical trials considering autophagy pathways.
ABSTRACT
Host genetics is an emerging theme in COVID-19 and few common polymorphisms and some rare variants have been identified, either by GWAS or candidate gene approach, respectively. However, an organic model is still missing. Here, we propose a new model that takes into account common and rare germline variants applied in a cohort of 1,300 Italian SARS-CoV-2 positive individuals. Ordered logistic regression of clinical WHO grading on sex and age was used to obtain a binary phenotypic classification. Genetic variability from WES was synthesized in several boolean representations differentiated according to allele frequencies and genotype effect. LASSO logistic regression was used for extracting relevant genes. We defined about 100 common driver polymorphisms corresponding to classical "threshold model". Extracted genes were demonstrated to be gender specific. Stochastic rare more penetrant events on about additional 100 extracted genes, when occurred in a medium or severe background (common within the family), simulate Mendelian inheritance in 14% of subjects (having only 1 mutation) or oligogenic inheritance (in 10% having 2 mutations, in 11% having 3 mutations, etc). The combined effect of common and rare results can be described as an integrated polygenic score computed as: (nseverity - nmildness) + F (mseverity - mmildness) where n is the number of common driver genes, m is the number of driver rare variants and F is a factor for appropriately weighing the more powerful rare variants. We called the model "post-Mendelian". The model well describes the cohort, and patients are clustered in severe or mild by the integrated polygenic scores, the F factor being calibrated around 2, with a prediction capacity of 65% in males and 70% in females. In conclusion, this is the first comprehensive model interpreting host genetics in a holistic post-Mendelian manner. Further validations are needed in order to consolidate and refine the model which however holds true in thousands of SARS-CoV-2 Italian subjects.
ABSTRACT
Many host pathogen interactions such as human viruses (including non-SARS-coronaviruses) rely on attachment to host cell-surface glycans. There are conflicting reports about whether the Spike protein of SARS-CoV-2 binds to sialic acid commonly found on host cell-surface N-linked glycans. In the absence of a biochemical assay, the ability to analyze the binding of glycans to heavily- modified proteins and resolve this issue is limited. Classical Saturation Transfer Difference (STD) NMR can be confounded by overlapping sugar resonances that compound with known experimental constraints. Here we present universal saturation transfer analysis (uSTA), an NMR method that builds on existing approaches to provide a general and automated workflow for studying protein-ligand interactions. uSTA reveals that B-origin-lineage-SARS-CoV-2 spike trimer binds sialoside sugars in an end on manner and modelling guided by uSTA localises binding to the spike N-terminal domain (NTD). The sialylated-polylactosamine motif is found on tetraantennary human N-linked-glycoproteins in deeper lung and may have played a role in zoonosis. Provocatively, sialic acid binding is abolished by mutations in some subsequent SARS- CoV-2 variants-of-concern. A very high resolution cryo-EM structure confirms the NTD location and end on mode; it rationalises the effect of NTD mutations and the structure-activity relationship of sialic acid analogues. uSTA is demonstrated to be a robust, rapid and quantitative tool for analysis of binding, even in the most demanding systems. Extended AbstractThe surface proteins found on both pathogens and host cells mediate entry (and exit) and influence disease progression and transmission. Both types can bear host-generated post- translational modifications such as glycosylation that are essential for function but can confound biophysical methods used for dissecting key interactions. Several human viruses (including non- SARS-coronaviruses) attach to host cell-surface N-linked glycans that include forms of sialic acid (sialosides). There remains, however, conflicting evidence as to if or how SARS-associated coronaviruses might use such a mechanism. Here, we demonstrate quantitative extension of saturation transfer protein NMR methods to a complete mathematical model of the magnetization transfer caused by interactions between protein and ligand. The method couples objective resonance-identification via a deconvolution algorithm with Bloch-McConnell analysis to enable a structural, kinetic and thermodynamic analysis of ligand binding beyond previously-perceived limits of exchange rates, concentration or system. Using an automated and openly available workflow this universal saturation transfer analysis (uSTA) can be readily-applied in a range of even heavily-modified systems in a general manner to now obtain quantitative binding interaction parameters (KD, kEx). uSTA proved critical in mapping direct interactions between natural sialoside sugar ligands and relevant virus-surface attachment glycoproteins - SARS-CoV-2-spike and influenza-H1N1-haemagglutinin variants - by quantitating ligand signal in spectral regions otherwise occluded by resonances from mobile protein glycans (that also include sialosides). In B- origin-lineage-SARS-CoV-2 spike trimer end on-binding to sialoside sugars was revealed contrasting with extended surface-binding for heparin sugar ligands; uSTA-derived constraints used in structural modelling suggested sialoside-glycan binding sites in a beta-sheet-rich region of spike N-terminal domain (NTD). Consistent with this, uSTA-glycan binding was minimally- perturbed by antibodies that neutralize the ACE2-binding domain (RBD) but strongly disrupted in spike from the B1.1.7/alpha and B1.351/beta variants-of-concern, which possess hotspot mutations in the NTD. Sialoside binding in B-origin-lineage-NTD was unequivocally pinpointed by cryo-EM to a site that is created from residues that are notably deleted in variants (e.g. H69,V70,Y145 in alpha). An analysis of beneficial genetic variances in cohorts of patients from early 2020 suggests a model in which this site in the NTD of B-origin-lineage-SARS-CoV-2 (but not in alpha/beta-variants) may have exploited a specific sialylated-polylactosamine motif found on tetraantennary human N-linked-glycoproteins in deeper lung. Together these confirm a novel binding mode mediated by the unusual NTD of SARS-CoV-2 and suggest how it may drive virulence and/or zoonosis via modulation of glycan attachment. Since cell-surface glycans are widely relevant to biology and pathology, uSTA can now provide ready, quantitative, widespread analysis of complex, host-derived and post-translationally modified proteins with putative ligands relevant to disease even in previously confounding complex systems.
ABSTRACT
BackgroundCOVID-19 presentation ranges from asymptomatic to fatal. The variability in severity may be due in part to impaired Interferon type I response due to specific mutations in the host genome or to autoantibodies, explaining about 15% of the cases when combined. Exploring the host genome is thus warranted to further elucidate disease variability. MethodsWe developed a synthetic approach to genetic data representation using machine learning methods to investigate complementary genetic variability in COVID-19 infected patients that may explain disease severity, due to poly-amino acids repeat polymorphisms. Using host whole-exome sequencing data, we compared extreme phenotypic presentations (338 severe versus 300 asymptomatic cases) of the entire (men and women) Italian GEN-COVID cohort of 1178 subjects infected with SARS-CoV-2. We then applied the LASSO Logistic Regression model on Boolean gene-based representation of the poly-amino acids variability. FindingsShorter polyQ alleles ([≤]22) in the androgen receptor (AR) conferred protection against a more severe outcome in COVID-19 infection. In the subgroup of males with age <60 years, testosterone was higher in subjects with AR long-polyQ ([≥]23), possibly indicating receptor resistance (p=0.004 Mann-Whitney U test). Inappropriately low testosterone levels for the long-polyQ alleles predicted the need for intensive care in COVID-19 infected men. In agreement with the known anti-inflammatory action of testosterone, patients with long-polyQ ([≥]23) and age>60 years had increased levels of C Reactive Protein (p=0.018). InterpretationOur results may contribute to design reliable clinical and public health measures and provide a rationale to test testosterone treatment as adjuvant therapy in symptomatic COVID-19 men expressing AR polyQ longer than 23 repeats. FundingMIUR project "Dipartimenti di Eccellenza 2018-2020" to Department of Medical Biotechnologies University of Siena, Italy (Italian D.L. n.18 March 17, 2020). Private donors for COVID research and charity funds from Intesa San Paolo. BoxesO_ST_ABSEvidence before this studyC_ST_ABSWe searched on Medline, EMBASE, and Pubmed for articles published from January 2020 to August 2020 using various combinations of the search terms "sex-difference", "gender" AND SARS-Cov-2, or COVID. Epidemiological studies indicate that men and women are similarly infected by COVID-19, but the outcome is less favorable in men, independently of age. Several studies also showed that patients with hypogonadism tend to be more severely affected. A prompt intervention directed toward the most fragile subjects with SARS-Cov2 infection is currently the only strategy to reduce mortality. glucocorticoid treatment has been found cost-effective in improving the outcome of severe cases. Clinical algorithms have been proposed, but little is known on the ability of genetic profiling to predict outcome and disclose novel therapeutic strategies. Added-value of this studyIn a cohort of 1178 men and women with COVID-19, we used a supervised machine learning approach on a synthetic representation of the uncovered variability of the human genome due to poly-amino acid repeats. Comparing the genotype of patients with extreme manifestations (severe vs. asymptomatic), we found that the poly-glutamine repeat of the androgen receptor (AR) gene is relevant for COVID-19 disease and defective AR signaling identifies an association between male sex, testosterone exposure, and COVID-19 outcome. Failure of the endocrine feedback to overcome AR signaling defect by increasing testosterone levels during the infection leads to the fact that polyQ becomes dominant to T levels for the clinical outcome. Implications of all the available evidenceWe identify the first genetic polymorphism predisposing some men to develop a more severe disease irrespectively of age. Based on this, we suggest that sizing the AR poly-glutamine repeat has important implications in the diagnostic pipeline of patients affected by life-threatening COVID-19 infection. Most importantly, our studies open to the potential of using testosterone as adjuvant therapy for severe COVID-19 patients having defective androgen signaling, defined by this study as [≥]23 PolyQ repeats and inappropriate levels of circulating androgens.
ABSTRACT
The combined impact of common and rare exonic variants in COVID-19 host genetics is currently insufficiently understood. Here, common and rare variants from whole exome sequencing data of about 4,000 SARS-CoV-2-positive individuals were used to define an interpretable machine learning model for predicting COVID-19 severity. Firstly, variants were converted into separate sets of Boolean features, depending on the absence or the presence of variants in each gene. An ensemble of LASSO logistic regression models was used to identify the most informative Boolean features with respect to the genetic bases of severity. The Boolean features selected by these logistic models were combined into an Integrated PolyGenic Score that offers a synthetic and interpretable index for describing the contribution of host genetics in COVID-19 severity, as demonstrated through testing in several independent cohorts. Selected features belong to ultra-rare, rare, low-frequency, and common variants, including those in linkage disequilibrium with known GWAS loci. Noteworthly, around one quarter of the selected genes are sex-specific. Pathway analysis of the selected genes associated with COVID-19 severity reflected the multi-organ nature of the disease. The proposed model might provide useful information for developing diagnostics and therapeutics, while also being able to guide bedside disease management.
ABSTRACT
BackgroundThere is considerable variability in COVID-19 outcomes amongst younger adults--and some of this variation may be due to genetic predisposition. We characterized the clinical implications of the major genetic risk factor for COVID-19 severity, and its age-dependent effect, using individual-level data in a large international multi-centre consortium. MethodThe major common COVID-19 genetic risk factor is a chromosome 3 locus, tagged by the marker rs10490770. We combined individual level data for 13,424 COVID-19 positive patients (N=6,689 hospitalized) from 17 cohorts in nine countries to assess the association of this genetic marker with mortality, COVID-19-related complications and laboratory values. We next examined if the magnitude of these associations varied by age and were independent from known clinical COVID-19 risk factors. FindingsWe found that rs10490770 risk allele carriers experienced an increased risk of all-cause mortality (hazard ratio [HR] 1{middle dot}4, 95% confidence interval [CI] 1{middle dot}2-1{middle dot}6) and COVID-19 related mortality (HR 1{middle dot}5, 95%CI 1{middle dot}3-1{middle dot}8). Risk allele carriers had increased odds of several COVID-19 complications: severe respiratory failure (odds ratio [OR] 2{middle dot}0, 95%CI 1{middle dot}6-2{middle dot}6), venous thromboembolism (OR 1{middle dot}7, 95%CI 1{middle dot}2-2{middle dot}4), and hepatic injury (OR 1{middle dot}6, 95%CI 1{middle dot}2-2{middle dot}0). Risk allele carriers [≤] 60 years had higher odds of death or severe respiratory failure (OR 2{middle dot}6, 95%CI 1{middle dot}8-3{middle dot}9) compared to those > 60 years OR 1{middle dot}5 (95%CI 1{middle dot}3-1{middle dot}9, interaction p-value=0{middle dot}04). Amongst individuals [≤] 60 years who died or experienced severe respiratory COVID-19 outcome, we found that 31{middle dot}8% (95%CI 27{middle dot}6-36{middle dot}2) were risk variant carriers, compared to 13{middle dot}9% (95%CI 12{middle dot}6-15{middle dot}2%) of those not experiencing these outcomes. Prediction of death or severe respiratory failure among those [≤] 60 years improved when including the risk allele (AUC 0{middle dot}82 vs 0{middle dot}84, p=0{middle dot}016) and the prediction ability of rs10490770 risk allele was similar to, or better than, most established clinical risk factors. InterpretationThe major common COVID-19 risk locus on chromosome 3 is associated with increased risks of morbidity and mortality--and these are more pronounced amongst individuals [≤] 60 years. The effect on COVID-19 severity was similar to, or larger than most established risk factors, suggesting potential implications for clinical risk management. FundingFunding was obtained by each of the participating cohorts individually.
ABSTRACT
Host genetics is a key determinant of COVID-19 outcomes. Previously, the COVID-19 Host Genetics Initiative genome-wide association study used common variants to identify multiple loci associated with COVID-19 outcomes. However, variants with the largest impact on COVID-19 outcomes are expected to be rare in the population. Hence, studying rare variants may provide additional insights into disease susceptibility and pathogenesis, thereby informing therapeutics development. Here, we combined whole-exome and whole-genome sequencing from 21 cohorts across 12 countries and performed rare variant exome-wide burden analyses for COVID-19 outcomes. In an analysis of 5,085 severe disease cases and 571,737 controls, we observed that carrying a rare deleterious variant in the SARS-CoV-2 sensor toll-like receptor TLR7 (on chromosome X) was associated with a 5.3-fold increase in severe disease (95% CI: 2.75-10.05, p=5.41x10-7). This association was consistent across sexes. These results further support TLR7 as a genetic determinant of severe disease and suggest that larger studies on rare variants influencing COVID-19 outcomes could provide additional insights. Author SummaryCOVID-19 clinical outcomes vary immensely, but a patients genetic make-up is an important determinant of how they will fare against the virus. While many genetic variants commonly found in the populations were previously found to be contributing to more severe disease by the COVID-19 Host Genetics Initiative, it isnt clear if more rare variants found in less individuals could also play a role. This is important because genetic variants with the largest impact on COVID-19 severity are expected to be rarely found in the population, and these rare variants require different technologies to be studies (usually whole-exome or whole-genome sequencing). Here, we combined sequencing results from 21 cohorts across 12 countries to perform a rare variant association study. In an analysis comprising 5,085 participants with severe COVID-19 and 571,737 controls, we found that the gene for toll-like receptor 7 (TLR7) on chromosome X was an important determinant of severe COVID-19. Importantly, despite being found on a sex chromosome, this observation was consistent across both sexes.