Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 39
1.
Sci Rep ; 14(1): 1793, 2024 01 20.
Article En | MEDLINE | ID: mdl-38245528

We present an ensemble transfer learning method to predict suicide from Veterans Affairs (VA) electronic medical records (EMR). A diverse set of base models was trained to predict a binary outcome constructed from reported suicide, suicide attempt, and overdose diagnoses with varying choices of study design and prediction methodology. Each model used twenty cross-sectional and 190 longitudinal variables observed in eight time intervals covering 7.5 years prior to the time of prediction. Ensembles of seven base models were created and fine-tuned with ten variables expected to change with study design and outcome definition in order to predict suicide and combined outcome in a prospective cohort. The ensemble models achieved c-statistics of 0.73 on 2-year suicide risk and 0.83 on the combined outcome when predicting on a prospective cohort of [Formula: see text] 4.2 M veterans. The ensembles rely on nonlinear base models trained using a matched retrospective nested case-control (Rcc) study cohort and show good calibration across a diversity of subgroups, including risk strata, age, sex, race, and level of healthcare utilization. In addition, a linear Rcc base model provided a rich set of biological predictors, including indicators of suicide, substance use disorder, mental health diagnoses and treatments, hypoxia and vascular damage, and demographics.


Carcinoma, Renal Cell , Kidney Neoplasms , Veterans , Humans , Veterans/psychology , Retrospective Studies , Cross-Sectional Studies , Prospective Studies , Suicide, Attempted , Machine Learning
2.
Nat Genet ; 55(12): 2065-2074, 2023 Dec.
Article En | MEDLINE | ID: mdl-37945903

The transferability and clinical value of genetic risk scores (GRSs) across populations remain limited due to an imbalance in genetic studies across ancestrally diverse populations. Here we conducted a multi-ancestry genome-wide association study of 156,319 prostate cancer cases and 788,443 controls of European, African, Asian and Hispanic men, reflecting a 57% increase in the number of non-European cases over previous prostate cancer genome-wide association studies. We identified 187 novel risk variants for prostate cancer, increasing the total number of risk variants to 451. An externally replicated multi-ancestry GRS was associated with risk that ranged from 1.8 (per standard deviation) in African ancestry men to 2.2 in European ancestry men. The GRS was associated with a greater risk of aggressive versus non-aggressive disease in men of African ancestry (P = 0.03). Our study presents novel prostate cancer susceptibility loci and a GRS with effective risk stratification across ancestry groups.


Genetic Predisposition to Disease , Prostatic Neoplasms , Humans , Male , Black People/genetics , Genome-Wide Association Study , Hispanic or Latino/genetics , Polymorphism, Single Nucleotide , Prostatic Neoplasms/genetics , Risk Factors , White People/genetics , Asian People/genetics
3.
BMC Genomics ; 24(1): 542, 2023 Sep 13.
Article En | MEDLINE | ID: mdl-37704951

BACKGROUND: Plasmodium falciparum malaria is a leading cause of pediatric morbidity and mortality in holoendemic transmission areas. Severe malarial anemia [SMA, hemoglobin (Hb) < 5.0 g/dL in children] is the most common clinical manifestation of severe malaria in such regions. Although innate immune response genes are known to influence the development of SMA, the role of natural killer (NK) cells in malaria pathogenesis remains largely undefined. As such, we examined the impact of genetic variation in the gene encoding a primary NK cell receptor, natural cytotoxicity-triggering receptor 3 (NCR3), on the occurrence of malaria and SMA episodes over time. METHODS: Susceptibility to malaria, SMA, and all-cause mortality was determined in carriers of NCR3 genetic variants (i.e., rs2736191:C > G and rs11575837:C > T) and their haplotypes. The prospective observational study was conducted over a 36 mos. follow-up period in a cohort of children (n = 1,515, aged 1.9-40 mos.) residing in a holoendemic P. falciparum transmission region, Siaya, Kenya. RESULTS: Poisson regression modeling, controlling for anemia-promoting covariates, revealed a significantly increased risk of malaria in carriers of the homozygous mutant allele genotype (TT) for rs11575837 after multiple test correction [Incidence rate ratio (IRR) = 1.540, 95% CI = 1.114-2.129, P = 0.009]. Increased risk of SMA was observed for rs2736191 in children who inherited the CG genotype (IRR = 1.269, 95% CI = 1.009-1.597, P = 0.041) and in the additive model (presence of 1 or 2 copies) (IRR = 1.198, 95% CI = 1.030-1.393, P = 0.019), but was not significant after multiple test correction. Modeling of the haplotypes revealed that the CC haplotype had a significant additive effect for protection against SMA (i.e., reduced risk for development of SMA) after multiple test correction (IRR = 0.823, 95% CI = 0.711-0.952, P = 0.009). Although increased susceptibility to SMA was present in carriers of the GC haplotype (IRR = 1.276, 95% CI = 1.030-1.581, P = 0.026) with an additive effect (IRR = 1.182, 95% CI = 1.018-1.372, P = 0.029), the results did not remain significant after multiple test correction. None of the NCR3 genotypes or haplotypes were associated with all-cause mortality. CONCLUSIONS: Variation in NCR3 alters susceptibility to malaria and SMA during the acquisition of naturally-acquired malarial immunity. These results highlight the importance of NK cells in the innate immune response to malaria.


Anemia , Malaria, Falciparum , Malaria , Humans , Child , Anemia/genetics , Genotype , Malaria, Falciparum/genetics , Alleles , Natural Cytotoxicity Triggering Receptor 3
4.
JAMA Cardiol ; 8(6): 564-574, 2023 06 01.
Article En | MEDLINE | ID: mdl-37133828

Importance: Primary prevention of atherosclerotic cardiovascular disease (ASCVD) relies on risk stratification. Genome-wide polygenic risk scores (PRSs) are proposed to improve ASCVD risk estimation. Objective: To determine whether genome-wide PRSs for coronary artery disease (CAD) and acute ischemic stroke improve ASCVD risk estimation with traditional clinical risk factors in an ancestrally diverse midlife population. Design, Setting, and Participants: This was a prognostic analysis of incident events in a retrospectively defined longitudinal cohort conducted from January 1, 2011, to December 31, 2018. Included in the study were adults free of ASCVD and statin naive at baseline from the Million Veteran Program (MVP), a mega biobank with genetic, survey, and electronic health record data from a large US health care system. Data were analyzed from March 15, 2021, to January 5, 2023. Exposures: PRSs for CAD and ischemic stroke derived from cohorts of largely European descent and risk factors, including age, sex, systolic blood pressure, total cholesterol, high-density lipoprotein (HDL) cholesterol, smoking, and diabetes status. Main Outcomes and Measures: Incident nonfatal myocardial infarction (MI), ischemic stroke, ASCVD death, and composite ASCVD events. Results: A total of 79 151 participants (mean [SD] age, 57.8 [13.7] years; 68 503 male [86.5%]) were included in the study. The cohort included participants from the following harmonized genetic ancestry and race and ethnicity categories: 18 505 non-Hispanic Black (23.4%), 6785 Hispanic (8.6%), and 53 861 non-Hispanic White (68.0%) with a median (5th-95th percentile) follow-up of 4.3 (0.7-6.9) years. From 2011 to 2018, 3186 MIs (4.0%), 1933 ischemic strokes (2.4%), 867 ASCVD deaths (1.1%), and 5485 composite ASCVD events (6.9%) were observed. CAD PRS was associated with incident MI in non-Hispanic Black (hazard ratio [HR], 1.10; 95% CI, 1.02-1.19), Hispanic (HR, 1.26; 95% CI, 1.09-1.46), and non-Hispanic White (HR, 1.23; 95% CI, 1.18-1.29) participants. Stroke PRS was associated with incident stroke in non-Hispanic White participants (HR, 1.15; 95% CI, 1.08-1.21). A combined CAD plus stroke PRS was associated with ASCVD deaths among non-Hispanic Black (HR, 1.19; 95% CI, 1.03-1.17) and non-Hispanic (HR, 1.11; 95% CI, 1.03-1.21) participants. The combined PRS was also associated with composite ASCVD across all ancestry groups but greater among non-Hispanic White (HR, 1.20; 95% CI, 1.16-1.24) than non-Hispanic Black (HR, 1.11; 95% CI, 1.05-1.17) and Hispanic (HR, 1.12; 95% CI, 1.00-1.25) participants. Net reclassification improvement from adding PRS to a traditional risk model was modest for the intermediate risk group for composite CVD among men (5-year risk >3.75%, 0.38%; 95% CI, 0.07%-0.68%), among women, (6.79%; 95% CI, 3.01%-10.58%), for age older than 55 years (0.25%; 95% CI, 0.03%-0.47%), and for ages 40 to 55 years (1.61%; 95% CI, -0.07% to 3.30%). Conclusions and Relevance: Study results suggest that PRSs derived predominantly in European samples were statistically significantly associated with ASCVD in the multiancestry midlife and older-age MVP cohort. Overall, modest improvement in discrimination metrics were observed with addition of PRSs to traditional risk factors with greater magnitude in women and younger age groups.


Atherosclerosis , Cardiovascular Diseases , Coronary Artery Disease , Ischemic Stroke , Myocardial Infarction , Stroke , Veterans , Adult , Humans , Male , Female , Middle Aged , Cardiovascular Diseases/epidemiology , Cardiovascular Diseases/genetics , Retrospective Studies , Risk Assessment/methods , Risk Factors , Coronary Artery Disease/epidemiology , Coronary Artery Disease/genetics , Atherosclerosis/epidemiology , Myocardial Infarction/epidemiology , Stroke/epidemiology , Cholesterol
5.
Eur Urol ; 84(1): 13-21, 2023 07.
Article En | MEDLINE | ID: mdl-36872133

BACKGROUND: Genetic factors play an important role in prostate cancer (PCa) susceptibility. OBJECTIVE: To discover common genetic variants contributing to the risk of PCa in men of African ancestry. DESIGN, SETTING, AND PARTICIPANTS: We conducted a meta-analysis of ten genome-wide association studies consisting of 19378 cases and 61620 controls of African ancestry. OUTCOME MEASUREMENTS AND STATISTICAL ANALYSIS: Common genotyped and imputed variants were tested for their association with PCa risk. Novel susceptibility loci were identified and incorporated into a multiancestry polygenic risk score (PRS). The PRS was evaluated for associations with PCa risk and disease aggressiveness. RESULTS AND LIMITATIONS: Nine novel susceptibility loci for PCa were identified, of which seven were only found or substantially more common in men of African ancestry, including an African-specific stop-gain variant in the prostate-specific gene anoctamin 7 (ANO7). A multiancestry PRS of 278 risk variants conferred strong associations with PCa risk in African ancestry studies (odds ratios [ORs] >3 and >5 for men in the top PRS decile and percentile, respectively). More importantly, compared with men in the 40-60% PRS category, men in the top PRS decile had a significantly higher risk of aggressive PCa (OR = 1.23, 95% confidence interval = 1.10-1.38, p = 4.4 × 10-4). CONCLUSIONS: This study demonstrates the importance of large-scale genetic studies in men of African ancestry for a better understanding of PCa susceptibility in this high-risk population and suggests a potential clinical utility of PRS in differentiating between the risks of developing aggressive and nonaggressive disease in men of African ancestry. PATIENT SUMMARY: In this large genetic study in men of African ancestry, we discovered nine novel prostate cancer (PCa) risk variants. We also showed that a multiancestry polygenic risk score was effective in stratifying PCa risk, and was able to differentiate risk of aggressive and nonaggressive disease.


Genetic Predisposition to Disease , Prostatic Neoplasms , Male , Humans , Genome-Wide Association Study , Prostatic Neoplasms/genetics , Prostatic Neoplasms/epidemiology , Risk Factors , Black People/genetics
6.
PLoS Genet ; 19(3): e1010623, 2023 03.
Article En | MEDLINE | ID: mdl-36940203

Suicidal ideation (SI) often precedes and predicts suicide attempt and death, is the most common suicidal phenotype and is over-represented in veterans. The genetic architecture of SI in the absence of suicide attempt (SA) is unknown, yet believed to have distinct and overlapping risk with other suicidal behaviors. We performed the first GWAS of SI without SA in the Million Veteran Program (MVP), identifying 99,814 SI cases from electronic health records without a history of SA or suicide death (SD) and 512,567 controls without SI, SA or SD. GWAS was performed separately in the four largest ancestry groups, controlling for sex, age and genetic substructure. Ancestry-specific results were combined via meta-analysis to identify pan-ancestry loci. Four genome-wide significant (GWS) loci were identified in the pan-ancestry meta-analysis with loci on chromosomes 6 and 9 associated with suicide attempt in an independent sample. Pan-ancestry gene-based analysis identified GWS associations with DRD2, DCC, FBXL19, BCL7C, CTF1, ANNK1, and EXD3. Gene-set analysis implicated synaptic and startle response pathways (q's<0.05). European ancestry (EA) analysis identified GWS loci on chromosomes 6 and 9, as well as GWS gene associations in EXD3, DRD2, and DCC. No other ancestry-specific GWS results were identified, underscoring the need to increase representation of diverse individuals. The genetic correlation of SI and SA within MVP was high (rG = 0.87; p = 1.09e-50), as well as with post-traumatic stress disorder (PTSD; rG = 0.78; p = 1.98e-95) and major depressive disorder (MDD; rG = 0.78; p = 8.33e-83). Conditional analysis on PTSD and MDD attenuated most pan-ancestry and EA GWS signals for SI without SA to nominal significance, with the exception of EXD3 which remained GWS. Our novel findings support a polygenic and complex architecture for SI without SA which is largely shared with SA and overlaps with psychiatric conditions frequently comorbid with suicidal behaviors.


Depressive Disorder, Major , Veterans , Humans , Suicidal Ideation , Veterans/psychology , Genome-Wide Association Study , Depressive Disorder, Major/genetics , Suicide, Attempted/psychology , Risk Factors
7.
JAMA Psychiatry ; 80(2): 135-145, 2023 02 01.
Article En | MEDLINE | ID: mdl-36515925

Importance: Suicide is a leading cause of death; however, the molecular genetic basis of suicidal thoughts and behaviors (SITB) remains unknown. Objective: To identify novel, replicable genomic risk loci for SITB. Design, Setting, and Participants: This genome-wide association study included 633 778 US military veterans with and without SITB, as identified through electronic health records. GWAS was performed separately by ancestry, controlling for sex, age, and genetic substructure. Cross-ancestry risk loci were identified through meta-analysis. Study enrollment began in 2011 and is ongoing. Data were analyzed from November 2021 to August 2022. Main Outcome and Measures: SITB. Results: A total of 633 778 US military veterans were included in the analysis (57 152 [9%] female; 121 118 [19.1%] African ancestry, 8285 [1.3%] Asian ancestry, 452 767 [71.4%] European ancestry, and 51 608 [8.1%] Hispanic ancestry), including 121 211 individuals with SITB (19.1%). Meta-analysis identified more than 200 GWS (P < 5 × 10-8) cross-ancestry risk single-nucleotide variants for SITB concentrated in 7 regions on chromosomes 2, 6, 9, 11, 14, 16, and 18. Top single-nucleotide variants were largely intronic in nature; 5 were independently replicated in ISGC, including rs6557168 in ESR1, rs12808482 in DRD2, rs77641763 in EXD3, rs10671545 in DCC, and rs36006172 in TRAF3. Associations for FBXL19 and AC018880.2 were not replicated. Gene-based analyses implicated 24 additional GWS cross-ancestry risk genes, including FURIN, TSNARE1, and the NCAM1-TTC12-ANKK1-DRD2 gene cluster. Cross-ancestry enrichment analyses revealed significant enrichment for expression in brain and pituitary tissue, synapse and ubiquitination processes, amphetamine addiction, parathyroid hormone synthesis, axon guidance, and dopaminergic pathways. Seven other unique European ancestry-specific GWS loci were identified, 2 of which (POM121L2 and METTL15/LINC02758) were replicated. Two additional GWS ancestry-specific loci were identified within the African ancestry (PET112/GATB) and Hispanic ancestry (intergenic locus on chromosome 4) subsets, both of which were replicated. No GWS loci were identified within the Asian ancestry subset; however, significant enrichment was observed for axon guidance, cyclic adenosine monophosphate signaling, focal adhesion, glutamatergic synapse, and oxytocin signaling pathways across all ancestries. Within the European ancestry subset, genetic correlations (r > 0.75) were observed between the SITB phenotype and a suicide attempt-only phenotype, depression, and posttraumatic stress disorder. Additionally, polygenic risk score analyses revealed that the Million Veteran Program polygenic risk score had nominally significant main effects in 2 independent samples of veterans of European and African ancestry. Conclusions and Relevance: The findings of this analysis may advance understanding of the molecular genetic basis of SITB and provide evidence for ESR1, DRD2, TRAF3, and DCC as cross-ancestry candidate risk genes. More work is needed to replicate these findings and to determine if and how these genes might impact clinical care.


Veterans , Humans , Female , Male , Suicidal Ideation , Genome-Wide Association Study , TNF Receptor-Associated Factor 3/genetics , Genetic Loci/genetics , Nucleotides , Polymorphism, Single Nucleotide/genetics , Genetic Predisposition to Disease/genetics , Proteins , Protein Serine-Threonine Kinases/genetics
8.
BMJ Open ; 12(12): e064135, 2022 12 23.
Article En | MEDLINE | ID: mdl-36564105

OBJECTIVES: To evaluate the benefits of vaccination on the case fatality rate (CFR) for COVID-19 infections. DESIGN, SETTING AND PARTICIPANTS: The US Department of Veterans Affairs has 130 medical centres. We created multivariate models from these data-339 772 patients with COVID-19-as of 30 September 2021. OUTCOME MEASURES: The primary outcome for all models was death within 60 days of the diagnosis. Logistic regression was used to derive adjusted ORs for vaccination and infection with Delta versus earlier variants. Models were adjusted for confounding factors, including demographics, comorbidity indices and novel parameters representing prior diagnoses, vital signs/baseline laboratory tests and outpatient treatments. Patients with a Delta infection were divided into eight cohorts based on the time from vaccination to diagnosis. A common model was used to estimate the odds of death associated with vaccination for each cohort relative to that of unvaccinated patients. RESULTS: 9.1% of subjects were vaccinated. 21.5% had the Delta variant. 18 120 patients (5.33%) died within 60 days of their diagnoses. The adjusted OR for a Delta infection was 1.87±0.05, which corresponds to a relative risk (RR) of 1.78. The overall adjusted OR for prior vaccination was 0.280±0.011 corresponding to an RR of 0.291. Raw CFR rose steadily after 10-14 weeks. The OR for vaccination remained stable for 10-34 weeks. CONCLUSIONS: Our CFR model controls for the severity of confounding factors and priority of vaccination, rather than solely using the presence of comorbidities. Our results confirm that Delta was more lethal than earlier variants and that vaccination is an effective means of preventing death. After adjusting for major selection biases, we found no evidence that the benefits of vaccination on CFR declined over 34 weeks. We suggest that this model can be used to evaluate vaccines designed for emerging variants.


COVID-19 , Hepatitis D , Veterans , Humans , COVID-19/prevention & control , SARS-CoV-2 , Vaccination
9.
Sci Rep ; 12(1): 17821, 2022 10 24.
Article En | MEDLINE | ID: mdl-36280773

In recent years, data-driven, deep-learning-based models have shown great promise in medical risk prediction. By utilizing the large-scale Electronic Health Record data found in the U.S. Department of Veterans Affairs, the largest integrated healthcare system in the United States, we have developed an automated, personalized risk prediction model to support the clinical decision-making process for localized prostate cancer patients. This method combines the representative power of deep learning and the analytical interpretability of parametric regression models and can implement both time-dependent and static input data. To collect a comprehensive evaluation of model performances, we calculate time-dependent C-statistics [Formula: see text] over 2-, 5-, and 10-year time horizons using either a composite outcome or prostate cancer mortality as the target event. The composite outcome combines the Prostate-Specific Antigen (PSA) test, metastasis, and prostate cancer mortality. Our longitudinal model Recurrent Deep Survival Machine (RDSM) achieved [Formula: see text] 0.85 (0.83), 0.80 (0.83), and 0.76 (0.81), while the cross-sectional model Deep Survival Machine (DSM) attained [Formula: see text] 0.85 (0.82), 0.80 (0.82), and 0.76 (0.79) for the 2-, 5-, and 10-year composite (mortality) outcomes, respectively. In addition to estimating the survival probability, our method can quantify the uncertainty associated with the prediction. The uncertainty scores show a consistent correlation with the prediction accuracy. We find PSA and prostate cancer stage information are the most important indicators in risk prediction. Our work demonstrates the utility of the data-driven machine learning model in prostate cancer risk prediction, which can play a critical role in the clinical decision system.


Deep Learning , Prostatic Neoplasms , Male , Humans , United States , Prostate-Specific Antigen , Cross-Sectional Studies , Prostatic Neoplasms/pathology , Survival Analysis
10.
Front Genet ; 13: 977810, 2022.
Article En | MEDLINE | ID: mdl-36186473

Background: Severe malarial anemia (SMA; Hb < 5.0 g/dl) is a leading cause of childhood morbidity and mortality in holoendemic Plasmodium falciparum transmission regions such as western Kenya. Methods: We investigated the relationship between two novel complement component 5 (C5) missense mutations [rs17216529:C>T, p(Val145Ile) and rs17610:C>T, p(Ser1310Asn)] and longitudinal outcomes of malaria in a cohort of Kenyan children (under 60 mos, n = 1,546). Molecular modeling was used to investigate the impact of the amino acid transitions on the C5 protein structure. Results: Prediction of the wild-type and mutant C5 protein structures did not reveal major changes to the overall structure. However, based on the position of the variants, subtle differences could impact on the stability of C5b. The influence of the C5 genotypes/haplotypes on the number of malaria and SMA episodes over 36 months was determined by Poisson regression modeling. Genotypic analyses revealed that inheritance of the homozygous mutant (TT) for rs17216529:C>T enhanced the risk for both malaria (incidence rate ratio, IRR = 1.144, 95%CI: 1.059-1.236, p = 0.001) and SMA (IRR = 1.627, 95%CI: 1.201-2.204, p = 0.002). In the haplotypic model, carriers of TC had increased risk of malaria (IRR = 1.068, 95%CI: 1.017-1.122, p = 0.009), while carriers of both wild-type alleles (CC) were protected against SMA (IRR = 0.679, 95%CI: 0.542-0.850, p = 0.001). Conclusion: Collectively, these findings show that the selected C5 missense mutations influence the longitudinal risk of malaria and SMA in immune-naïve children exposed to holoendemic P. falciparum transmission through a mechanism that remains to be defined.

11.
J Am Med Inform Assoc ; 29(10): 1737-1743, 2022 09 12.
Article En | MEDLINE | ID: mdl-35920306

The predictive modeling literature for biomedical applications is dominated by biostatistical methods for survival analysis, and more recently some out of the box machine learning approaches. In this article, we show a presentation of a machine learning method appropriate for time-to-event modeling in the area of prostate cancer long-term disease progression. Using XGBoost adapted to long-term disease progression, we developed a predictive model for 118 788 patients with localized prostate cancer at diagnosis from the Department of Veterans Affairs (VA). Our model accounted for patient censoring. Harrell's c-index for our model using only features available at the time of diagnosis was 0.757 95% confidence interval [0.756, 0.757]. Our results show that machine learning methods like XGBoost can be adapted to use accelerated failure time (AFT) with censoring to model long-term risk of disease progression. The long median survival justifies and requires censoring. Overall, we show that an existing machine learning approach can be used for AFT outcome modeling in prostate cancer, and more generally for other chronic diseases with long observation times.


Biomedical Research , Prostatic Neoplasms , Disease Progression , Humans , Machine Learning , Male , Prostatic Neoplasms/diagnosis , Survival Analysis
12.
Trop Med Health ; 50(1): 41, 2022 Jun 25.
Article En | MEDLINE | ID: mdl-35752805

Plasmodium falciparum infections remain among the leading causes of morbidity and mortality in holoendemic transmission areas. Located within region 5q31.1, the colony-stimulating factor 2 gene (CSF2) encodes granulocyte-macrophage colony-stimulating factor (GM-CSF), a hematopoietic growth factor that mediates host immune responses. Since the effect of CSF2 variation on malaria pathogenesis remains unreported, we investigated the impact of two genetic variants in the 5q31.1 gene region flanking CSF2:g-7032 G > A (rs168681:G > A) and CSF2:g.64544T > C (rs246835:T > C) on the rate and timing of malaria and severe malarial anemia (SMA, Hb < 5.0 g/dL) episodes over 36 months of follow-up. Children (n = 1654, aged 2-70 months) were recruited from a holoendemic P. falciparum transmission area of western Kenya. Decreased incidence rate ratio (IRR) for malaria was conferred by inheritance of the CSF2:g.64544 TC genotype (P = 0.0277) and CSF2 AC/GC diplotype (P = 0.0015). Increased IRR for malaria was observed in carriers of the CSF2 AT/GC diplotype (P = 0.0237), while the inheritance of the CSF2 AT haplotype increased the IRR for SMA (P = 0.0166). A model estimating the longitudinal risk of malaria showed decreased hazard rates among CSF2 AC haplotype carriers (P = 0.0045). Investigation of all-cause mortality revealed that inheritance of the GA genotype at CSF2:g-7032 increased the risk of mortality (P = 0.0315). Higher risk of SMA and all-cause mortality were observed in younger children (P < 0.0001 and P = 0.0015), HIV-1(+) individuals (P < 0.0001 and P < 0.0001), and carriers of HbSS (P = 0.0342 and P = 0.0019). Results from this holoendemic P. falciparum area show that variation in gene region 5q31.1 influences susceptibility to malaria, SMA, and mortality, as does age, HIV-1 status, and inheritance of HbSS.

13.
Curr Res Struct Biol ; 4: 220-230, 2022.
Article En | MEDLINE | ID: mdl-35765663

SARS-CoV-2 is the virus responsible for the COVID-19 pandemic and catastrophic, worldwide health and economic impacts. The spike protein on the viral surface is responsible for viral entry into the host cell. The binding of spike protein to the host cell receptor ACE2 is the first step leading to fusion of the host and viral membranes. Despite the vast amount of structure data that has been generated for the spike protein of SARS-CoV-2, many of the detailed structures of the spike protein in different stages of the fusion pathway are unknown, leaving a wealth of potential drug-target space unexplored. The atomic-scale structure of the complete S2 segment, as well as the complete fusion intermediate are also unknown and represent major gaps in our knowledge of the infectious pathway of SAR-CoV-2. The conformational changes of the spike protein during this process are similarly not well understood. Here we present structures of the spike protein at different stages of the fusion process. With the transitions being a necessary step before the receptor binding, we propose sites along the transition pathways as potential targets for drug development.

14.
J Biomed Inform ; 130: 104084, 2022 06.
Article En | MEDLINE | ID: mdl-35533991

Analysis of longitudinal Electronic Health Record (EHR) data is an important goal for precision medicine. Difficulty in applying Machine Learning (ML) methods, either predictive or unsupervised, stems in part from the heterogeneity and irregular sampling of EHR data. We present an unsupervised probabilistic model that captures nonlinear relationships between variables over continuous-time. This method works with arbitrary sampling patterns and captures the joint probability distribution between variable measurements and the time intervals between them. Inference algorithms are derived that can be used to evaluate the likelihood of future using under a trained model. As an example, we consider data from the United States Veterans Health Administration (VHA) in the areas of diabetes and depression. Likelihood ratio maps are produced showing the likelihood of risk for moderate-severe vs minimal depression as measured by the Patient Health Questionnaire-9 (PHQ-9).


Electronic Health Records , Machine Learning , Algorithms , Humans , Models, Statistical , Probability
15.
Mol Psychiatry ; 27(4): 2264-2272, 2022 04.
Article En | MEDLINE | ID: mdl-35347246

To identify pan-ancestry and ancestry-specific loci associated with attempting suicide among veterans, we conducted a genome-wide association study (GWAS) of suicide attempts within a large, multi-ancestry cohort of U.S. veterans enrolled in the Million Veterans Program (MVP). Cases were defined as veterans with a documented history of suicide attempts in the electronic health record (EHR; N = 14,089) and controls were defined as veterans with no documented history of suicidal thoughts or behaviors in the EHR (N = 395,064). GWAS was performed separately in each ancestry group, controlling for sex, age and genetic substructure. Pan-ancestry risk loci were identified through meta-analysis and included two genome-wide significant loci on chromosomes 20 (p = 3.64 × 10-9) and 1 (p = 3.69 × 10-8). A strong pan-ancestry signal at the Dopamine Receptor D2 locus (p = 1.77 × 10-7) was also identified and subsequently replicated in a large, independent international civilian cohort (p = 7.97 × 10-4). Additionally, ancestry-specific genome-wide significant loci were also detected in African-Americans, European-Americans, Asian-Americans, and Hispanic-Americans. Pathway analyses suggested over-representation of many biological pathways with high clinical significance, including oxytocin signaling, glutamatergic synapse, cortisol synthesis and secretion, dopaminergic synapse, and circadian rhythm. These findings confirm that the genetic architecture underlying suicide attempt risk is complex and includes both pan-ancestry and ancestry-specific risk loci. Moreover, pathway analyses suggested many commonly impacted biological pathways that could inform development of improved therapeutics for suicide prevention.


Genome-Wide Association Study , Veterans , Black or African American/genetics , Genetic Loci , Genetic Predisposition to Disease/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Suicide, Attempted , White People/genetics
16.
Biochem Biophys Rep ; 29: 101207, 2022 Mar.
Article En | MEDLINE | ID: mdl-35071802

Plasmodium falciparum (Pf) malaria is among the leading causes of childhood morbidity and mortality worldwide. During a natural infection, ingestion of the malarial parasite product, hemozoin (PfHz), by circulating phagocytic cells induces dysregulation in innate immunity and enhances malaria pathogenesis. Treatment of cultured peripheral blood mononuclear cells (PBMCs) from healthy, malaria-naïve donors with physiological concentrations of PfHz can serve as an in vitro model to investigate cellular processes. Although disruptions in host ubiquitination processes are central to the pathogenesis of many diseases, this system remains unexplored in malaria. As such, we investigated the impact of PfHz on the temporal expression patterns of 84 genes involved in ubiquitination processes. Donor PBMCs were cultured in the absence or presence of PfHz for 3-, 9-, and 24 h. Stimulation with PfHz for 3 h did not significantly alter gene expression. Incubation for 9 h, however, elicited significant changes for 6 genes: 4 were down-regulated (FBXO4, NEDD8, UBE2E3, and UBE2W) and 2 were up-regulated (HERC5 and UBE2J1). PfHz treatment for 24 h significantly altered expression for 14 genes: 12 were down-regulated (ANAPC11, BRCC3, CUL4B, FBXO4, MIB1, SKP2, TP53, UBA2, UBA3, UBE2G1, UBE2G2, and WWP1), while 2 were up-regulated (UBE2J1 and UBE2Z). Collectively, these results demonstrate that phagocytosis of PfHz by PBMCs elicits temporal changes in the transcriptional profiles of genes central to host ubiquitination processes. Results presented here suggest that disruptions in ubiquitination may be a previously undiscovered feature of malaria pathogenesis.

17.
Exp Biol Med (Maywood) ; 247(8): 672-682, 2022 04.
Article En | MEDLINE | ID: mdl-34842470

Severe malarial anemia (SMA) is a leading cause of childhood morbidity and mortality in holoendemic Plasmodium falciparum transmission regions. To gain enhanced understanding of predisposing factors for SMA, we explored the relationship between complement component 3 (C3) missense mutations [rs2230199 (2307C>G, Arg>Gly102) and rs11569534 (34420G>A, Gly>Asp1224)], malaria, and SMA in a cohort of children (n = 1617 children) over 36 months of follow-up. Variants were selected based on their ability to impart amino acid substitutions that can alter the structure and function of C3. The 2307C>G mutation results in a basic to a polar residue change (Arg to Gly) at position 102 (ß-chain) in the macroglobulin-1 (MG1) domain, while 34420G>A elicits a polar to acidic residue change (Gly to Asp) at position 1224 (α-chain) in the thioester-containing domain. After adjusting for multiple comparisons, longitudinal analyses revealed that inheritance of the homozygous mutant (GG) at 2307 enhanced the risk of SMA (RR = 2.142, 95%CI: 1.229-3.735, P = 0.007). The haplotype containing both wild-type alleles (CG) decreased the incident risk ratio of both malaria (RR = 0.897, 95%CI: 0.828-0.972, P = 0.008) and SMA (RR = 0.617, 95%CI: 0.448-0.848, P = 0.003). Malaria incident risk ratio was also reduced in carriers of the GG (Gly102Gly1224) haplotype (RR = 0.941, 95%CI: 0.888-0.997, P = 0.040). Collectively, inheritance of the missense mutations in MG1 and thioester-containing domain influence the longitudinal risk of malaria and SMA in children exposed to intense Plasmodium falciparum transmission.


Anemia , Complement C3 , Malaria, Falciparum , Anemia/genetics , Anemia/parasitology , Child , Complement C3/genetics , Genetic Predisposition to Disease , Humans , Malaria, Falciparum/complications , Malaria, Falciparum/genetics , Mutation , Plasmodium falciparum
18.
Front Genet ; 12: 764759, 2021.
Article En | MEDLINE | ID: mdl-34880904

Background: Malaria remains one of the leading global causes of childhood morbidity and mortality. In holoendemic Plasmodium falciparum transmission regions, such as western Kenya, severe malarial anemia [SMA, hemoglobin (Hb) < 6.0 g/dl] is the primary form of severe disease. Ubiquitination is essential for regulating intracellular processes involved in innate and adaptive immunity. Although dysregulation in ubiquitin molecular processes is central to the pathogenesis of multiple human diseases, the expression patterns of ubiquitination genes in SMA remain unexplored. Methods: To examine the role of the ubiquitination processes in pathogenesis of SMA, differential gene expression profiles were determined in Kenyan children (n = 44, aged <48 mos) with either mild malarial anemia (MlMA; Hb ≥9.0 g/dl; n = 23) or SMA (Hb <6.0 g/dl; n = 21) using the Qiagen Human Ubiquitination Pathway RT2 Profiler PCR Array containing a set of 84 human ubiquitination genes. Results: In children with SMA, 10 genes were down-regulated (BRCC3, FBXO3, MARCH5, RFWD2, SMURF2, UBA6, UBE2A, UBE2D1, UBE2L3, UBR1), and five genes were up-regulated (MDM2, PARK2, STUB1, UBE2E3, UBE2M). Enrichment analyses revealed Ubiquitin-Proteasomal Proteolysis as the top disrupted process, along with altered sub-networks involved in proteasomal, protein, and ubiquitin-dependent catabolic processes. Conclusion: Collectively, these novel results show that protein coding genes of the ubiquitination processes are involved in the pathogenesis of SMA.

20.
Math Biosci Eng ; 18(2): 1529-1549, 2021 01 28.
Article En | MEDLINE | ID: mdl-33757197

We develop and analyze a stage-progression compartmental model to study the emerging invasive nontyphoidal Salmonella (iNTS) epidemic in sub-Saharan Africa. iNTS bloodstream infections are often fatal, and the diverse and non-specific clinical features of iNTS make it difficult to diagnose. We focus our study on identifying approaches that can reduce the incidence of new infections. In sub-Saharan Africa, transmission and mortality are correlated with the ongoing HIV epidemic and severe malnutrition. We use our model to quantify the impact that increasing antiretroviral therapy (ART) for HIV infected adults and reducing malnutrition in children would have on mortality from iNTS in the population. We consider immunocompromised subpopulations in the region with major risk factors for mortality, such as malaria and malnutrition among children and HIV infection and ART coverage in both children and adults. We parameterize the progression rates between infection stages using the branching probabilities and estimated time spent at each stage. We interpret the basic reproduction number R0 as the total contribution from an infinite infection loop produced by the asymptomatic carriers in the infection chain. The results indicate that the asymptomatic HIV+ adults without ART serve as the driving force of infection for the iNTS epidemic. We conclude that the worst disease outcome is among the pediatric population, which has the highest infection rates and death counts. Our sensitivity analysis indicates that the most effective strategies to reduce iNTS mortality in the studied population are to improve the ART coverage among high-risk HIV+ adults and reduce malnutrition among children.


Epidemics , HIV Infections , Salmonella Infections , Adult , Africa South of the Sahara/epidemiology , Child , HIV Infections/drug therapy , HIV Infections/epidemiology , Humans , Salmonella , Salmonella Infections/drug therapy , Salmonella Infections/epidemiology
...