Search | VHL Regional Portal

1.

Genome-Wide Association Study of Treatment-Resistant Depression: Shared Biology With Metabolic Traits.

Kang, JooEun; Castro, Victor M; Ripperger, Michael; Venkatesh, Sanan; Burstein, David; Linnér, Richard Karlsson; Rocha, Daniel B; Hu, Yirui; Wilimitis, Drew; Morley, Theodore; Han, Lide; Kim, Rachel Youngjung; Feng, Yen-Chen Anne; Ge, Tian; Heckers, Stephan; Voloudakis, Georgios; Chabris, Christopher; Roussos, Panos; McCoy, Thomas H; Walsh, Colin G; Perlis, Roy H; Ruderfer, Douglas M.

Am J Psychiatry ; : appiajp20230247, 2024 May 15.

Article in English | MEDLINE | ID: mdl-38745458

ABSTRACT

OBJECTIVE: Treatment-resistant depression (TRD) occurs in roughly one-third of all individuals with major depressive disorder (MDD). Although research has suggested a significant common variant genetic component of liability to TRD, with heritability estimated at 8% when compared with non-treatment-resistant MDD, no replicated genetic loci have been identified, and the genetic architecture of TRD remains unclear. A key barrier to this work has been the paucity of adequately powered cohorts for investigation, largely because of the challenge in prospectively investigating this phenotype. The objective of this study was to perform a well-powered genetic study of TRD. METHODS: Using receipt of electroconvulsive therapy (ECT) as a surrogate for TRD, the authors applied standard machine learning methods to electronic health record data to derive predicted probabilities of receiving ECT. These probabilities were then applied as a quantitative trait in a genome-wide association study of 154,433 genotyped patients across four large biobanks. RESULTS: Heritability estimates ranged from 2% to 4.2%, and significant genetic overlap was observed with cognition, attention deficit hyperactivity disorder, schizophrenia, alcohol and smoking traits, and body mass index. Two genome-wide significant loci were identified, both previously implicated in metabolic traits, suggesting shared biology and potential pharmacological implications. CONCLUSIONS: This work provides support for the utility of estimation of disease probability for genomic investigation and provides insights into the genetic architecture and biology of TRD.

2.

Validation of an ICD-Code-Based Case Definition for Psychotic Illness Across Three Health Systems.

Deo, Anthony J; Castro, Victor M; Baker, Ashley; Carroll, Devon; Gonzalez-Heydrich, Joseph; Henderson, David C; Holt, Daphne J; Hook, Kimberly; Karmacharya, Rakesh; Roffman, Joshua L; Madsen, Emily M; Song, Eugene; Adams, William G; Camacho, Luisa; Gasman, Sarah; Gibbs, Jada S; Fortgang, Rebecca G; Kennedy, Chris J; Lozinski, Galina; Perez, Daisy C; Wilson, Marina; Reis, Ben Y; Smoller, Jordan W.

Schizophr Bull ; 2024 May 10.

Article in English | MEDLINE | ID: mdl-38728421

ABSTRACT

BACKGROUND AND HYPOTHESIS: Psychosis-associated diagnostic codes are increasingly being utilized as case definitions for electronic health record (EHR)-based algorithms to predict and detect psychosis. However, data on the validity of psychosis-related diagnostic codes is limited. We evaluated the positive predictive value (PPV) of International Classification of Diseases (ICD) codes for psychosis. STUDY DESIGN: Using EHRs at 3 health systems, ICD codes comprising primary psychotic disorders and mood disorders with psychosis were grouped into 5 higher-order groups. 1133 records were sampled for chart review using the full EHR. PPVs (the probability of chart-confirmed psychosis given ICD psychosis codes) were calculated across multiple treatment settings. STUDY RESULTS: PPVs across all diagnostic groups and hospital systems exceeded 70%: Mass General Brigham 0.72 [95% CI 0.68-0.77], Boston Children's Hospital 0.80 [0.75-0.84], and Boston Medical Center 0.83 [0.79-0.86]. Schizoaffective disorder PPVs were consistently the highest across sites (0.80-0.92) and major depressive disorder with psychosis were the most variable (0.57-0.79). To determine if the first documented code captured first-episode psychosis (FEP), we excluded cases with prior chart evidence of a diagnosis of or treatment for a psychotic illness, yielding substantially lower PPVs (0.08-0.62). CONCLUSIONS: We found that the first documented psychosis diagnostic code accurately captured true episodes of psychosis but was a poor index of FEP. These data have important implications for the case definitions used in the development of risk prediction models designed to predict or detect undiagnosed psychosis.
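The PPV defined in this abstract (chart-confirmed cases divided by records carrying an ICD psychosis code) can be sketched in a few lines of Python. This is only an illustration: the abstract does not say how its confidence intervals were computed, so the Wilson score interval used here is an assumption, and the counts in the example (80 confirmed out of 100 reviewed) are hypothetical, not taken from the study.

```python
import math


def ppv_with_wilson_ci(confirmed, flagged, z=1.96):
    """Return (ppv, lower, upper) for `confirmed` true cases among `flagged` coded records.

    The Wilson score interval is an assumed choice; the study's own
    interval method is not stated in the abstract.
    """
    if flagged <= 0 or not 0 <= confirmed <= flagged:
        raise ValueError("need 0 <= confirmed <= flagged and flagged > 0")
    p = confirmed / flagged
    denom = 1 + z ** 2 / flagged
    centre = (p + z ** 2 / (2 * flagged)) / denom
    half_width = z * math.sqrt(p * (1 - p) / flagged + z ** 2 / (4 * flagged ** 2)) / denom
    return p, centre - half_width, centre + half_width


# Hypothetical chart-review counts, for illustration only.
ppv, lower, upper = ppv_with_wilson_ci(confirmed=80, flagged=100)
print(f"PPV {ppv:.2f} [95% CI {lower:.2f}-{upper:.2f}]")
```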

3.

Validation of an ICD-code-based case definition for psychotic illness across three health systems.

Deo, Anthony J; Castro, Victor M; Baker, Ashley; Carroll, Devon; Gonzalez-Heydrich, Joseph; Henderson, David C; Holt, Daphne J; Hook, Kimberly; Karmacharya, Rakesh; Roffman, Joshua L; Madsen, Emily M; Song, Eugene; Adams, William G; Camacho, Luisa; Gasman, Sarah; Gibbs, Jada S; Fortgang, Rebecca G; Kennedy, Chris J; Lozinski, Galina; Perez, Daisy C; Wilson, Marina; Reis, Ben Y; Smoller, Jordan W.

medRxiv ; 2024 Feb 29.

Article in English | MEDLINE | ID: mdl-38464074

ABSTRACT

Background and Hypothesis: Early detection of psychosis is critical for improving outcomes. Algorithms to predict or detect psychosis using electronic health record (EHR) data depend on the validity of the case definitions used, typically based on diagnostic codes. Data on the validity of psychosis-related diagnostic codes is limited. We evaluated the positive predictive value (PPV) of International Classification of Diseases (ICD) codes for psychosis. Study Design: Using EHRs at three health systems, ICD codes comprising primary psychotic disorders and mood disorders with psychosis were grouped into five higher-order groups. 1,133 records were sampled for chart review using the full EHR. PPVs (the probability of chart-confirmed psychosis given ICD psychosis codes) were calculated across multiple treatment settings. Study Results: PPVs across all diagnostic groups and hospital systems exceeded 70%: Massachusetts General Brigham 0.72 [95% CI 0.68-0.77], Boston Children's Hospital 0.80 [0.75-0.84], and Boston Medical Center 0.83 [0.79-0.86]. Schizoaffective disorder PPVs were consistently the highest across sites (0.80-0.92) and major depressive disorder with psychosis were the most variable (0.57-0.79). To determine if the first documented code captured first-episode psychosis (FEP), we excluded cases with prior chart evidence of a diagnosis of or treatment for a psychotic illness, yielding substantially lower PPVs (0.08-0.62). Conclusions: We found that the first documented psychosis diagnostic code accurately captured true episodes of psychosis but was a poor index of FEP. These data have important implications for the development of risk prediction models designed to predict or detect undiagnosed psychosis.

4.

HistoriView: Implementation and Evaluation of a Novel Approach to Review a Patient Using a Scalable Space-Efficient Timeline without Zoom Interactions.

Park, Heekyong; Wang, Taowei David; Wattanasin, Nich; Castro, Victor M; Gainer, Vivian; Murphy, Shawn.

Appl Clin Inform ; 15(2): 250-264, 2024 Mar.

Article in English | MEDLINE | ID: mdl-38359876

ABSTRACT

BACKGROUND: Timelines have been used for patient review. While maintaining a compact overview is important, merged event representations caused by the intricate and voluminous patient data bring event recognition, access ambiguity, and inefficient interaction problems. Handling large patient data efficiently is another challenge. OBJECTIVE: This study aims to develop a scalable, efficient timeline to enhance patient review for research purposes. The focus is on addressing the challenges presented by the intricate and voluminous patient data. METHODS: We propose a high-throughput, space-efficient HistoriView timeline for an individual patient. For a compact overview, it uses nonstacking event representation. An overlay detection algorithm, y-shift visualization, and popup-based interaction facilitate comprehensive analysis of overlapping datasets. An i2b2 HistoriView plugin was deployed, using split query and event reduction approaches, delivering the entire history efficiently without losing information. For evaluation, 11 participants completed a usability survey and a preference survey, followed by qualitative feedback. To evaluate scalability, 100 randomly selected patients over 60 years old were tested on the plugin and were compared with a baseline visualization. RESULTS: Most participants found that HistoriView was easy to use and learn and delivered information clearly without zooming. All preferred HistoriView over a stacked timeline. They expressed satisfaction on display, ease of learning and use, and efficiency. However, challenges and suggestions for improvement were also identified. In the performance test, the largest patient had 32,630 records, which exceeds the baseline limit. HistoriView reduced it to 2,019 visual artifacts. All patients were pulled and visualized within 45.40 seconds. Visualization took less than 3 seconds for all. 
DISCUSSION AND CONCLUSION: HistoriView allows complete data exploration without exhaustive interactions in a compact overview. It is useful for dense data or iterative comparisons. However, issues in exploring subconcept records were reported. HistoriView handles large patient data preserving original information in a reasonable time.

Subjects

Algorithms , Learning , Humans , Middle Aged , Personal Satisfaction , Patients

5.

Offspring cardiometabolic outcomes and postnatal growth trajectories after exposure to maternal SARS-CoV-2 infection.

Shook, Lydia L; Castro, Victor M; Herzberg, Emily M; Fourman, Lindsay T; Kaimal, Anjali J; Perlis, Roy H; Edlow, Andrea G.

Obesity (Silver Spring) ; 32(5): 969-978, 2024 May.

Article in English | MEDLINE | ID: mdl-38351665

ABSTRACT

OBJECTIVE: The objective of this study is to determine whether in utero exposure to SARS-CoV-2 is associated with increased risk for a cardiometabolic diagnosis by 18 months of age. METHODS: This retrospective electronic health record (EHR)-based cohort study included the live-born offspring of all individuals who delivered during the COVID-19 pandemic (April 1, 2020-December 31, 2021) at eight hospitals in Massachusetts. Offspring exposure was defined as a positive maternal SARS-CoV-2 polymerase chain reaction test during pregnancy. The primary outcome was presence of an ICD-10 code for a cardiometabolic disorder in offspring EHR by 18 months. Weight-, length-, and BMI-for-age z scores were calculated and compared at 6-month intervals from birth to 18 months. RESULTS: A total of 29,510 offspring (1599 exposed and 27,911 unexposed) were included. By 18 months, 6.7% of exposed and 4.4% of unexposed offspring had received a cardiometabolic diagnosis (crude odds ratio [OR] 1.47 [95% CI: 1.10 to 1.94], p = 0.007; adjusted OR 1.38 [1.06 to 1.77], p = 0.01). Exposed offspring had a significantly greater mean BMI-for-age z score versus unexposed offspring at 6 months (z score difference 0.19 [95% CI: 0.10 to 0.29], p < 0.001; adjusted difference 0.04 [-0.06 to 0.13], p = 0.4). CONCLUSIONS: Exposure to maternal SARS-CoV-2 infection was associated with an increased risk of receiving a cardiometabolic diagnosis by 18 months preceded by greater BMI-for-age at 6 months.

Subjects

COVID-19 , Pregnancy Complications, Infectious , Prenatal Exposure Delayed Effects , SARS-CoV-2 , Humans , Female , COVID-19/epidemiology , Pregnancy , Retrospective Studies , Infant , Adult , Male , Pregnancy Complications, Infectious/virology , Pregnancy Complications, Infectious/epidemiology , Massachusetts/epidemiology , Infant, Newborn , Body Mass Index , Cardiometabolic Risk Factors , Child Development , Metabolic Diseases/epidemiology , Metabolic Diseases/etiology

6.

LATTE: Label-efficient incident phenotyping from longitudinal electronic health records.

Wen, Jun; Hou, Jue; Bonzel, Clara-Lea; Zhao, Yihan; Castro, Victor M; Gainer, Vivian S; Weisenfeld, Dana; Cai, Tianrun; Ho, Yuk-Lam; Panickan, Vidul A; Costa, Lauren; Hong, Chuan; Gaziano, J Michael; Liao, Katherine P; Lu, Junwei; Cho, Kelly; Cai, Tianxi.

Patterns (N Y) ; 5(1): 100906, 2024 Jan 12.

Article in English | MEDLINE | ID: mdl-38264714

ABSTRACT

Electronic health record (EHR) data are increasingly used to support real-world evidence studies but are limited by the lack of precise timings of clinical events. Here, we propose a label-efficient incident phenotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data. By leveraging the pre-trained semantic embeddings, LATTE selects predictive features and compresses their information into longitudinal visit embeddings through visit attention learning. LATTE models the sequential dependency between the target event and visit embeddings to derive the timings. To improve label efficiency, LATTE constructs longitudinal silver-standard labels from unlabeled patients to perform semi-supervised training. LATTE is evaluated on the onset of type 2 diabetes, heart failure, and relapses of multiple sclerosis. LATTE consistently achieves substantial improvements over benchmark methods while providing high prediction interpretability. The event timings are shown to help discover risk factors of heart failure among patients with rheumatoid arthritis.

7.

Development and multi-site external validation of a generalizable risk prediction model for bipolar disorder.

Walsh, Colin G; Ripperger, Michael A; Hu, Yirui; Sheu, Yi-Han; Lee, Hyunjoon; Wilimitis, Drew; Zheutlin, Amanda B; Rocha, Daniel; Choi, Karmel W; Castro, Victor M; Kirchner, H Lester; Chabris, Christopher F; Davis, Lea K; Smoller, Jordan W.

Transl Psychiatry ; 14(1): 58, 2024 Jan 25.

Article in English | MEDLINE | ID: mdl-38272862

ABSTRACT

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Network across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, and penalized regression, as well as stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82-0.87) and calibration performance with positive predictive values above 5% in the highest risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine.
Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Network website.

Subjects

Bipolar Disorder , Humans , Bipolar Disorder/diagnosis , Case-Control Studies , Risk Assessment/methods , Machine Learning , Electronic Health Records

8.

An efficient landmark model for prediction of suicide attempts in multiple clinical settings.

Sheu, Yi-Han; Sun, Jiehuan; Lee, Hyunjoon; Castro, Victor M; Barak-Corren, Yuval; Song, Eugene; Madsen, Emily M; Gordon, William J; Kohane, Isaac S; Churchill, Susanne E; Reis, Ben Y; Cai, Tianxi; Smoller, Jordan W.

Psychiatry Res ; 323: 115175, 2023 05.

Article in English | MEDLINE | ID: mdl-37003169

ABSTRACT

Growing evidence has shown that applying machine learning models to large clinical data sources may exceed clinician performance in suicide risk stratification. However, many existing prediction models either suffer from "temporal bias" (a bias that stems from using case-control sampling) or require training on all available patient visit data. Here, we adopt a "landmark model" framework that aligns with clinical practice for prediction of suicide-related behaviors (SRBs) using a large electronic health record database. Using the landmark approach, we developed models for SRB prediction (regularized Cox regression and random survival forest) that establish a time-point (e.g., clinical visit) from which predictions are made over user-specified prediction windows using historical information up to that point. We applied this approach to cohorts from three clinical settings: general outpatient, psychiatric emergency department, and psychiatric inpatients, for varying prediction windows and lengths of historical data. Models achieved high discriminative performance (area under the Receiver Operating Characteristic curve 0.74-0.93 for the Cox model) across different prediction windows and settings, even with relatively short periods of historical data. In short, we developed accurate, dynamic SRB risk prediction models with the landmark approach that reduce bias and enhance the reliability and portability of suicide risk prediction models.
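The landmark idea described in this abstract (fix a time-point such as a clinical visit, use only history up to that point, and predict over a chosen window) can be illustrated with a small Python sketch. Everything here is a simplified assumption rather than the authors' pipeline: the function name, the single visit-count feature, and the binary window label are hypothetical, and the sketch ignores censoring, which the Cox and random survival forest models in the study would handle.

```python
def landmark_rows(visit_days, event_day, window_days, history_days):
    """Build one training row per landmark (visit) for a single patient.

    visit_days   -- sorted visit times, in days since the first record
    event_day    -- day of the first event, or None if no event was observed
    window_days  -- length of the prediction window after each landmark
    history_days -- how far back features may look from each landmark
    """
    rows = []
    for t in visit_days:
        if event_day is not None and event_day <= t:
            break  # after the first event the patient is no longer at risk
        # Features use only information available at or before the landmark.
        history = [v for v in visit_days if t - history_days <= v <= t]
        # Label: does the event fall inside the window that starts at the landmark?
        label = event_day is not None and t < event_day <= t + window_days
        rows.append({
            "landmark_day": t,
            "n_visits_in_history": len(history),
            "event_in_window": label,
        })
    return rows


# Hypothetical patient: four visits, first event on day 230.
rows = landmark_rows([0, 30, 200, 400], event_day=230, window_days=90, history_days=180)
for row in rows:
    print(row)
```

Because each row is built from the landmark forward, no information from after the prediction time leaks into the features, which is the property the abstract contrasts with case-control sampling.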

Subjects

Emergency Service, Hospital , Suicide, Attempted , Humans , Suicide, Attempted/psychology , Reproducibility of Results , ROC Curve

9.

Development and Multi-Site External Validation of a Generalizable Risk Prediction Model for Bipolar Disorder.

Walsh, Colin G; Ripperger, Michael A; Hu, Yirui; Sheu, Yi-Han; Wilimitis, Drew; Zheutlin, Amanda B; Rocha, Daniel; Choi, Karmel W; Castro, Victor M; Kirchner, H Lester; Chabris, Christopher F; Davis, Lea K; Smoller, Jordan W.

medRxiv ; 2023 Feb 26.

Article in English | MEDLINE | ID: mdl-36865341

ABSTRACT

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Consortium across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, penalized regression, including stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82 - 0.87) and calibration performance with positive predictive values above 5% in the highest risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. 
Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Consortium website.

10.

Sex-Specific Neurodevelopmental Outcomes Among Offspring of Mothers With SARS-CoV-2 Infection During Pregnancy.

Edlow, Andrea G; Castro, Victor M; Shook, Lydia L; Haneuse, Sebastien; Kaimal, Anjali J; Perlis, Roy H.

JAMA Netw Open ; 6(3): e234415, 2023 03 01.

Article in English | MEDLINE | ID: mdl-36951861

ABSTRACT

Importance: Prior studies using large registries have suggested a modest increase in risk for neurodevelopmental diagnoses among children of mothers with immune activation during pregnancy, and such risk may be sex-specific. Objective: To determine whether in utero exposure to SARS-CoV-2 is associated with sex-specific risk for neurodevelopmental disorders up to 18 months after birth, compared with unexposed offspring born during or prior to the COVID-19 pandemic period. Design, Setting, and Participants: This retrospective cohort study included the live offspring of all mothers who delivered between January 1 and December 31, 2018 (born and followed up before the COVID-19 pandemic), between March 1 and December 31, 2019 (born before and followed up during the COVID-19 pandemic), and between March 1, 2020, and May 31, 2021 (born and followed up during the COVID-19 pandemic). Offspring were born at any of 8 hospitals across 2 health systems in Massachusetts. Exposures: Polymerase chain reaction evidence of maternal SARS-CoV-2 infection during pregnancy. Main Outcomes and Measures: Electronic health record documentation of International Statistical Classification of Diseases and Related Health Problems, Tenth Revision diagnostic codes corresponding to neurodevelopmental disorders. Results: The COVID-19 pandemic cohort included 18 355 live births (9399 boys [51.2%]), including 883 (4.8%) with maternal SARS-CoV-2 positivity during pregnancy. The cohort included 1809 Asian individuals (9.9%), 1635 Black individuals (8.9%), 12 718 White individuals (69.3%), and 1714 individuals (9.3%) who were of other race (American Indian or Alaska Native, Native Hawaiian or other Pacific Islander, more than 1 race); 2617 individuals (14.3%) were of Hispanic ethnicity. Mean maternal age was 33.0 (IQR, 30.0-36.0) years.
In adjusted regression models accounting for race, ethnicity, insurance status, hospital type (academic center vs community), maternal age, and preterm status, maternal SARS-CoV-2 positivity was associated with a statistically significant elevation in risk for neurodevelopmental diagnoses at 12 months among male offspring (adjusted OR, 1.94 [95% CI 1.12-3.17]; P = .01) but not female offspring (adjusted OR, 0.89 [95% CI, 0.39-1.76]; P = .77). Similar effects were identified using matched analyses in lieu of regression. At 18 months, more modest effects were observed in male offspring (adjusted OR, 1.42 [95% CI, 0.92-2.11]; P = .10). Conclusions and Relevance: In this cohort study of offspring with SARS-CoV-2 exposure in utero, such exposure was associated with greater magnitude of risk for neurodevelopmental diagnoses among male offspring at 12 months following birth. As with prior studies of maternal infection, substantially larger cohorts and longer follow-up will be required to reliably estimate or refute risk.

Subjects

COVID-19 , Pregnancy , Child , Female , Infant, Newborn , Humans , Male , Adult , COVID-19/epidemiology , Cohort Studies , SARS-CoV-2 , Retrospective Studies , Pandemics

11.

Improving risk prediction for target subpopulations: Predicting suicidal behaviors among multiple sclerosis patients.

Barak-Corren, Yuval; Castro, Victor M; Javitt, Solomon; Nock, Matthew K; Smoller, Jordan W; Reis, Ben Y.

PLoS One ; 18(2): e0277483, 2023.

Article in English | MEDLINE | ID: mdl-36795700

ABSTRACT

Several recent studies have applied machine learning techniques to develop risk algorithms that predict subsequent suicidal behavior based on electronic health record data. In this study we used a retrospective cohort study design to test whether developing more tailored predictive models, within specific subpopulations of patients, would improve predictive accuracy. A retrospective cohort of 15,117 patients diagnosed with multiple sclerosis (MS), a diagnosis associated with increased risk of suicidal behavior, was used. The cohort was randomly divided into equal sized training and validation sets. Overall, suicidal behavior was identified among 191 (1.3%) of the patients with MS. A Naïve Bayes Classifier model was trained on the training set to predict future suicidal behavior. With 90% specificity, the model detected 37% of subjects who later demonstrated suicidal behavior, on average 4.6 years before the first suicide attempt. A model trained only on MS patients was better at predicting suicide in MS patients than a model trained on a general patient sample of a similar size (AUC of 0.77 vs. 0.66). Unique risk factors for suicidal behavior among patients with MS included pain-related codes, gastroenteritis and colitis, and history of smoking. Future studies are needed to further test the value of developing population-specific risk models.
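The phrase "with 90% specificity, the model detected 37% of subjects" describes sensitivity read off at a fixed specificity. A minimal Python sketch of that calculation follows. It assumes higher scores mean higher predicted risk and picks the threshold from the scores of patients without the outcome; the function name and the example scores are hypothetical and unrelated to the study data.

```python
import math


def sensitivity_at_specificity(scores, labels, target_specificity=0.90):
    """Return (threshold, specificity, sensitivity).

    A patient is flagged when score > threshold. The threshold is the
    lowest negative-class score that leaves at least the target share
    of negatives unflagged.
    """
    negatives = sorted(s for s, y in zip(scores, labels) if y == 0)
    positives = [s for s, y in zip(scores, labels) if y == 1]
    if not negatives or not positives:
        raise ValueError("need at least one positive and one negative label")
    k = math.ceil(target_specificity * len(negatives))
    threshold = negatives[k - 1] if k > 0 else float("-inf")
    specificity = sum(s <= threshold for s in negatives) / len(negatives)
    sensitivity = sum(s > threshold for s in positives) / len(positives)
    return threshold, specificity, sensitivity


# Hypothetical risk scores: 10 patients without the outcome, 4 with it.
neg_scores = [0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.80]
pos_scores = [0.30, 0.50, 0.60, 0.90]
scores = neg_scores + pos_scores
labels = [0] * len(neg_scores) + [1] * len(pos_scores)

threshold, specificity, sensitivity = sensitivity_at_specificity(scores, labels)
print(threshold, specificity, sensitivity)
```

Tied scores at the threshold can push the achieved specificity above the target, so the function returns the achieved value rather than assuming it equals the target.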

Subjects

Multiple Sclerosis , Suicidal Ideation , Humans , Bayes Theorem , Retrospective Studies , Suicide, Attempted

12.

Evaluation of the portability of computable phenotypes with natural language processing in the eMERGE network.

Pacheco, Jennifer A; Rasmussen, Luke V; Wiley, Ken; Person, Thomas Nate; Cronkite, David J; Sohn, Sunghwan; Murphy, Shawn; Gundelach, Justin H; Gainer, Vivian; Castro, Victor M; Liu, Cong; Mentch, Frank; Lingren, Todd; Sundaresan, Agnes S; Eickelberg, Garrett; Willis, Valerie; Furmanchuk, Al'ona; Patel, Roshan; Carrell, David S; Deng, Yu; Walton, Nephi; Satterfield, Benjamin A; Kullo, Iftikhar J; Dikilitas, Ozan; Smith, Joshua C; Peterson, Josh F; Shang, Ning; Kiryluk, Krzysztof; Ni, Yizhao; Li, Yikuan; Nadkarni, Girish N; Rosenthal, Elisabeth A; Walunas, Theresa L; Williams, Marc S; Karlson, Elizabeth W; Linder, Jodell E; Luo, Yuan; Weng, Chunhua; Wei, WeiQi.

Sci Rep ; 13(1): 1971, 2023 02 03.

Article in English | MEDLINE | ID: mdl-36737471

ABSTRACT

The electronic Medical Records and Genomics (eMERGE) Network assessed the feasibility of deploying portable phenotype rule-based algorithms with natural language processing (NLP) components added to improve performance of existing algorithms using electronic health records (EHRs). Based on scientific merit and predicted difficulty, eMERGE selected six existing phenotypes to enhance with NLP. We assessed performance, portability, and ease of use. We summarized lessons learned by: (1) challenges; (2) best practices to address challenges based on existing evidence and/or eMERGE experience; and (3) opportunities for future research. Adding NLP resulted in improved, or the same, precision and/or recall for all but one algorithm. Portability, phenotyping workflow/process, and technology were major themes. With NLP, development and validation took longer. Besides portability of NLP technology and algorithm replicability, factors to ensure success include privacy protection, technical infrastructure setup, intellectual property agreement, and efficient communication. Workflow improvements can improve communication and reduce implementation time. NLP performance varied mainly due to clinical document heterogeneity; therefore, we suggest using semi-structured notes, comprehensive documentation, and customization options. NLP portability is possible with improved phenotype algorithm performance, but careful planning and architecture of the algorithms is essential to support local customizations.

Subjects

Electronic Health Records , Natural Language Processing , Genomics , Algorithms , Phenotype

13.

Evaluation of a Model to Target High-risk Psychiatric Inpatients for an Intensive Postdischarge Suicide Prevention Intervention.

Kessler, Ronald C; Bauer, Mark S; Bishop, Todd M; Bossarte, Robert M; Castro, Victor M; Demler, Olga V; Gildea, Sarah M; Goulet, Joseph L; King, Andrew J; Kennedy, Chris J; Landes, Sara J; Liu, Howard; Luedtke, Alex; Mair, Patrick; Marx, Brian P; Nock, Matthew K; Petukhova, Maria V; Pigeon, Wilfred R; Sampson, Nancy A; Smoller, Jordan W; Miller, Aletha; Haas, Gretchen; Benware, Jeffrey; Bradley, John; Owen, Richard R; House, Samuel; Urosevic, Snezana; Weinstock, Lauren M.

JAMA Psychiatry ; 80(3): 230-240, 2023 03 01.

Article in English | MEDLINE | ID: mdl-36652267

ABSTRACT

Importance: The months after psychiatric hospital discharge are a time of high risk for suicide. Intensive postdischarge case management, although potentially effective in suicide prevention, is likely to be cost-effective only if targeted at high-risk patients. A previously developed machine learning (ML) model showed that postdischarge suicides can be predicted from electronic health records and geospatial data, but it is unknown if prediction could be improved by adding additional information. Objective: To determine whether model prediction could be improved by adding information extracted from clinical notes and public records. Design, Setting, and Participants: Models were trained to predict suicides in the 12 months after Veterans Health Administration (VHA) short-term (less than 365 days) psychiatric hospitalizations between the beginning of 2010 and September 1, 2012 (299 050 hospitalizations, with 916 hospitalizations followed within 12 months by suicides) and tested in the hospitalizations from September 2, 2012, to December 31, 2013 (149 738 hospitalizations, with 393 hospitalizations followed within 12 months by suicides). Validation focused on net benefit across a range of plausible decision thresholds. Predictor importance was assessed with Shapley additive explanations (SHAP) values. Data were analyzed from January to August 2022. Main Outcomes and Measures: Suicides were defined by the National Death Index. Base model predictors included VHA electronic health records and patient residential data. The expanded predictors came from natural language processing (NLP) of clinical notes and a social determinants of health (SDOH) public records database. Results: The model included 448 788 unique hospitalizations.
Net benefit over risk horizons between 3 and 12 months was generally highest for the model that included both NLP and SDOH predictors (area under the receiver operating characteristic curve range, 0.747-0.780; area under the precision recall curve relative to the suicide rate range, 3.87-5.75). NLP and SDOH predictors also had the highest predictor class-level SHAP values (proportional SHAP = 64.0% and 49.3%, respectively), although the single highest positive variable-level SHAP value was for a count of medications classified by the US Food and Drug Administration as increasing suicide risk prescribed the year before hospitalization (proportional SHAP = 15.0%). Conclusions and Relevance: In this study, clinical notes and public records were found to improve ML model prediction of suicide after psychiatric hospitalization. The model had positive net benefit over 3-month to 12-month risk horizons for plausible decision thresholds. Although caution is needed in inferring causality based on predictor importance, several key predictors have potential intervention implications that should be investigated in future studies.
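Net benefit at a decision threshold, the validation metric named in this abstract, is conventionally computed in decision-curve analysis as true positives per patient minus false positives per patient weighted by the odds of the threshold. The Python sketch below shows that standard formula only; it is an assumption that the study used this exact form, and all counts in the example are hypothetical.

```python
def net_benefit(true_positives, false_positives, n, threshold):
    """Standard decision-curve net benefit at a given risk threshold.

    threshold is the predicted-risk cut-off above which a patient
    would be offered the intervention (0 < threshold < 1).
    """
    if not 0 < threshold < 1:
        raise ValueError("threshold must be strictly between 0 and 1")
    if n <= 0:
        raise ValueError("n must be positive")
    weight = threshold / (1 - threshold)  # odds of the threshold
    return true_positives / n - (false_positives / n) * weight


# Hypothetical cohort of 1000 patients with 50 events, threshold 10%.
model_nb = net_benefit(true_positives=30, false_positives=100, n=1000, threshold=0.10)
treat_all_nb = net_benefit(true_positives=50, false_positives=950, n=1000, threshold=0.10)
print(round(model_nb, 4), round(treat_all_nb, 4))
```

A model is useful at a threshold when its net benefit exceeds both the treat-all strategy and zero (the treat-none strategy), which is the comparison the hypothetical numbers above illustrate.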

Subjects

Suicide Prevention , Suicide , Humans , Suicide/psychology , Patient Discharge , Inpatients , Aftercare

14.

Sex-specific neurodevelopmental outcomes in offspring of mothers with SARS-CoV-2 in pregnancy: an electronic health records cohort.

Edlow, Andrea G; Castro, Victor M; Shook, Lydia L; Haneuse, Sebastien; Kaimal, Anjali J; Perlis, Roy H.

medRxiv ; 2022 Nov 18.

Article in English | MEDLINE | ID: mdl-36415457

Abstract

Importance: Prior studies using large registries suggested a modest increase in risk for neurodevelopmental diagnoses among children of mothers with immune activation during pregnancy, and such risk may be sex-specific. Objective: To determine whether in utero exposure to the novel coronavirus SARS-CoV-2 is associated with sex-specific risk for neurodevelopmental disorders up to 18 months after birth, compared to unexposed offspring born during or prior to the pandemic period. Design: Retrospective cohort. Participants: Live offspring of all mothers who delivered between March 2018 and May 2021 at any of eight hospitals across two health systems in Massachusetts. Exposure: PCR evidence of maternal SARS-CoV-2 infection during pregnancy. Main Outcome and Measures: Electronic health record documentation of ICD-10 diagnostic codes corresponding to neurodevelopmental disorders. Results: The pandemic cohort included 18,323 live births, including 877 (4.8%) to individuals with SARS-CoV-2 positivity during pregnancy. The cohort included 1806 (9.9%) Asian individuals, 1634 (8.9%) Black individuals, 1711 (9.3%) individuals of another race, and 12,694 (69%) White individuals; 2614 (14%) were of Hispanic ethnicity. Mean maternal age was 33.0 years (IQR 30.0-36.0). In adjusted regression models accounting for race, ethnicity, insurance status, hospital type (academic center vs. community), maternal age, and preterm status, SARS-CoV-2 positivity was associated with statistically significant elevation in risk for neurodevelopmental diagnoses among male offspring (adjusted OR 1.99, 95% CI 1.19-3.34; p=0.009) but not female offspring (adjusted OR 0.90, 95% CI 0.43-1.88; p=0.8). Similar effects were identified using matched analyses in lieu of regression. Conclusion and Relevance: SARS-CoV-2 exposure in utero was associated with greater magnitude of risk for neurodevelopmental diagnoses among male offspring in the 12 months following birth. 
As with prior studies of maternal infection, substantially larger cohorts and longer follow-up will be required to reliably estimate or refute risk. Trial Registration: NA. Key Points: Question: Are rates of neurodevelopmental disorder diagnoses greater among male or female children with COVID-19 exposure in utero compared to those with no such exposure? Findings: In a cohort of 18,323 infants delivered after February 2020, males but not females born to mothers with a positive SARS-CoV-2 PCR test during pregnancy were more likely to receive a neurodevelopmental diagnosis in the first 12 months after delivery, even after accounting for preterm delivery. Meaning: These findings suggest that male offspring exposed to COVID-19 in utero may be at increased risk for neurodevelopmental disorders.

15.

The Digital Analytic Patient Reviewer (DAPR) for COVID-19 Data Mart Validation.

Park, Heekyong; Wang, Taowei David; Wattanasin, Nich; Castro, Victor M; Gainer, Vivian; Goryachev, Sergey; Murphy, Shawn.

Methods Inf Med ; 61(5-06): 167-173, 2022 12.

Article in English | MEDLINE | ID: mdl-36070785

Abstract

OBJECTIVE: To provide high-quality data for coronavirus disease 2019 (COVID-19) research, we validated derived COVID-19 clinical indicators and 22 associated machine learning phenotypes in the Mass General Brigham (MGB) COVID-19 Data Mart. METHODS: Fifteen reviewers performed a retrospective manual chart review for 150 COVID-19-positive patients in the data mart. To support rapid chart review for a wide range of target data, we offered a natural language processing (NLP)-based chart review tool, the Digital Analytic Patient Reviewer (DAPR). For this work, we designed a dedicated patient summary view and developed 127 new NLP logics to extract COVID-19-relevant medical concepts and target phenotypes. Moreover, we transformed DAPR for research purposes so that patient information is used for an approved research purpose only and enabled fast access to the integrated patient information. Lastly, we performed a survey to evaluate the validation difficulty and usefulness of the DAPR. RESULTS: The concepts for COVID-19-positive cohort, COVID-19 index date, COVID-19-related admission, and the admission date were shown to have high values in all evaluation metrics. However, three phenotypes showed notable performance degradation relative to their positive predictive value in the prepandemic population. Based on these results, we removed the three phenotypes from our data mart. In the survey about using the tool, participants expressed positive attitudes toward using DAPR for chart review. They assessed that the validation was easy and that DAPR helped find relevant information. Some validation difficulties were also discussed. CONCLUSION: Use of NLP technology in the chart review helped to cope with the challenges of the COVID-19 data validation task and accelerated the process. As a result, we could provide more reliable research data promptly and respond to the COVID-19 crisis. DAPR's benefit can be expanded to other domains. We plan to operationalize it for wider research groups.

Subjects

COVID-19 , Humans , Retrospective Studies , Data Warehousing , Natural Language Processing , Data Accuracy

16.

Multiview Incomplete Knowledge Graph Integration with application to cross-institutional EHR data harmonization.

Zhou, Doudou; Gan, Ziming; Shi, Xu; Patwari, Alina; Rush, Everett; Bonzel, Clara-Lea; Panickan, Vidul A; Hong, Chuan; Ho, Yuk-Lam; Cai, Tianrun; Costa, Lauren; Li, Xiaoou; Castro, Victor M; Murphy, Shawn N; Brat, Gabriel; Weber, Griffin; Avillach, Paul; Gaziano, J Michael; Cho, Kelly; Liao, Katherine P; Lu, Junwei; Cai, Tianxi.

J Biomed Inform ; 133: 104147, 2022 09.

Article in English | MEDLINE | ID: mdl-35872266

Abstract

OBJECTIVE: The growing availability of electronic health records (EHR) data opens opportunities for integrative analysis of multi-institutional EHR to produce generalizable knowledge. A key barrier to such integrative analyses is the lack of semantic interoperability across different institutions due to coding differences. We propose a Multiview Incomplete Knowledge Graph Integration (MIKGI) algorithm to integrate information from multiple sources with partially overlapping EHR concept codes to enable translations between healthcare systems. METHODS: The MIKGI algorithm combines knowledge graph information from (i) embeddings trained from the co-occurrence patterns of medical codes within each EHR system and (ii) semantic embeddings of the textual strings of all medical codes obtained from the Self-Aligning Pretrained BERT (SAPBERT) algorithm. Due to the heterogeneity in the coding across healthcare systems, each EHR source provides partial coverage of the available codes. MIKGI synthesizes the incomplete knowledge graphs derived from these multi-source embeddings by minimizing a spherical loss function that combines the pairwise directional similarities of embeddings computed from all available sources. MIKGI outputs harmonized semantic embedding vectors for all EHR codes, which improves the quality of the embeddings and enables direct assessment of both similarity and relatedness between any pair of codes from multiple healthcare systems. RESULTS: With EHR co-occurrence data from Veterans Affairs (VA) healthcare and Mass General Brigham (MGB), the MIKGI algorithm produces high-quality embeddings for a variety of downstream tasks, including detecting known similar or related entity pairs and mapping VA local codes to the relevant EHR codes used at MGB. Based on the cosine similarity of the MIKGI trained embeddings, the AUC was 0.918 for detecting similar entity pairs and 0.809 for detecting related pairs. 
For cross-institutional medical code mapping, the top 1 and top 5 accuracy were 91.0% and 97.5% when mapping medication codes at VA to RxNorm medication codes at MGB; 59.1% and 75.8% when mapping VA local laboratory codes to LOINC hierarchy. When trained with 500 labels, the lab code mapping attained top 1 and 5 accuracy at 77.7% and 87.9%. MIKGI also attained best performance in selecting VA local lab codes for desired laboratory tests and COVID-19 related features for COVID EHR studies. Compared to existing methods, MIKGI attained the most robust performance with accuracy the highest or near the highest across all tasks. CONCLUSIONS: The proposed MIKGI algorithm can effectively integrate incomplete summary data from biomedical text and EHR data to generate harmonized embeddings for EHR codes for knowledge graph modeling and cross-institutional translation of EHR codes.
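The abstract above reports top 1 and top 5 accuracy for mapping codes by cosine similarity of embeddings. The sketch below shows only that evaluation step under simple assumptions: it ranks target codes by cosine similarity to each source code and counts how often the correct target appears among the first k. It does not implement MIKGI or its spherical loss; the code names (`SRC_A`, `TGT_X`, and so on), vectors, and function names are hypothetical.

```python
import math
from typing import Dict, List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


def top_k_matches(query: Sequence[float], targets: Dict[str, Sequence[float]], k: int) -> List[str]:
    """Return the k target codes most similar to the query embedding."""
    ranked = sorted(targets, key=lambda code: cosine(query, targets[code]), reverse=True)
    return ranked[:k]


def top_k_accuracy(sources: Dict[str, Sequence[float]],
                   targets: Dict[str, Sequence[float]],
                   gold: Dict[str, str], k: int) -> float:
    """Fraction of source codes whose gold target is among the top k matches."""
    hits = sum(1 for code, vec in sources.items()
               if gold[code] in top_k_matches(vec, targets, k))
    return hits / len(sources)


# Hypothetical two-dimensional embeddings, for illustration only.
toy_targets = {"TGT_X": [1.0, 0.0], "TGT_Y": [0.0, 1.0], "TGT_Z": [1.0, 1.0]}
toy_sources = {"SRC_A": [0.9, 0.1], "SRC_B": [0.6, 0.5]}
toy_gold = {"SRC_A": "TGT_X", "SRC_B": "TGT_X"}

print(top_k_accuracy(toy_sources, toy_targets, toy_gold, k=1))
print(top_k_accuracy(toy_sources, toy_targets, toy_gold, k=2))
```

In the toy data, `SRC_B` lies closest to `TGT_Z`, so its gold target is missed at k = 1 and recovered at k = 2, which is the kind of gap the top 1 versus top 5 figures in the abstract describe.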

Subjects

COVID-19 , Electronic Health Records , Algorithms , Humans , Logical Observation Identifiers Names and Codes , Pattern Recognition, Automated

17.

Case-control study of neuropsychiatric symptoms in electronic health records following COVID-19 hospitalization in 2 academic health systems.

Castro, Victor M; Rosand, Jonathan; Giacino, Joseph T; McCoy, Thomas H; Perlis, Roy H.

Mol Psychiatry ; 27(9): 3898-3903, 2022 09.

Article in English | MEDLINE | ID: mdl-35705635

Abstract

Neuropsychiatric symptoms may persist following acute COVID-19 illness, but the extent to which these symptoms are specific to COVID-19 has not been established. We utilized electronic health records across 6 hospitals in Massachusetts to characterize cohorts of individuals discharged following admission for COVID-19 between March 2020 and May 2021, and compared them to individuals hospitalized for other indications during this period. Natural language processing was applied to narrative clinical notes to identify neuropsychiatric symptom domains up to 150 days following hospitalization, in addition to those reflected in diagnostic codes as measured in prior studies. Among 6619 individuals hospitalized for COVID-19 drawn from a total of 42,961 hospital discharges, the most commonly documented symptom domains between 31 and 90 days after initial positive test were fatigue (13.4%), mood and anxiety symptoms (11.2%), and impaired cognition (8.0%). In regression models adjusted for sociodemographic features and hospital course, none of these were significantly more common among COVID-19 patients; indeed, mood and anxiety symptoms were less frequent (adjusted OR 0.72, 95% CI 0.64-0.92). Between 91 and 150 days after positivity, the most commonly detected symptoms were fatigue (10.9%), mood and anxiety symptoms (8.2%), and sleep disruption (6.8%), with impaired cognition in 5.8%. Frequency was again similar among non-COVID-19 post-hospital patients, with mood and anxiety symptoms less common (aOR 0.63, 95% CI 0.52-0.75). Propensity-score matched analyses yielded similar results. Overall, neuropsychiatric symptoms were common up to 150 days after initial hospitalization, but occurred at generally similar rates among individuals hospitalized for other indications during the same period. Post-acute sequelae of COVID-19 may benefit from standard if less-specific treatments developed for rehabilitation after hospitalization.

Subjects

COVID-19 , Humans , Case-Control Studies , Electronic Health Records , Hospitalization , Fatigue

18.

Neurodevelopmental Outcomes at 1 Year in Infants of Mothers Who Tested Positive for SARS-CoV-2 During Pregnancy.

Edlow, Andrea G; Castro, Victor M; Shook, Lydia L; Kaimal, Anjali J; Perlis, Roy H.

JAMA Netw Open ; 5(6): e2215787, 2022 06 01.

Article in English | MEDLINE | ID: mdl-35679048

Abstract

Importance: Epidemiologic studies suggest maternal immune activation during pregnancy may be associated with neurodevelopmental effects in offspring. Objective: To evaluate whether in utero exposure to SARS-CoV-2 is associated with risk for neurodevelopmental disorders in the first 12 months after birth. Design, Setting, and Participants: This retrospective cohort study examined live offspring of all mothers who delivered between March and September 2020 at any of 6 Massachusetts hospitals across 2 health systems. Statistical analysis was performed from October to December 2021. Exposures: Maternal SARS-CoV-2 infection confirmed by a polymerase chain reaction test during pregnancy. Main Outcomes and Measures: Neurodevelopmental disorders determined from International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) diagnostic codes over the first 12 months of life; sociodemographic and clinical features of mothers and offspring; all drawn from the electronic health record. Results: The cohort included 7772 live births (7466 pregnancies, 96% singleton, 222 births to SARS-CoV-2 positive mothers), with mean (SD) maternal age of 32.9 (5.0) years; offspring were 9.9% Asian (772), 8.4% Black (656), and 69.0% White (5363); 15.1% (1134) were of Hispanic ethnicity. Preterm delivery was more likely among exposed mothers: 14.4% (32) vs 8.7% (654) (P = .003). Maternal SARS-CoV-2 positivity during pregnancy was associated with greater rate of neurodevelopmental diagnoses in unadjusted models (odds ratio [OR], 2.17 [95% CI, 1.24-3.79]; P = .006) as well as those adjusted for race, ethnicity, insurance status, offspring sex, maternal age, and preterm status (adjusted OR, 1.86 [95% CI, 1.03-3.36]; P = .04). Third-trimester infection was associated with effects of larger magnitude (adjusted OR, 2.34 [95% CI, 1.23-4.44]; P = .01). 
Conclusions and Relevance: This cohort study of SARS-CoV-2 exposure in utero found preliminary evidence that maternal SARS-CoV-2 may be associated with neurodevelopmental sequelae in some offspring. Prospective studies with longer follow-up duration will be required to exclude confounding and confirm these associations.
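The abstract above reports odds ratios with 95% confidence intervals. As a worked illustration of how an unadjusted odds ratio and a Wald interval come out of a 2x2 table, the sketch below uses the preterm-delivery counts given in the abstract (32 exposed, 654 unexposed). It assumes the denominators are the 222 exposed births and the remaining 7772 - 222 births, which the abstract does not state explicitly. The function name is hypothetical, and the result is not a figure reported by the authors.

```python
import math
from typing import Tuple


def odds_ratio_wald_ci(a: int, b: int, c: int, d: int, z: float = 1.96) -> Tuple[float, float, float]:
    """Unadjusted odds ratio with a Wald confidence interval.

    a: exposed with outcome      b: exposed without outcome
    c: unexposed with outcome    d: unexposed without outcome
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Counts from the abstract; the denominators are an assumption (see lead-in).
exposed_total = 222
unexposed_total = 7772 - 222
preterm_exposed = 32
preterm_unexposed = 654

or_est, ci_low, ci_high = odds_ratio_wald_ci(
    preterm_exposed, exposed_total - preterm_exposed,
    preterm_unexposed, unexposed_total - preterm_unexposed,
)
print(f"OR = {or_est:.2f} (95% CI {ci_low:.2f}-{ci_high:.2f})")
```

The adjusted odds ratios in the abstract come from regression models with covariates, so they cannot be reproduced from a single 2x2 table like this one.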

Subjects

COVID-19 , SARS-CoV-2 , Adult , COVID-19/diagnosis , COVID-19/epidemiology , Cohort Studies , Female , Humans , Infant , Infant, Newborn , Mothers , Pregnancy , Prospective Studies , Retrospective Studies

19.

Nonspherical anion sequestration by C-H hydrogen bonding.

Mirzaei, Saber; Espinoza Castro, Victor M; Hernández Sánchez, Raúl.

Chem Sci ; 13(7): 2026-2032, 2022 Feb 16.

Article in English | MEDLINE | ID: mdl-35308854

Abstract

Macrocyclic arenes laid the foundations of supramolecular chemistry and their study established the fundamentals of noncovalent interactions. Advancing their frontier, here we designed rigidified resorcin[4]arenes that serve as hosts for large nonspherical anions. In one synthetic step, we vary the host's anion affinity properties by more than seven orders of magnitude. This is possible by engineering electropositive aromatic C-H bond donors in an idealized square planar geometry embedded within the host's inner cavity. The hydrogen atom's electropositivity is tuned by introducing fluorine atoms as electron withdrawing groups. These novel macrocycles, termed fluorocages, are engineered to sequester large anions. Indeed, experimental data show an increase in the anion association constant (K_a) as the number of F atoms increases. The observed trend is rationalized by DFT calculations of Hirshfeld charges (HCs). Most importantly, fluorocages in solution showed weak-to-medium binding affinity for large anions like [PF6]- (10^2 < K_a < 10^4 M^-1), and high affinity for [MeSO3]- (K_a > 10^6).

20.

Implementing Machine Learning Models for Suicide Risk Prediction in Clinical Practice: Focus Group Study With Hospital Providers.

Bentley, Kate H; Zuromski, Kelly L; Fortgang, Rebecca G; Madsen, Emily M; Kessler, Daniel; Lee, Hyunjoon; Nock, Matthew K; Reis, Ben Y; Castro, Victor M; Smoller, Jordan W.

JMIR Form Res ; 6(3): e30946, 2022 Mar 11.

Article in English | MEDLINE | ID: mdl-35275075

Abstract

BACKGROUND: Interest in developing machine learning models that use electronic health record data to predict patients' risk of suicidal behavior has recently proliferated. However, whether and how such models might be implemented and useful in clinical practice remain unknown. To ultimately make automated suicide risk-prediction models useful in practice, and thus better prevent patient suicides, it is critical to partner with key stakeholders, including the frontline providers who will be using such tools, at each stage of the implementation process. OBJECTIVE: The aim of this focus group study is to inform ongoing and future efforts to deploy suicide risk-prediction models in clinical practice. The specific goals are to better understand hospital providers' current practices for assessing and managing suicide risk; determine providers' perspectives on using automated suicide risk-prediction models in practice; and identify barriers, facilitators, recommendations, and factors to consider. METHODS: We conducted 10 two-hour focus groups with a total of 40 providers from psychiatry, internal medicine and primary care, emergency medicine, and obstetrics and gynecology departments within an urban academic medical center. Audio recordings of open-ended group discussions were transcribed and coded for relevant and recurrent themes by 2 independent study staff members. All coded text was reviewed and discrepancies were resolved in consensus meetings with doctoral-level staff. RESULTS: Although most providers reported using standardized suicide risk assessment tools in their clinical practices, existing tools were commonly described as unhelpful and providers indicated dissatisfaction with current suicide risk assessment methods. Overall, providers' general attitudes toward the practical use of automated suicide risk-prediction models and corresponding clinical decision support tools were positive. 
Providers were especially interested in the potential to identify high-risk patients who might be missed by traditional screening methods. Some expressed skepticism about the potential usefulness of these models in routine care; specific barriers included concerns about liability, alert fatigue, and increased demand on the health care system. Key facilitators included presenting specific patient-level features contributing to risk scores, emphasizing changes in risk over time, and developing systematic clinical workflows and provider training. Participants also recommended considering risk-prediction windows, timing of alerts, who will have access to model predictions, and variability across treatment settings. CONCLUSIONS: Providers were dissatisfied with current suicide risk assessment methods and were open to the use of a machine learning-based risk-prediction system to inform clinical decision-making. They also raised multiple concerns about potential barriers to the usefulness of this approach and suggested several possible facilitators. Future efforts in this area will benefit from incorporating systematic qualitative feedback from providers, patients, administrators, and payers on the use of these new approaches in routine care, especially given the complex, sensitive, and unfortunately still stigmatized nature of suicide risk.
