Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 98
Filtrar
1.
J Clin Neurosci ; 126: 128-134, 2024 Jun 12.
Artigo em Inglês | MEDLINE | ID: mdl-38870642

RESUMO

OBJECTIVE: Intracranial aneurysms (IA) and aortic aneurysms (AA) are both abnormal dilations of arteries with familial predisposition and have been proposed to share co-prevalence and pathophysiology. Associations of IA and non-aortic peripheral aneurysms are less well-studied. The goal of the study was to understand the patterns of aortic and peripheral (extracranial) aneurysms in patients with IA, and risk factors associated with the development of these aneurysms. METHODS: 4701 patients were included in our retrospective analysis of all patients with intracranial aneurysms at our institution over the past 26 years. Patient demographics, comorbidities, and aneurysmal locations were analyzed. Univariate and multivariate analyses were performed to study associations with and without extracranial aneurysms. RESULTS: A total of 3.4% of patients (161 of 4701) with IA had at least one extracranial aneurysm. 2.8% had thoracic or abdominal aortic aneurysms. Age, male sex, hypertension, coronary artery disease, history of ischemic cerebral infarction, connective tissues disease, and family history of extracranial aneurysms in a 1st degree relative were associated with the presence of extracranial aneurysms and a higher number of extracranial aneurysms. In addition, family history of extracranial aneurysms in a second degree relative is associated with the presence of extracranial aneurysms and atrial fibrillation is associated with a higher number of extracranial aneurysms. CONCLUSION: Significant comorbidities are associated with extracranial aneurysms in patients with IA. Family history of extracranial aneurysms has the strongest association and suggests that IA patients with a family history of extracranial aneurysms may benefit from screening.

2.
medRxiv ; 2024 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-38854098

RESUMO

Objective: Postpartum depression (PPD) represents a major contributor to postpartum morbidity and mortality. Beyond efforts at routine screening, risk stratification models could enable more targeted interventions in settings with limited resources. Thus, we aimed to develop and estimate the performance of a generalizable risk stratification model for PPD in patients without a history of depression using information collected as part of routine clinical care. Methods: We performed a retrospective cohort study of all individuals who delivered between 2017 and 2022 in one of two large academic medical centers and six community hospitals. An elastic net model was constructed and externally validated to predict PPD using sociodemographic factors, medical history, and prenatal depression screening information, all of which was known before discharge from the delivery hospitalization. Results: The cohort included 29,168 individuals; 2,703 (9.3%) met at least one criterion for postpartum depression in the 6 months following delivery. In the external validation data, the model had good discrimination and remained well-calibrated: area under the receiver operating characteristic curve 0.721 (95% CI: 0.707-0.734), Brier calibration score 0.088 (95% CI: 0.084 - 0.092). At a specificity of 90%, the positive predictive value was 28.0% (95% CI: 26.0-30.1%), and the negative predictive value was 92.2% (95% CI: 91.8-92.7%). Conclusions: These findings demonstrate that a simple machine-learning model can be used to stratify the risk for PPD before delivery hospitalization discharge. This tool could help identify patients within a practice at the highest risk and facilitate individualized postpartum care planning regarding the prevention of, screening for, and management of PPD at the start of the postpartum period and potentially the onset of symptoms.

3.
Am J Psychiatry ; 181(7): 608-619, 2024 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-38745458

RESUMO

OBJECTIVE: Treatment-resistant depression (TRD) occurs in roughly one-third of all individuals with major depressive disorder (MDD). Although research has suggested a significant common variant genetic component of liability to TRD, with heritability estimated at 8% when compared with non-treatment-resistant MDD, no replicated genetic loci have been identified, and the genetic architecture of TRD remains unclear. A key barrier to this work has been the paucity of adequately powered cohorts for investigation, largely because of the challenge in prospectively investigating this phenotype. The objective of this study was to perform a well-powered genetic study of TRD. METHODS: Using receipt of electroconvulsive therapy (ECT) as a surrogate for TRD, the authors applied standard machine learning methods to electronic health record data to derive predicted probabilities of receiving ECT. These probabilities were then applied as a quantitative trait in a genome-wide association study of 154,433 genotyped patients across four large biobanks. RESULTS: Heritability estimates ranged from 2% to 4.2%, and significant genetic overlap was observed with cognition, attention deficit hyperactivity disorder, schizophrenia, alcohol and smoking traits, and body mass index. Two genome-wide significant loci were identified, both previously implicated in metabolic traits, suggesting shared biology and potential pharmacological implications. CONCLUSIONS: This work provides support for the utility of estimation of disease probability for genomic investigation and provides insights into the genetic architecture and biology of TRD.


Assuntos
Transtorno Depressivo Maior , Transtorno Depressivo Resistente a Tratamento , Eletroconvulsoterapia , Estudo de Associação Genômica Ampla , Humanos , Transtorno Depressivo Resistente a Tratamento/genética , Transtorno Depressivo Resistente a Tratamento/terapia , Feminino , Masculino , Transtorno Depressivo Maior/genética , Transtorno Depressivo Maior/terapia , Pessoa de Meia-Idade , Aprendizado de Máquina , Adulto , Fenótipo , Idoso , Índice de Massa Corporal , Esquizofrenia/genética , Esquizofrenia/terapia
4.
Schizophr Bull ; 2024 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-38728421

RESUMO

BACKGROUND AND HYPOTHESIS: Psychosis-associated diagnostic codes are increasingly being utilized as case definitions for electronic health record (EHR)-based algorithms to predict and detect psychosis. However, data on the validity of psychosis-related diagnostic codes is limited. We evaluated the positive predictive value (PPV) of International Classification of Diseases (ICD) codes for psychosis. STUDY DESIGN: Using EHRs at 3 health systems, ICD codes comprising primary psychotic disorders and mood disorders with psychosis were grouped into 5 higher-order groups. 1133 records were sampled for chart review using the full EHR. PPVs (the probability of chart-confirmed psychosis given ICD psychosis codes) were calculated across multiple treatment settings. STUDY RESULTS: PPVs across all diagnostic groups and hospital systems exceeded 70%: Mass General Brigham 0.72 [95% CI 0.68-0.77], Boston Children's Hospital 0.80 [0.75-0.84], and Boston Medical Center 0.83 [0.79-0.86]. Schizoaffective disorder PPVs were consistently the highest across sites (0.80-0.92) and major depressive disorder with psychosis were the most variable (0.57-0.79). To determine if the first documented code captured first-episode psychosis (FEP), we excluded cases with prior chart evidence of a diagnosis of or treatment for a psychotic illness, yielding substantially lower PPVs (0.08-0.62). CONCLUSIONS: We found that the first documented psychosis diagnostic code accurately captured true episodes of psychosis but was a poor index of FEP. These data have important implications for the case definitions used in the development of risk prediction models designed to predict or detect undiagnosed psychosis.

5.
medRxiv ; 2024 Feb 29.
Artigo em Inglês | MEDLINE | ID: mdl-38464074

RESUMO

Background and Hypothesis: Early detection of psychosis is critical for improving outcomes. Algorithms to predict or detect psychosis using electronic health record (EHR) data depend on the validity of the case definitions used, typically based on diagnostic codes. Data on the validity of psychosis-related diagnostic codes is limited. We evaluated the positive predictive value (PPV) of International Classification of Diseases (ICD) codes for psychosis. Study Design: Using EHRs at three health systems, ICD codes comprising primary psychotic disorders and mood disorders with psychosis were grouped into five higher-order groups. 1,133 records were sampled for chart review using the full EHR. PPVs (the probability of chart-confirmed psychosis given ICD psychosis codes) were calculated across multiple treatment settings. Study Results: PPVs across all diagnostic groups and hospital systems exceeded 70%: Massachusetts General Brigham 0.72 [95% CI 0.68-0.77], Boston Children's Hospital 0.80 [0.75-0.84], and Boston Medical Center 0.83 [0.79-0.86]. Schizoaffective disorder PPVs were consistently the highest across sites (0.80-0.92) and major depressive disorder with psychosis were the most variable (0.57-0.79). To determine if the first documented code captured first-episode psychosis (FEP), we excluded cases with prior chart evidence of a diagnosis of or treatment for a psychotic illness, yielding substantially lower PPVs (0.08-0.62). Conclusions: We found that the first documented psychosis diagnostic code accurately captured true episodes of psychosis but was a poor index of FEP. These data have important implications for the development of risk prediction models designed to predict or detect undiagnosed psychosis.

6.
Appl Clin Inform ; 15(2): 250-264, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38359876

RESUMO

BACKGROUND: Timelines have been used for patient review. While maintaining a compact overview is important, merged event representations caused by the intricate and voluminous patient data bring event recognition, access ambiguity, and inefficient interaction problems. Handling large patient data efficiently is another challenge. OBJECTIVE: This study aims to develop a scalable, efficient timeline to enhance patient review for research purposes. The focus is on addressing the challenges presented by the intricate and voluminous patient data. METHODS: We propose a high-throughput, space-efficient HistoriView timeline for an individual patient. For a compact overview, it uses nonstacking event representation. An overlay detection algorithm, y-shift visualization, and popup-based interaction facilitate comprehensive analysis of overlapping datasets. An i2b2 HistoriView plugin was deployed, using split query and event reduction approaches, delivering the entire history efficiently without losing information. For evaluation, 11 participants completed a usability survey and a preference survey, followed by qualitative feedback. To evaluate scalability, 100 randomly selected patients over 60 years old were tested on the plugin and were compared with a baseline visualization. RESULTS: Most participants found that HistoriView was easy to use and learn and delivered information clearly without zooming. All preferred HistoriView over a stacked timeline. They expressed satisfaction on display, ease of learning and use, and efficiency. However, challenges and suggestions for improvement were also identified. In the performance test, the largest patient had 32,630 records, which exceeds the baseline limit. HistoriView reduced it to 2,019 visual artifacts. All patients were pulled and visualized within 45.40 seconds. Visualization took less than 3 seconds for all. DISCUSSION AND CONCLUSION: HistoriView allows complete data exploration without exhaustive interactions in a compact overview. It is useful for dense data or iterative comparisons. However, issues in exploring subconcept records were reported. HistoriView handles large patient data preserving original information in a reasonable time.


Assuntos
Algoritmos , Aprendizagem , Humanos , Pessoa de Meia-Idade , Satisfação Pessoal , Pacientes
7.
Obesity (Silver Spring) ; 32(5): 969-978, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38351665

RESUMO

OBJECTIVE: The objective of this study is to determine whether in utero exposure to SARS-CoV-2 is associated with increased risk for a cardiometabolic diagnosis by 18 months of age. METHODS: This retrospective electronic health record (EHR)-based cohort study included the live-born offspring of all individuals who delivered during the COVID-19 pandemic (April 1, 2020-December 31, 2021) at eight hospitals in Massachusetts. Offspring exposure was defined as a positive maternal SARS-CoV-2 polymerase chain reaction test during pregnancy. The primary outcome was presence of an ICD-10 code for a cardiometabolic disorder in offspring EHR by 18 months. Weight-, length-, and BMI-for-age z scores were calculated and compared at 6-month intervals from birth to 18 months. RESULTS: A total of 29,510 offspring (1599 exposed and 27,911 unexposed) were included. By 18 months, 6.7% of exposed and 4.4% of unexposed offspring had received a cardiometabolic diagnosis (crude odds ratio [OR] 1.47 [95% CI: 1.10 to 1.94], p = 0.007; adjusted OR 1.38 [1.06 to 1.77], p = 0.01). Exposed offspring had a significantly greater mean BMI-for-age z score versus unexposed offspring at 6 months (z score difference 0.19 [95% CI: 0.10 to 0.29], p < 0.001; adjusted difference 0.04 [-0.06 to 0.13], p = 0.4). CONCLUSIONS: Exposure to maternal SARS-CoV-2 infection was associated with an increased risk of receiving a cardiometabolic diagnosis by 18 months preceded by greater BMI-for-age at 6 months.


Assuntos
COVID-19 , Complicações Infecciosas na Gravidez , Efeitos Tardios da Exposição Pré-Natal , SARS-CoV-2 , Humanos , Feminino , COVID-19/epidemiologia , Gravidez , Estudos Retrospectivos , Lactente , Adulto , Masculino , Complicações Infecciosas na Gravidez/virologia , Complicações Infecciosas na Gravidez/epidemiologia , Massachusetts/epidemiologia , Recém-Nascido , Índice de Massa Corporal , Fatores de Risco Cardiometabólico , Desenvolvimento Infantil , Doenças Metabólicas/epidemiologia , Doenças Metabólicas/etiologia
8.
Patterns (N Y) ; 5(1): 100906, 2024 Jan 12.
Artigo em Inglês | MEDLINE | ID: mdl-38264714

RESUMO

Electronic health record (EHR) data are increasingly used to support real-world evidence studies but are limited by the lack of precise timings of clinical events. Here, we propose a label-efficient incident phenotyping (LATTE) algorithm to accurately annotate the timing of clinical events from longitudinal EHR data. By leveraging the pre-trained semantic embeddings, LATTE selects predictive features and compresses their information into longitudinal visit embeddings through visit attention learning. LATTE models the sequential dependency between the target event and visit embeddings to derive the timings. To improve label efficiency, LATTE constructs longitudinal silver-standard labels from unlabeled patients to perform semi-supervised training. LATTE is evaluated on the onset of type 2 diabetes, heart failure, and relapses of multiple sclerosis. LATTE consistently achieves substantial improvements over benchmark methods while providing high prediction interpretability. The event timings are shown to help discover risk factors of heart failure among patients with rheumatoid arthritis.

9.
Transl Psychiatry ; 14(1): 58, 2024 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-38272862

RESUMO

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Network across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and valid with multiple algorithms at each study site: random forests, gradient boosting machines, penalized regression, including stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82-0.87) and calibration performance with positive predictive values above 5% in the highest risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Network website.


Assuntos
Transtorno Bipolar , Humanos , Transtorno Bipolar/diagnóstico , Estudos de Casos e Controles , Medição de Risco/métodos , Aprendizado de Máquina , Registros Eletrônicos de Saúde
10.
Psychiatry Res ; 323: 115175, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37003169

RESUMO

Growing evidence has shown that applying machine learning models to large clinical data sources may exceed clinician performance in suicide risk stratification. However, many existing prediction models either suffer from "temporal bias" (a bias that stems from using case-control sampling) or require training on all available patient visit data. Here, we adopt a "landmark model" framework that aligns with clinical practice for prediction of suicide-related behaviors (SRBs) using a large electronic health record database. Using the landmark approach, we developed models for SRB prediction (regularized Cox regression and random survival forest) that establish a time-point (e.g., clinical visit) from which predictions are made over user-specified prediction windows using historical information up to that point. We applied this approach to cohorts from three clinical settings: general outpatient, psychiatric emergency department, and psychiatric inpatients, for varying prediction windows and lengths of historical data. Models achieved high discriminative performance (area under the Receiver Operating Characteristic curve 0.74-0.93 for the Cox model) across different prediction windows and settings, even with relatively short periods of historical data. In short, we developed accurate, dynamic SRB risk prediction models with the landmark approach that reduce bias and enhance the reliability and portability of suicide risk prediction models.


Assuntos
Serviço Hospitalar de Emergência , Tentativa de Suicídio , Humanos , Tentativa de Suicídio/psicologia , Reprodutibilidade dos Testes , Curva ROC
11.
medRxiv ; 2023 Feb 26.
Artigo em Inglês | MEDLINE | ID: mdl-36865341

RESUMO

Bipolar disorder is a leading contributor to disability, premature mortality, and suicide. Early identification of risk for bipolar disorder using generalizable predictive models trained on diverse cohorts around the United States could improve targeted assessment of high risk individuals, reduce misdiagnosis, and improve the allocation of limited mental health resources. This observational case-control study intended to develop and validate generalizable predictive models of bipolar disorder as part of the multisite, multinational PsycheMERGE Consortium across diverse and large biobanks with linked electronic health records (EHRs) from three academic medical centers: in the Northeast (Massachusetts General Brigham), the Mid-Atlantic (Geisinger) and the Mid-South (Vanderbilt University Medical Center). Predictive models were developed and validated with multiple algorithms at each study site: random forests, gradient boosting machines, penalized regression, including stacked ensemble learning algorithms combining them. Predictors were limited to widely available EHR-based features agnostic to a common data model including demographics, diagnostic codes, and medications. The main study outcome was bipolar disorder diagnosis as defined by the International Cohort Collection for Bipolar Disorder, 2015. In total, the study included records for 3,529,569 patients including 12,533 cases (0.3%) of bipolar disorder. After internal and external validation, algorithms demonstrated optimal performance in their respective development sites. The stacked ensemble achieved the best combination of overall discrimination (AUC = 0.82 - 0.87) and calibration performance with positive predictive values above 5% in the highest risk quantiles at all three study sites. In conclusion, generalizable predictive models of risk for bipolar disorder can be feasibly developed across diverse sites to enable precision medicine. Comparison of a range of machine learning methods indicated that an ensemble approach provides the best performance overall but required local retraining. These models will be disseminated via the PsycheMERGE Consortium website.

12.
JAMA Netw Open ; 6(3): e234415, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36951861

RESUMO

Importance: Prior studies using large registries have suggested a modest increase in risk for neurodevelopmental diagnoses among children of mothers with immune activation during pregnancy, and such risk may be sex-specific. Objective: To determine whether in utero exposure to SARS-CoV-2 is associated with sex-specific risk for neurodevelopmental disorders up to 18 months after birth, compared with unexposed offspring born during or prior to the COVID-19 pandemic period. Design, Setting, and Participants: This retrospective cohort study included the live offspring of all mothers who delivered between January 1 and December 31, 2018 (born and followed up before the COVID-19 pandemic), between March 1 and December 31, 2019 (born before and followed up during the COVID-19 pandemic), and between March 1, 2020, and May 31, 2021 (born and followed up during the COVID-19 pandemic). Offspring were born at any of 8 hospitals across 2 health systems in Massachusetts. Exposures: Polymerase chain reaction evidence of maternal SARS-CoV-2 infection during pregnancy. Main Outcomes and Measures: Electronic health record documentation of International Statistical Classification of Diseases and Related Health Problems, Tenth Revision diagnostic codes corresponding to neurodevelopmental disorders. Results: The COVID-19 pandemic cohort included 18 355 live births (9399 boys [51.2%]), including 883 (4.8%) with maternal SARS-CoV-2 positivity during pregnancy. The cohort included 1809 Asian individuals (9.9%), 1635 Black individuals (8.9%), 12 718 White individuals (69.3%), and 1714 individuals (9.3%) who were of other race (American Indian or Alaska Native, Native Hawaiian or other Pacific Islander, more than 1 race); 2617 individuals (14.3%) were of Hispanic ethnicity. Mean maternal age was 33.0 (IQR, 30.0-36.0) years. In adjusted regression models accounting for race, ethnicity, insurance status, hospital type (academic center vs community), maternal age, and preterm status, maternal SARS-CoV-2 positivity was associated with a statistically significant elevation in risk for neurodevelopmental diagnoses at 12 months among male offspring (adjusted OR, 1.94 [95% CI 1.12-3.17]; P = .01) but not female offspring (adjusted OR, 0.89 [95% CI, 0.39-1.76]; P = .77). Similar effects were identified using matched analyses in lieu of regression. At 18 months, more modest effects were observed in male offspring (adjusted OR, 1.42 [95% CI, 0.92-2.11]; P = .10). Conclusions and Relevance: In this cohort study of offspring with SARS-CoV-2 exposure in utero, such exposure was associated with greater magnitude of risk for neurodevelopmental diagnoses among male offspring at 12 months following birth. As with prior studies of maternal infection, substantially larger cohorts and longer follow-up will be required to reliably estimate or refute risk.


Assuntos
COVID-19 , Gravidez , Criança , Feminino , Recém-Nascido , Humanos , Masculino , Adulto , COVID-19/epidemiologia , Estudos de Coortes , SARS-CoV-2 , Estudos Retrospectivos , Pandemias
13.
Sci Rep ; 13(1): 1971, 2023 02 03.
Artigo em Inglês | MEDLINE | ID: mdl-36737471

RESUMO

The electronic Medical Records and Genomics (eMERGE) Network assessed the feasibility of deploying portable phenotype rule-based algorithms with natural language processing (NLP) components added to improve performance of existing algorithms using electronic health records (EHRs). Based on scientific merit and predicted difficulty, eMERGE selected six existing phenotypes to enhance with NLP. We assessed performance, portability, and ease of use. We summarized lessons learned by: (1) challenges; (2) best practices to address challenges based on existing evidence and/or eMERGE experience; and (3) opportunities for future research. Adding NLP resulted in improved, or the same, precision and/or recall for all but one algorithm. Portability, phenotyping workflow/process, and technology were major themes. With NLP, development and validation took longer. Besides portability of NLP technology and algorithm replicability, factors to ensure success include privacy protection, technical infrastructure setup, intellectual property agreement, and efficient communication. Workflow improvements can improve communication and reduce implementation time. NLP performance varied mainly due to clinical document heterogeneity; therefore, we suggest using semi-structured notes, comprehensive documentation, and customization options. NLP portability is possible with improved phenotype algorithm performance, but careful planning and architecture of the algorithms is essential to support local customizations.


Assuntos
Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Genômica , Algoritmos , Fenótipo
14.
PLoS One ; 18(2): e0277483, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36795700

RESUMO

Several recent studies have applied machine learning techniques to develop risk algorithms that predict subsequent suicidal behavior based on electronic health record data. In this study we used a retrospective cohort study design to test whether developing more tailored predictive models-within specific subpopulations of patients-would improve predictive accuracy. A retrospective cohort of 15,117 patients diagnosed with multiple sclerosis (MS), a diagnosis associated with increased risk of suicidal behavior, was used. The cohort was randomly divided into equal sized training and validation sets. Overall, suicidal behavior was identified among 191 (1.3%) of the patients with MS. A Naïve Bayes Classifier model was trained on the training set to predict future suicidal behavior. With 90% specificity, the model detected 37% of subjects who later demonstrated suicidal behavior, on average 4.6 years before the first suicide attempt. The performance of a model trained only on MS patients was better at predicting suicide in MS patients than that a model trained on a general patient sample of a similar size (AUC of 0.77 vs. 0.66). Unique risk factors for suicidal behavior among patients with MS included pain-related codes, gastroenteritis and colitis, and history of smoking. Future studies are needed to further test the value of developing population-specific risk models.


Assuntos
Esclerose Múltipla , Ideação Suicida , Humanos , Teorema de Bayes , Estudos Retrospectivos , Tentativa de Suicídio
15.
JAMA Psychiatry ; 80(3): 230-240, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36652267

RESUMO

Importance: The months after psychiatric hospital discharge are a time of high risk for suicide. Intensive postdischarge case management, although potentially effective in suicide prevention, is likely to be cost-effective only if targeted at high-risk patients. A previously developed machine learning (ML) model showed that postdischarge suicides can be predicted from electronic health records and geospatial data, but it is unknown if prediction could be improved by adding additional information. Objective: To determine whether model prediction could be improved by adding information extracted from clinical notes and public records. Design, Setting, and Participants: Models were trained to predict suicides in the 12 months after Veterans Health Administration (VHA) short-term (less than 365 days) psychiatric hospitalizations between the beginning of 2010 and September 1, 2012 (299 050 hospitalizations, with 916 hospitalizations followed within 12 months by suicides) and tested in the hospitalizations from September 2, 2012, to December 31, 2013 (149 738 hospitalizations, with 393 hospitalizations followed within 12 months by suicides). Validation focused on net benefit across a range of plausible decision thresholds. Predictor importance was assessed with Shapley additive explanations (SHAP) values. Data were analyzed from January to August 2022. Main Outcomes and Measures: Suicides were defined by the National Death Index. Base model predictors included VHA electronic health records and patient residential data. The expanded predictors came from natural language processing (NLP) of clinical notes and a social determinants of health (SDOH) public records database. Results: The model included 448 788 unique hospitalizations. Net benefit over risk horizons between 3 and 12 months was generally highest for the model that included both NLP and SDOH predictors (area under the receiver operating characteristic curve range, 0.747-0.780; area under the precision recall curve relative to the suicide rate range, 3.87-5.75). NLP and SDOH predictors also had the highest predictor class-level SHAP values (proportional SHAP = 64.0% and 49.3%, respectively), although the single highest positive variable-level SHAP value was for a count of medications classified by the US Food and Drug Administration as increasing suicide risk prescribed the year before hospitalization (proportional SHAP = 15.0%). Conclusions and Relevance: In this study, clinical notes and public records were found to improve ML model prediction of suicide after psychiatric hospitalization. The model had positive net benefit over 3-month to 12-month risk horizons for plausible decision thresholds. Although caution is needed in inferring causality based on predictor importance, several key predictors have potential intervention implications that should be investigated in future studies.


Assuntos
Prevenção do Suicídio , Suicídio , Humanos , Suicídio/psicologia , Alta do Paciente , Pacientes Internados , Assistência ao Convalescente
16.
medRxiv ; 2022 Nov 18.
Artigo em Inglês | MEDLINE | ID: mdl-36415457

RESUMO

Importance: Prior studies using large registries suggested a modest increase in risk for neurodevelopmental diagnoses among children of mothers with immune activation during pregnancy, and such risk may be sex-specific. Objective: To determine whether in utero exposure to the novel coronavirus SARS-CoV-2 is associated with sex-specific risk for neurodevelopmental disorders up to 18 months after birth, compared to unexposed offspring born during or prior to the pandemic period. Design: Retrospective cohort. Participants: Live offspring of all mothers who delivered between March 2018 and May 2021 at any of eight hospitals across two health systems in Massachusetts. Exposure: PCR evidence of maternal SARS-CoV-2 infection during pregnancy. Main Outcome and Measures: Electronic health record documentation of ICD-10 diagnostic codes corresponding to neurodevelopmental disorders. Results: The pandemic cohort included 18,323 live births, including 877 (4.8%) to individuals with SARS-CoV-2 positivity during pregnancy. The cohort included 1806 (9.9%) Asian individuals, 1634 (8.9%) Black individuals, 1711 (9.3%) individuals of another race, and 12,694 (69%) White individuals; 2614 (14%) were of Hispanic ethnicity. Mean maternal age was 33.0 years (IQR 30.0-36.0). In adjusted regression models accounting for race, ethnicity, insurance status, hospital type (academic center vs. community), maternal age, and preterm status, SARS-CoV-2 positivity was associated with statistically significant elevation in risk for neurodevelopmental diagnoses among male offspring (adjusted OR 1.99, 95% CI 1.19-3.34; p=0.009) but not female offspring (adjusted OR 0.90, 95% CI 0.43-1.88; p=0.8). Similar effects were identified using matched analyses in lieu of regression. Conclusion and Relevance: SARS-CoV-2 exposure in utero was associated with greater magnitude of risk for neurodevelopmental diagnoses among male offspring in the 12 months following birth. As with prior studies of maternal infection, substantially larger cohorts and longer follow-up will be required to reliably estimate or refute risk. Trial Registration: NA. Key Points: Question: Are rates of neurodevelopmental disorder diagnoses greater among male or female children with COVID-19 exposure in utero compared to those with no such exposure?Findings: In a cohort of 18,323 infants delivered after February 2020, males but not females born to mothers with a positive SARS-CoV-2 PCR test during pregnancy were more likely to receive a neurodevelopmental diagnosis in the first 12 months after delivery, even after accounting for preterm delivery.Meaning: These findings suggest that male offspring exposed to COVID-19 in utero may be at increased risk for neurodevelopmental disorders.

17.
Methods Inf Med ; 61(5-06): 167-173, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36070785

RESUMO

OBJECTIVE: To provide high-quality data for coronavirus disease 2019 (COVID-19) research, we validated derived COVID-19 clinical indicators and 22 associated machine learning phenotypes, in the Mass General Brigham (MGB) COVID-19 Data Mart. METHODS: Fifteen reviewers performed a retrospective manual chart review for 150 COVID-19-positive patients in the data mart. To support rapid chart review for a wide range of target data, we offered a natural language processing (NLP)-based chart review tool, the Digital Analytic Patient Reviewer (DAPR). For this work, we designed a dedicated patient summary view and developed new 127 NLP logics to extract COVID-19 relevant medical concepts and target phenotypes. Moreover, we transformed DAPR for research purposes so that patient information is used for an approved research purpose only and enabled fast access to the integrated patient information. Lastly, we performed a survey to evaluate the validation difficulty and usefulness of the DAPR. RESULTS: The concepts for COVID-19-positive cohort, COVID-19 index date, COVID-19-related admission, and the admission date were shown to have high values in all evaluation metrics. However, three phenotypes showed notable performance degradation than the positive predictive value in the prepandemic population. Based on these results, we removed the three phenotypes from our data mart. In the survey about using the tool, participants expressed positive attitudes toward using DAPR for chart review. They assessed that the validation was easy and DAPR helped find relevant information. Some validation difficulties were also discussed. CONCLUSION: Use of NLP technology in the chart review helped to cope with the challenges of the COVID-19 data validation task and accelerated the process. As a result, we could provide more reliable research data promptly and respond to the COVID-19 crisis. DAPR's benefit can be expanded to other domains. We plan to operationalize it for wider research groups.


Assuntos
COVID-19 , Humanos , Estudos Retrospectivos , Data Warehousing , Processamento de Linguagem Natural , Confiabilidade dos Dados
18.
J Biomed Inform ; 133: 104147, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-35872266

RESUMO

OBJECTIVE: The growing availability of electronic health records (EHR) data opens opportunities for integrative analysis of multi-institutional EHR to produce generalizable knowledge. A key barrier to such integrative analyses is the lack of semantic interoperability across different institutions due to coding differences. We propose a Multiview Incomplete Knowledge Graph Integration (MIKGI) algorithm to integrate information from multiple sources with partially overlapping EHR concept codes to enable translations between healthcare systems. METHODS: The MIKGI algorithm combines knowledge graph information from (i) embeddings trained from the co-occurrence patterns of medical codes within each EHR system and (ii) semantic embeddings of the textual strings of all medical codes obtained from the Self-Aligning Pretrained BERT (SAPBERT) algorithm. Due to the heterogeneity in the coding across healthcare systems, each EHR source provides partial coverage of the available codes. MIKGI synthesizes the incomplete knowledge graphs derived from these multi-source embeddings by minimizing a spherical loss function that combines the pairwise directional similarities of embeddings computed from all available sources. MIKGI outputs harmonized semantic embedding vectors for all EHR codes, which improves the quality of the embeddings and enables direct assessment of both similarity and relatedness between any pair of codes from multiple healthcare systems. RESULTS: With EHR co-occurrence data from Veteran Affairs (VA) healthcare and Mass General Brigham (MGB), MIKGI algorithm produces high quality embeddings for a variety of downstream tasks including detecting known similar or related entity pairs and mapping VA local codes to the relevant EHR codes used at MGB. Based on the cosine similarity of the MIKGI trained embeddings, the AUC was 0.918 for detecting similar entity pairs and 0.809 for detecting related pairs. For cross-institutional medical code mapping, the top 1 and top 5 accuracy were 91.0% and 97.5% when mapping medication codes at VA to RxNorm medication codes at MGB; 59.1% and 75.8% when mapping VA local laboratory codes to LOINC hierarchy. When trained with 500 labels, the lab code mapping attained top 1 and 5 accuracy at 77.7% and 87.9%. MIKGI also attained best performance in selecting VA local lab codes for desired laboratory tests and COVID-19 related features for COVID EHR studies. Compared to existing methods, MIKGI attained the most robust performance with accuracy the highest or near the highest across all tasks. CONCLUSIONS: The proposed MIKGI algorithm can effectively integrate incomplete summary data from biomedical text and EHR data to generate harmonized embeddings for EHR codes for knowledge graph modeling and cross-institutional translation of EHR codes.


Assuntos
COVID-19 , Registros Eletrônicos de Saúde , Algoritmos , Humanos , Logical Observation Identifiers Names and Codes , Reconhecimento Automatizado de Padrão
19.
JAMA Netw Open ; 5(6): e2215787, 2022 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-35679048

RESUMO

Importance: Epidemiologic studies suggest maternal immune activation during pregnancy may be associated with neurodevelopmental effects in offspring. Objective: To evaluate whether in utero exposure to SARS-CoV-2 is associated with risk for neurodevelopmental disorders in the first 12 months after birth. Design, Setting, and Participants: This retrospective cohort study examined live offspring of all mothers who delivered between March and September 2020 at any of 6 Massachusetts hospitals across 2 health systems. Statistical analysis was performed from October to December 2021. Exposures: Maternal SARS-CoV-2 infection confirmed by a polymerase chain reaction test during pregnancy. Main Outcomes and Measures: Neurodevelopmental disorders determined from International Statistical Classification of Diseases and Related Health Problems, Tenth Revision (ICD-10) diagnostic codes over the first 12 months of life; sociodemographic and clinical features of mothers and offspring; all drawn from the electronic health record. Results: The cohort included 7772 live births (7466 pregnancies, 96% singleton, 222 births to SARS-CoV-2 positive mothers), with mean (SD) maternal age of 32.9 (5.0) years; offspring were 9.9% Asian (772), 8.4% Black (656), and 69.0% White (5363); 15.1% (1134) were of Hispanic ethnicity. Preterm delivery was more likely among exposed mothers: 14.4% (32) vs 8.7% (654) (P = .003). Maternal SARS-CoV-2 positivity during pregnancy was associated with greater rate of neurodevelopmental diagnoses in unadjusted models (odds ratio [OR], 2.17 [95% CI, 1.24-3.79]; P = .006) as well as those adjusted for race, ethnicity, insurance status, offspring sex, maternal age, and preterm status (adjusted OR, 1.86 [95% CI, 1.03-3.36]; P = .04). Third-trimester infection was associated with effects of larger magnitude (adjusted OR, 2.34 [95% CI, 1.23-4.44]; P = .01). Conclusions and Relevance: This cohort study of SARS-CoV-2 exposure in utero found preliminary evidence that maternal SARS-CoV-2 may be associated with neurodevelopmental sequelae in some offspring. Prospective studies with longer follow-up duration will be required to exclude confounding and confirm these associations.


Assuntos
COVID-19 , SARS-CoV-2 , Adulto , COVID-19/diagnóstico , COVID-19/epidemiologia , Estudos de Coortes , Feminino , Humanos , Lactente , Recém-Nascido , Mães , Gravidez , Estudos Prospectivos , Estudos Retrospectivos
20.
Mol Psychiatry ; 27(9): 3898-3903, 2022 09.
Artigo em Inglês | MEDLINE | ID: mdl-35705635

RESUMO

Neuropsychiatric symptoms may persist following acute COVID-19 illness, but the extent to which these symptoms are specific to COVID-19 has not been established. We utilized electronic health records across 6 hospitals in Massachusetts to characterize cohorts of individuals discharged following admission for COVID-19 between March 2020 and May 2021, and compared them to individuals hospitalized for other indications during this period. Natural language processing was applied to narrative clinical notes to identify neuropsychiatric symptom domains up to 150 days following hospitalization, in addition to those reflected in diagnostic codes as measured in prior studies. Among 6619 individuals hospitalized for COVID-19 drawn from a total of 42,961 hospital discharges, the most commonly-documented symptom domains between 31 and 90 days after initial positive test were fatigue (13.4%), mood and anxiety symptoms (11.2%), and impaired cognition (8.0%). In regression models adjusted for sociodemographic features and hospital course, none of these were significantly more common among COVID-19 patients; indeed, mood and anxiety symptoms were less frequent (adjusted OR 0.72 95% CI 0.64-0.92). Between 91 and 150 days after positivity, most commonly-detected symptoms were fatigue (10.9%), mood and anxiety symptoms (8.2%), and sleep disruption (6.8%), with impaired cognition in 5.8%. Frequency was again similar among non-COVID-19 post-hospital patients, with mood and anxiety symptoms less common (aOR 0.63, 95% CI 0.52-0.75). Propensity-score matched analyses yielded similar results. Overall, neuropsychiatric symptoms were common up to 150 days after initial hospitalization, but occurred at generally similar rates among individuals hospitalized for other indications during the same period. Post-acute sequelae of COVID-19 may benefit from standard if less-specific treatments developed for rehabilitation after hospitalization.


Assuntos
COVID-19 , Humanos , Estudos de Casos e Controles , Registros Eletrônicos de Saúde , Hospitalização , Fadiga
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...