Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 43
Filtrar
1.
N C Med J ; 85(4): 256-259, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-39466095

RESUMO

As a biomedical data scientist, when I think of the future of artificial intelligence in health care, the potential fills me with both excitement and caution. A promising area of innovation, AI can be used to assess the impact of social determinants of health on health outcomes, though more standardization is needed.


Assuntos
Inteligência Artificial , Determinantes Sociais da Saúde , Humanos , Atenção à Saúde
2.
Bioinformatics ; 38(12): 3252-3258, 2022 06 13.
Artigo em Inglês | MEDLINE | ID: mdl-35441678

RESUMO

MOTIVATION: As the number of public data resources continues to proliferate, identifying relevant datasets across heterogenous repositories is becoming critical to answering scientific questions. To help researchers navigate this data landscape, we developed Dug: a semantic search tool for biomedical datasets utilizing evidence-based relationships from curated knowledge graphs to find relevant datasets and explain why those results are returned. RESULTS: Developed through the National Heart, Lung and Blood Institute's (NHLBI) BioData Catalyst ecosystem, Dug has indexed more than 15 911 study variables from public datasets. On a manually curated search dataset, Dug's total recall (total relevant results/total results) of 0.79 outperformed default Elasticsearch's total recall of 0.76. When using synonyms or related concepts as search queries, Dug (0.36) far outperformed Elasticsearch (0.14) in terms of total recall with no significant loss in the precision of its top results. AVAILABILITY AND IMPLEMENTATION: Dug is freely available at https://github.com/helxplatform/dug. An example Dug deployment is also available for use at https://search.biodatacatalyst.renci.org/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Ferramenta de Busca , Semântica , Ecossistema , Indexação e Redação de Resumos
4.
J Med Internet Res ; 23(10): e31400, 2021 10 11.
Artigo em Inglês | MEDLINE | ID: mdl-34533459

RESUMO

BACKGROUND: Many countries have experienced 2 predominant waves of COVID-19-related hospitalizations. Comparing the clinical trajectories of patients hospitalized in separate waves of the pandemic enables further understanding of the evolving epidemiology, pathophysiology, and health care dynamics of the COVID-19 pandemic. OBJECTIVE: In this retrospective cohort study, we analyzed electronic health record (EHR) data from patients with SARS-CoV-2 infections hospitalized in participating health care systems representing 315 hospitals across 6 countries. We compared hospitalization rates, severe COVID-19 risk, and mean laboratory values between patients hospitalized during the first and second waves of the pandemic. METHODS: Using a federated approach, each participating health care system extracted patient-level clinical data on their first and second wave cohorts and submitted aggregated data to the central site. Data quality control steps were adopted at the central site to correct for implausible values and harmonize units. Statistical analyses were performed by computing individual health care system effect sizes and synthesizing these using random effect meta-analyses to account for heterogeneity. We focused the laboratory analysis on C-reactive protein (CRP), ferritin, fibrinogen, procalcitonin, D-dimer, and creatinine based on their reported associations with severe COVID-19. RESULTS: Data were available for 79,613 patients, of which 32,467 were hospitalized in the first wave and 47,146 in the second wave. The prevalence of male patients and patients aged 50 to 69 years decreased significantly between the first and second waves. Patients hospitalized in the second wave had a 9.9% reduction in the risk of severe COVID-19 compared to patients hospitalized in the first wave (95% CI 8.5%-11.3%). Demographic subgroup analyses indicated that patients aged 26 to 49 years and 50 to 69 years; male and female patients; and black patients had significantly lower risk for severe disease in the second wave than in the first wave. At admission, the mean values of CRP were significantly lower in the second wave than in the first wave. On the seventh hospital day, the mean values of CRP, ferritin, fibrinogen, and procalcitonin were significantly lower in the second wave than in the first wave. In general, countries exhibited variable changes in laboratory testing rates from the first to the second wave. At admission, there was a significantly higher testing rate for D-dimer in France, Germany, and Spain. CONCLUSIONS: Patients hospitalized in the second wave were at significantly lower risk for severe COVID-19. This corresponded to mean laboratory values in the second wave that were more likely to be in typical physiological ranges on the seventh hospital day compared to the first wave. Our federated approach demonstrated the feasibility and power of harmonizing heterogeneous EHR data from multiple international health care systems to rapidly conduct large-scale studies to characterize how COVID-19 clinical trajectories evolve.


Assuntos
COVID-19 , Pandemias , Adulto , Idoso , Feminino , Hospitalização , Hospitais , Humanos , Masculino , Pessoa de Meia-Idade , Estudos Retrospectivos , SARS-CoV-2
5.
J Manipulative Physiol Ther ; 39(4): 279-87, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-27072513

RESUMO

OBJECTIVE: The purpose of this study was to determine electromyographic threshold parameters that most reliably characterize the muscular response to spinal manipulation and compare 2 methods that detect muscle activity onset delay: the double-threshold method and cross-correlation method. METHODS: Surface and indwelling electromyography were recorded during lumbar side-lying manipulations in 17 asymptomatic participants. Muscle activity onset delays in relation to the thrusting force were compared across methods and muscles using a generalized linear model. RESULTS: The threshold combinations that resulted in the lowest Detection Failures were the "8 SD-0 milliseconds" threshold (Detection Failures = 8) and the "8 SD-10 milliseconds" threshold (Detection Failures = 9). The average muscle activity onset delay for the double-threshold method across all participants was 149 ± 152 milliseconds for the multifidus and 252 ± 204 milliseconds for the erector spinae. The average onset delay for the cross-correlation method was 26 ± 101 for the multifidus and 67 ± 116 for the erector spinae. There were no statistical interactions, and a main effect of method demonstrated that the delays were higher when using the double-threshold method compared with cross-correlation. CONCLUSIONS: The threshold parameters that best characterized activity onset delays were an 8-SD amplitude and a 10-millisecond duration threshold. The double-threshold method correlated well with visual supervision of muscle activity. The cross-correlation method provides several advantages in signal processing; however, supervision was required for some results, negating this advantage. These results help standardize methods when recording neuromuscular responses of spinal manipulation and improve comparisons within and across investigations.


Assuntos
Músculos do Dorso/fisiologia , Eletromiografia/métodos , Manipulação da Coluna , Contração Muscular/fisiologia , Reflexo/fisiologia , Adolescente , Adulto , Fenômenos Biomecânicos , Feminino , Humanos , Vértebras Lombares/fisiologia , Masculino , Pessoa de Meia-Idade , Fatores de Tempo , Adulto Jovem
6.
Res Sq ; 2024 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-38883709

RESUMO

Accurate identification of acute coronary syndrome (ACS) in the prehospital sestting is important for timely treatments that reduce damage to the compromised myocardium. Current machine learning approaches lack sufficient performance to safely rule-in or rule-out ACS. Our goal is to identify a method that bridges this gap. To do so, we retrospectively evaluate two promising approaches, an ensemble of gradient boosted decision trees (GBDT) and selective classification (SC) on consecutive patients transported by ambulance to the ED with chest pain and/or anginal equivalents. On the task of ACS classification with 23 prehospital covariates, we found the fusion of the two (GBDT+SC) improves the best reported sensitivity and specificity by 8% and 23% respectively. Accordingly, GBDT+SC is safer than current machine learning approaches to rule-in and rule-out of ACS in the prehospital setting.

7.
Ann Am Thorac Soc ; 2024 Nov 05.
Artigo em Inglês | MEDLINE | ID: mdl-39499775

RESUMO

RATIONALE: Epidemiologic studies on asthmatics and in vitro data suggest a protective role of T2 inflammation in SARS-CoV-2 infection. OBJECTIVE: Using a large, multisite cohort, we studied clinical outcomes following SARS-CoV-2 infection in multiple asthma endotypes and examined the effects of T2-directed biologics in infected asthmatics.in Methods: The National COVID Cohort Collaborative (N3C) Data Enclave was used to identify and stratify asthmatic patients by endotype to include non-T2 and T2 asthmatics, as well as exposure to T2-directed biologic therapy. We evaluated the risk of hospitalization, invasive mechanical ventilation, and 90-day mortality by endotype and exposure to biologics. RESULTS: For this study, 402,376 patients met inclusion criteria, of which 138,142 (34%) were characterized as non-T2 and 264,234 (66%) as T2 asthmatics, a group further divided into 104,823 (26%) atopic, 84,440 (21%) eosinophilic, and 74,971 (19%) T2-high asthmatic endotypes. Compared to non-T2 asthmatics, atopic and T2-high asthmatics experienced decreased odds of hospitalization, and 90-day mortality. Conversely, eosinophilic asthmatics experienced higher odds of hospitalization, intubation, and 90-day mortality. Exposure to T2-directed biologic therapies did not alter outcomes after propensity score matching. In contrast, maximum eosinophil count and recent systemic corticosteroid use were directly correlated with increased odds of all outcomes. CONCLUSIONS: COVID-19 outcomes differ depending on asthma endotype, with atopic asthmatics experiencing lower odds and eosinophilic asthmatics experiencing higher odds of deleterious outcomes. T2-directed biologic treatment did not alter these outcomes but recent systemic corticosteroid use predisposes all asthmatics patients to adverse outcomes. This article is open access and distributed under the terms of the Creative Commons Attribution Non-Commercial No Derivatives License 4.0 (http://creativecommons.org/licenses/by-nc-nd/4.0/).

8.
Front Public Health ; 12: 1347862, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38737862

RESUMO

The COVID-19 pandemic has necessitated the development of robust tools for tracking and modeling the spread of the virus. We present 'K-Track-Covid,' an interactive web-based dashboard developed using the R Shiny framework, to offer users an intuitive dashboard for analyzing the geographical and temporal spread of COVID-19 in South Korea. Our dashboard employs dynamic user interface elements, employs validated epidemiological models, and integrates regional data to offer tailored visual displays. The dashboard allows users to customize their data views by selecting specific time frames, geographic regions, and demographic groups. This customization enables the generation of charts and statistical summaries pertinent to both daily fluctuations and cumulative counts of COVID-19 cases, as well as mortality statistics. Additionally, the dashboard offers a simulation model based on mathematical models, enabling users to make predictions under various parameter settings. The dashboard is designed to assist researchers, policymakers, and the public in understanding the spread and impact of COVID-19, thereby facilitating informed decision-making. All data and resources related to this study are publicly available to ensure transparency and facilitate further research.


Assuntos
COVID-19 , Internet , Humanos , República da Coreia/epidemiologia , COVID-19/epidemiologia , SARS-CoV-2 , Interface Usuário-Computador , Pandemias , Modelos Epidemiológicos
9.
BMC Musculoskelet Disord ; 14: 322, 2013 Nov 14.
Artigo em Inglês | MEDLINE | ID: mdl-24228747

RESUMO

BACKGROUND: Because of symptoms, people with lumbar spinal stenosis (LSS) are often inactive, and this sedentary behaviour implies risk for diseases including obesity. Research has identified body mass index as the most powerful predictor of function in LSS. This suggests that function may be improved by targeting weight as a modifiable factor. An e-health lifestyle intervention was developed aimed at reducing fat mass and increasing physical activity in people with LSS. The main components of this intervention include pedometer-based physical activity promotion and nutrition education. METHODS/DESIGN: The Spinal Stenosis Pedometer and Nutrition Lifestyle INTERVENTION (SSPANLI) was developed and piloted with 10 individuals. The protocol for a randomized controlled trail comparing the SSPANLI intervention to usual non-surgical care follows. One hundred six (106) overweight or obese individuals with LSS will be recruited. Baseline and follow-up testing includes dual energy x-ray absorptiometry, blood draw, 3-day food record, 7-day accelerometry, questionnaire, maximal oxygen consumption, neurological exam, balance testing and a Self-Paced Walking Test. INTERVENTION: During Week 1, the intervention group will receive a pedometer, and a personalized consultation with both a Dietitian and an exercise specialist. For 12 weeks participants will log on to the e-health website to access personal step goals, walking maps, nutrition videos, and motivational quotes. Participants will also have access to in-person Coffee Talk meetings every 3 weeks, and meet with the Dietitian and exercise specialist at week 6. The control group will proceed with usual care for the 12-week period. Follow-up testing will occur at Weeks 13 and 24. DISCUSSION: This lifestyle intervention has the potential to provide a unique, non-surgical management option for people with LSS. Through decreased fat mass and increased function, we may reduce risk for obesity, chronic diseases of inactivity, and pain. The use of e-health interventions provides an opportunity for patients to become more involved in managing their own health. Behaviour changes including increased physical activity, and improved dietary habits promote overall health and quality of life, and may decrease future health care needs in this population. TRIAL REGISTRATION: Clinicaltrials.gov, NCT01902979.


Assuntos
Actigrafia/métodos , Promoção da Saúde/métodos , Avaliação Nutricional , Comportamento de Redução do Risco , Estenose Espinal/terapia , Registros de Dieta , Comportamento Alimentar/fisiologia , Feminino , Seguimentos , Humanos , Masculino , Pessoa de Meia-Idade , Atividade Motora/fisiologia , Projetos Piloto , Método Simples-Cego , Estenose Espinal/diagnóstico , Estenose Espinal/epidemiologia
10.
Health Informatics J ; 29(2): 14604582231170892, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37066514

RESUMO

The Integrated Clinical and Environmental Exposures Service (ICEES) provides open regulatory-compliant access to clinical data, including electronic health record data, that have been integrated with environmental exposures data. While ICEES has been validated in the context of an asthma use case and several other use cases, the regulatory constraints on the ICEES open application programming interface (OpenAPI) result in data loss when using the service for multivariate analysis. In this study, we investigated the robustness of the ICEES OpenAPI through a comparative analysis, in which we applied a generalized linear model (GLM) to the OpenAPI data and the constraint-free source data to examine factors predictive of asthma exacerbations. Consistent with previous studies, we found that the main predictors identified by both analyses were sex, prednisone, race, obesity, and airborne particulate exposure. Comparison of GLM model fit revealed that data loss impacts model quality, but only with select interaction terms. We conclude that the ICEES OpenAPI supports multivariate analysis, albeit with potential data loss that users should be aware of.


Assuntos
Asma , Registros Eletrônicos de Saúde , Humanos , Modelos Lineares , Exposição Ambiental , Software , Asma/epidemiologia
11.
J Am Med Inform Assoc ; 30(3): 447-455, 2023 02 16.
Artigo em Inglês | MEDLINE | ID: mdl-36451264

RESUMO

OBJECTIVE: This article describes the implementation of a privacy-preserving record linkage (PPRL) solution across PCORnet®, the National Patient-Centered Clinical Research Network. MATERIAL AND METHODS: Using a PPRL solution from Datavant, we quantified the degree of patient overlap across the network and report a de-duplicated analysis of the demographic and clinical characteristics of the PCORnet population. RESULTS: There were ∼170M patient records across the responding Network Partners, with ∼138M (81%) of those corresponding to a unique patient. 82.1% of patients were found in a single partner and 14.7% were in 2. The percentage overlap between Partners ranged between 0% and 80% with a median of 0%. Linking patients' electronic health records with claims increased disease prevalence in every clinical characteristic, ranging between 63% and 173%. DISCUSSION: The overlap between Partners was variable and depended on timeframe. However, patient data linkage changed the prevalence profile of the PCORnet patient population. CONCLUSIONS: This project was one of the largest linkage efforts of its kind and demonstrates the potential value of record linkage. Linkage between Partners may be most useful in cases where there is geographic proximity between Partners, an expectation that potential linkage Partners will be able to fill gaps in data, or a longer study timeframe.


Assuntos
Confidencialidade , Privacidade , Humanos , Registro Médico Coordenado , Segurança Computacional , Registros Eletrônicos de Saúde , Assistência Centrada no Paciente , Demografia
12.
J Am Med Inform Assoc ; 30(7): 1293-1300, 2023 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-37192819

RESUMO

Research increasingly relies on interrogating large-scale data resources. The NIH National Heart, Lung, and Blood Institute developed the NHLBI BioData CatalystⓇ (BDC), a community-driven ecosystem where researchers, including bench and clinical scientists, statisticians, and algorithm developers, find, access, share, store, and compute on large-scale datasets. This ecosystem provides secure, cloud-based workspaces, user authentication and authorization, search, tools and workflows, applications, and new innovative features to address community needs, including exploratory data analysis, genomic and imaging tools, tools for reproducibility, and improved interoperability with other NIH data science platforms. BDC offers straightforward access to large-scale datasets and computational resources that support precision medicine for heart, lung, blood, and sleep conditions, leveraging separately developed and managed platforms to maximize flexibility based on researcher needs, expertise, and backgrounds. Through the NHLBI BioData Catalyst Fellows Program, BDC facilitates scientific discoveries and technological advances. BDC also facilitated accelerated research on the coronavirus disease-2019 (COVID-19) pandemic.


Assuntos
COVID-19 , Computação em Nuvem , Humanos , Ecossistema , Reprodutibilidade dos Testes , Pulmão , Software
13.
IEEE/ACM Trans Comput Biol Bioinform ; 19(4): 1920-1932, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-34133284

RESUMO

Image-based cell counting is a fundamental yet challenging task with wide applications in biological research. In this paper, we propose a novel unified deep network framework designed to solve this problem for various cell types in both 2D and 3D images. Specifically, we first propose SAU-Net for cell counting by extending the segmentation network U-Net with a Self-Attention module. Second, we design an extension of Batch Normalization (BN) to facilitate the training process for small datasets. In addition, a new 3D benchmark dataset based on the existing mouse blastocyst (MBC) dataset is developed and released to the community. Our SAU-Net achieves state-of-the-art results on four benchmark 2D datasets - synthetic fluorescence microscopy (VGG) dataset, Modified Bone Marrow (MBM) dataset, human subcutaneous adipose tissue (ADI) dataset, and Dublin Cell Counting (DCC) dataset, and the new 3D dataset, MBC. The BN extension is validated using extensive experiments on the 2D datasets, since GPU memory constraints preclude use of 3D datasets. The source code is available at https://github.com/mzlr/sau-net.


Assuntos
Imageamento Tridimensional , Microscopia , Animais , Atenção , Humanos , Processamento de Imagem Assistida por Computador/métodos , Camundongos
14.
Front Artif Intell ; 5: 918888, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35837616

RESUMO

Research on rare diseases has received increasing attention, in part due to the realized profitability of orphan drugs. Biomedical informatics holds promise in accelerating translational research on rare disease, yet challenges remain, including the lack of diagnostic codes for rare diseases and privacy concerns that prevent research access to electronic health records when few patients exist. The Integrated Clinical and Environmental Exposures Service (ICEES) provides regulatory-compliant open access to electronic health record data that have been integrated with environmental exposures data, as well as analytic tools to explore the integrated data. We describe a proof-of-concept application of ICEES to examine demographics, clinical characteristics, environmental exposures, and health outcomes among a cohort of patients enriched for phenotypes associated with cystic fibrosis (CF), idiopathic bronchiectasis (IB), and primary ciliary dyskinesia (PCD). We then focus on a subset of patients with CF, leveraging the availability of a diagnostic code for CF and serving as a benchmark for our development work. We use ICEES to examine select demographics, co-diagnoses, and environmental exposures that may contribute to poor health outcomes among patients with CF, defined as emergency department or inpatient visits for respiratory issues. We replicate current understanding of the pathogenesis and clinical manifestations of CF by identifying co-diagnoses of asthma, chronic nasal congestion, cough, middle ear disease, and pneumonia as factors that differentiate patients with poor health outcomes from those with better health outcomes. We conclude by discussing our preliminary findings in relation to other published work, the strengths and limitations of our approach, and our future directions.

15.
IEEE J Biomed Health Inform ; 26(2): 572-580, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34288883

RESUMO

This paper proposes a novel deep learning architecture involving combinations of Convolutional Neural Networks (CNN) layers and Recurrent neural networks (RNN) layers that can be used to perform segmentation and classification of 5 cardiac rhythms based on ECG recordings. The algorithm is developed in a sequence to sequence setting where the input is a sequence of five second ECG signal sliding windows and the output is a sequence of cardiac rhythm labels. The novel architecture processes as input both the spectrograms of the ECG signal as well as the heartbeats' signal waveform. Additionally, we are able to train the model in the presence of label noise. The model's performance and generalizability is verified on an external database different from the one we used to train. Experimental result shows this approach can achieve an average F1 scores of 0.89 (averaged across 5 classes). The proposed model also achieves comparable classification performance to existing state-of-the-art approach with considerably less number of training parameters.


Assuntos
Arritmias Cardíacas , Eletrocardiografia , Algoritmos , Arritmias Cardíacas/diagnóstico por imagem , Frequência Cardíaca , Humanos , Redes Neurais de Computação
16.
BMC Res Notes ; 15(1): 337, 2022 Oct 31.
Artigo em Inglês | MEDLINE | ID: mdl-36316778

RESUMO

OBJECTIVE: The aim of this study was to determine whether a secure, privacy-preserving record linkage (PPRL) methodology can be implemented in a scalable manner for use in a large national clinical research network. RESULTS: We established the governance and technical capacity to support the use of PPRL across the National Patient-Centered Clinical Research Network (PCORnet®). As a pilot, four sites used the Datavant software to transform patient personally identifiable information (PII) into de-identified tokens. We queried the sites for patients with a clinical encounter in 2018 or 2019 and matched their tokens to determine whether overlap existed. We described patient overlap among the sites and generated a "deduplicated" table of patient demographic characteristics. Overlapping patients were found in 3 of the 6 site-pairs. Following deduplication, the total patient count was 3,108,515 (0.11% reduction), with the largest reduction in count for patients with an "Other/Missing" value for Sex; from 198 to 163 (17.6% reduction). The PPRL solution successfully links patients across data sources using distributed queries without directly accessing patient PII. The overlap queries and analysis performed in this pilot is being replicated across the full network to provide additional insight into patient linkages among a distributed research network.


Assuntos
Registros Eletrônicos de Saúde , Privacidade , Humanos , Registro Médico Coordenado/métodos , Bases de Dados Factuais , Assistência Centrada no Paciente
17.
JAMIA Open ; 4(3): ooaa069, 2021 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-34514351

RESUMO

OBJECTIVES: Social determinants of health (SDH), key contributors to health, are rarely systematically measured and collected in the electronic health record (EHR). We investigate how to leverage clinical notes using novel applications of multi-label learning (MLL) to classify SDH in mental health and substance use disorder patients who frequent the emergency department. METHODS AND MATERIALS: We labeled a gold-standard corpus of EHR clinical note sentences (N = 4063) with 6 identified SDH-related domains recommended by the Institute of Medicine for inclusion in the EHR. We then trained 5 classification models: linear-Support Vector Machine, K-Nearest Neighbors, Random Forest, XGBoost, and bidirectional Long Short-Term Memory (BI-LSTM). We adopted 5 common evaluation measures: accuracy, average precision-recall (AP), area under the curve receiver operating characteristic (AUC-ROC), Hamming loss, and log loss to compare the performance of different methods for MLL classification using the F1 score as the primary evaluation metric. RESULTS: Our results suggested that, overall, BI-LSTM outperformed the other classification models in terms of AUC-ROC (93.9), AP (0.76), and Hamming loss (0.12). The AUC-ROC values of MLL models of SDH related domains varied between (0.59-1.0). We found that 44.6% of our study population (N = 1119) had at least one positive documentation of SDH. DISCUSSION AND CONCLUSION: The proposed approach of training an MLL model on an SDH rich data source can produce a high performing classifier using only unstructured clinical notes. We also provide evidence that model performance is associated with lexical diversity by health professionals and the auto-generation of clinical note sentences to document SDH.

18.
Artigo em Inglês | MEDLINE | ID: mdl-35875189

RESUMO

The Integrated Clinical and Environmental Exposures Service (ICEES) provides regulatory-compliant open access to sensitive patient data that have been integrated with public exposures data. ICEES was designed initially to support dynamic cohort creation and bivariate contingency tests. The objective of the present study was to develop an open approach to support multivariate analyses using existing ICEES functionalities and abiding by all regulatory constraints. We first developed an open approach for generating a multivariate table that maintains contingencies between clinical and environmental variables using programmatic calls to the open ICEES application programming interface. We then applied the approach to data on a large cohort (N = 22,365) of patients with asthma or related conditions and generated an eight-feature table. Due to regulatory constraints, data loss was incurred with the incorporation of each successive feature variable, from a starting sample size of N = 22,365 to a final sample size of N = 4,556 (20.4%), but data loss was < 10% until the addition of the final two feature variables. We then applied a generalized linear model to the subsequent dataset and focused on the impact of seven select feature variables on asthma exacerbations, defined as annual emergency department or inpatient visits for respiratory issues. We identified five feature variables-sex, race, obesity, prednisone, and airborne particulate exposure-as significant predictors of asthma exacerbations. We discuss the advantages and disadvantages of ICEES open multivariate analysis and conclude that, despite limitations, ICEES can provide a valuable resource for open multivariate analysis and can serve as an exemplar for regulatory-compliant informatic solutions to open patient data, with capabilities to explore the impact of environmental exposures on health outcomes.

19.
Artigo em Inglês | MEDLINE | ID: mdl-34769911

RESUMO

ICEES (Integrated Clinical and Environmental Exposures Service) provides a disease-agnostic, regulatory-compliant approach for openly exposing and analyzing clinical data that have been integrated at the patient level with environmental exposures data. ICEES is equipped with basic features to support exploratory analysis using statistical approaches, such as bivariate chi-square tests. We recently developed a method for using ICEES to generate multivariate tables for subsequent application of machine learning and statistical models. The objective of the present study was to use this approach to identify predictors of asthma exacerbations through the application of three multivariate methods: conditional random forest, conditional tree, and generalized linear model. Among seven potential predictor variables, we found five to be of significant importance using both conditional random forest and conditional tree: prednisone, race, airborne particulate exposure, obesity, and sex. The conditional tree method additionally identified several significant two-way and three-way interactions among the same variables. When we applied a generalized linear model, we identified four significant predictor variables, namely prednisone, race, airborne particulate exposure, and obesity. When ranked in order by effect size, the results were in agreement with the results from the conditional random forest and conditional tree methods as well as the published literature. Our results suggest that the open multivariate analytic capabilities provided by ICEES are valid in the context of an asthma use case and likely will have broad value in advancing open research in environmental and public health.


Assuntos
Asma , Exposição Ambiental , Asma/epidemiologia , Asma/etiologia , Humanos , Aprendizado de Máquina , Modelos Estatísticos
20.
JMIR Public Health Surveill ; 7(9): e29310, 2021 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-34298500

RESUMO

BACKGROUND: As the world faced the pandemic caused by the novel coronavirus disease 2019 (COVID-19), medical professionals, technologists, community leaders, and policy makers sought to understand how best to leverage data for public health surveillance and community education. With this complex public health problem, North Carolinians relied on data from state, federal, and global health organizations to increase their understanding of the pandemic and guide decision-making. OBJECTIVE: We aimed to describe the role that stakeholders involved in COVID-19-related data played in managing the pandemic in North Carolina. The study investigated the processes used by organizations throughout the state in using, collecting, and reporting COVID-19 data. METHODS: We used an exploratory qualitative study design to investigate North Carolina's COVID-19 data collection efforts. To better understand these processes, key informant interviews were conducted with employees from organizations that collected COVID-19 data across the state. We developed an interview guide, and open-ended semistructured interviews were conducted during the period from June through November 2020. Interviews lasted between 30 and 45 minutes and were conducted by data scientists by videoconference. Data were subsequently analyzed using qualitative data analysis software. RESULTS: Results indicated that electronic health records were primary sources of COVID-19 data. Often, data were also used to create dashboards to inform the public or other health professionals, to aid in decision-making, or for reporting purposes. Cross-sector collaboration was cited as a major success. Consistency among metrics and data definitions, data collection processes, and contact tracing were cited as challenges. CONCLUSIONS: Findings suggest that, during future outbreaks, organizations across regions could benefit from data centralization and data governance. Data should be publicly accessible and in a user-friendly format. Additionally, established cross-sector collaboration networks are demonstrably beneficial for public health professionals across the state as these established relationships facilitate a rapid response to evolving public health challenges.


Assuntos
COVID-19/epidemiologia , Análise de Dados , Coleta de Dados , Pandemias/prevenção & controle , Participação dos Interessados/psicologia , Feminino , Educação em Saúde , Humanos , Masculino , North Carolina/epidemiologia , Vigilância em Saúde Pública , Pesquisa Qualitativa
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA