Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 63
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Clin Infect Dis ; 77(6): 816-826, 2023 09 18.
Artigo em Inglês | MEDLINE | ID: mdl-37207367

RESUMO

BACKGROUND: Identifying individuals with a higher risk of developing severe coronavirus disease 2019 (COVID-19) outcomes will inform targeted and more intensive clinical monitoring and management. To date, there is mixed evidence regarding the impact of preexisting autoimmune disease (AID) diagnosis and/or immunosuppressant (IS) exposure on developing severe COVID-19 outcomes. METHODS: A retrospective cohort of adults diagnosed with COVID-19 was created in the National COVID Cohort Collaborative enclave. Two outcomes, life-threatening disease and hospitalization, were evaluated by using logistic regression models with and without adjustment for demographics and comorbidities. RESULTS: Of the 2 453 799 adults diagnosed with COVID-19, 191 520 (7.81%) had a preexisting AID diagnosis and 278 095 (11.33%) had a preexisting IS exposure. Logistic regression models adjusted for demographics and comorbidities demonstrated that individuals with a preexisting AID (odds ratio [OR], 1.13; 95% confidence interval [CI]: 1.09-1.17; P < .001), IS exposure (OR, 1.27; 95% CI: 1.24-1.30; P < .001), or both (OR, 1.35; 95% CI: 1.29-1.40; P < .001) were more likely to have a life-threatening disease. These results were consistent when hospitalization was evaluated. A sensitivity analysis evaluating specific IS revealed that tumor necrosis factor inhibitors were protective against life-threatening disease (OR, 0.80; 95% CI: .66-.96; P = .017) and hospitalization (OR, 0.80; 95% CI: .73-.89; P < .001). CONCLUSIONS: Patients with preexisting AID, IS exposure, or both are more likely to have a life-threatening disease or hospitalization. These patients may thus require tailored monitoring and preventative measures to minimize negative consequences of COVID-19.


Assuntos
Autoimunidade , COVID-19 , Adulto , Humanos , COVID-19/epidemiologia , Estudos Retrospectivos , Hospitalização , Imunossupressores/uso terapêutico
2.
BMC Med ; 21(1): 58, 2023 02 16.
Artigo em Inglês | MEDLINE | ID: mdl-36793086

RESUMO

BACKGROUND: Naming a newly discovered disease is a difficult process; in the context of the COVID-19 pandemic and the existence of post-acute sequelae of SARS-CoV-2 infection (PASC), which includes long COVID, it has proven especially challenging. Disease definitions and assignment of a diagnosis code are often asynchronous and iterative. The clinical definition and our understanding of the underlying mechanisms of long COVID are still in flux, and the deployment of an ICD-10-CM code for long COVID in the USA took nearly 2 years after patients had begun to describe their condition. Here, we leverage the largest publicly available HIPAA-limited dataset about patients with COVID-19 in the US to examine the heterogeneity of adoption and use of U09.9, the ICD-10-CM code for "Post COVID-19 condition, unspecified." METHODS: We undertook a number of analyses to characterize the N3C population with a U09.9 diagnosis code (n = 33,782), including assessing person-level demographics and a number of area-level social determinants of health; diagnoses commonly co-occurring with U09.9, clustered using the Louvain algorithm; and quantifying medications and procedures recorded within 60 days of U09.9 diagnosis. We stratified all analyses by age group in order to discern differing patterns of care across the lifespan. RESULTS: We established the diagnoses most commonly co-occurring with U09.9 and algorithmically clustered them into four major categories: cardiopulmonary, neurological, gastrointestinal, and comorbid conditions. Importantly, we discovered that the population of patients diagnosed with U09.9 is demographically skewed toward female, White, non-Hispanic individuals, as well as individuals living in areas with low poverty and low unemployment. Our results also include a characterization of common procedures and medications associated with U09.9-coded patients. CONCLUSIONS: This work offers insight into potential subtypes and current practice patterns around long COVID and speaks to the existence of disparities in the diagnosis of patients with long COVID. This latter finding in particular requires further research and urgent remediation.


Assuntos
COVID-19 , Síndrome de COVID-19 Pós-Aguda , Humanos , Feminino , Classificação Internacional de Doenças , Pandemias , COVID-19/diagnóstico , COVID-19/epidemiologia , SARS-CoV-2
3.
BMC Public Health ; 23(1): 2103, 2023 10 25.
Artigo em Inglês | MEDLINE | ID: mdl-37880596

RESUMO

BACKGROUND: More than one-third of individuals experience post-acute sequelae of SARS-CoV-2 infection (PASC, which includes long-COVID). The objective is to identify risk factors associated with PASC/long-COVID diagnosis. METHODS: This was a retrospective case-control study including 31 health systems in the United States from the National COVID Cohort Collaborative (N3C). 8,325 individuals with PASC (defined by the presence of the International Classification of Diseases, version 10 code U09.9 or a long-COVID clinic visit) matched to 41,625 controls within the same health system and COVID index date within ± 45 days of the corresponding case's earliest COVID index date. Measurements of risk factors included demographics, comorbidities, treatment and acute characteristics related to COVID-19. Multivariable logistic regression, random forest, and XGBoost were used to determine the associations between risk factors and PASC. RESULTS: Among 8,325 individuals with PASC, the majority were > 50 years of age (56.6%), female (62.8%), and non-Hispanic White (68.6%). In logistic regression, middle-age categories (40 to 69 years; OR ranging from 2.32 to 2.58), female sex (OR 1.4, 95% CI 1.33-1.48), hospitalization associated with COVID-19 (OR 3.8, 95% CI 3.05-4.73), long (8-30 days, OR 1.69, 95% CI 1.31-2.17) or extended hospital stay (30 + days, OR 3.38, 95% CI 2.45-4.67), receipt of mechanical ventilation (OR 1.44, 95% CI 1.18-1.74), and several comorbidities including depression (OR 1.50, 95% CI 1.40-1.60), chronic lung disease (OR 1.63, 95% CI 1.53-1.74), and obesity (OR 1.23, 95% CI 1.16-1.3) were associated with increased likelihood of PASC diagnosis or care at a long-COVID clinic. Characteristics associated with a lower likelihood of PASC diagnosis or care at a long-COVID clinic included younger age (18 to 29 years), male sex, non-Hispanic Black race, and comorbidities such as substance abuse, cardiomyopathy, psychosis, and dementia. More doctors per capita in the county of residence was associated with an increased likelihood of PASC diagnosis or care at a long-COVID clinic. Our findings were consistent in sensitivity analyses using a variety of analytic techniques and approaches to select controls. CONCLUSIONS: This national study identified important risk factors for PASC diagnosis such as middle age, severe COVID-19 disease, and specific comorbidities. Further clinical and epidemiological research is needed to better understand underlying mechanisms and the potential role of vaccines and therapeutics in altering PASC course.


Assuntos
COVID-19 , SARS-CoV-2 , Pessoa de Meia-Idade , Feminino , Masculino , Humanos , Adulto , Idoso , Adolescente , Adulto Jovem , COVID-19/epidemiologia , Síndrome de COVID-19 Pós-Aguda , Estudos de Casos e Controles , Estudos Retrospectivos , Fatores de Risco , Progressão da Doença
4.
Virol J ; 19(1): 84, 2022 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-35570298

RESUMO

BACKGROUND: Non-steroidal anti-inflammatory drugs (NSAIDs) are commonly used to reduce pain, fever, and inflammation but have been associated with complications in community-acquired pneumonia. Observations shortly after the start of the COVID-19 pandemic in 2020 suggested that ibuprofen was associated with an increased risk of adverse events in COVID-19 patients, but subsequent observational studies failed to demonstrate increased risk and in one case showed reduced risk associated with NSAID use. METHODS: A 38-center retrospective cohort study was performed that leveraged the harmonized, high-granularity electronic health record data of the National COVID Cohort Collaborative. A propensity-matched cohort of 19,746 COVID-19 inpatients was constructed by matching cases (treated with NSAIDs at the time of admission) and 19,746 controls (not treated) from 857,061 patients with COVID-19 available for analysis. The primary outcome of interest was COVID-19 severity in hospitalized patients, which was classified as: moderate, severe, or mortality/hospice. Secondary outcomes were acute kidney injury (AKI), extracorporeal membrane oxygenation (ECMO), invasive ventilation, and all-cause mortality at any time following COVID-19 diagnosis. RESULTS: Logistic regression showed that NSAID use was not associated with increased COVID-19 severity (OR: 0.57 95% CI: 0.53-0.61). Analysis of secondary outcomes using logistic regression showed that NSAID use was not associated with increased risk of all-cause mortality (OR 0.51 95% CI: 0.47-0.56), invasive ventilation (OR: 0.59 95% CI: 0.55-0.64), AKI (OR: 0.67 95% CI: 0.63-0.72), or ECMO (OR: 0.51 95% CI: 0.36-0.7). In contrast, the odds ratios indicate reduced risk of these outcomes, but our quantitative bias analysis showed E-values of between 1.9 and 3.3 for these associations, indicating that comparatively weak or moderate confounder associations could explain away the observed associations. CONCLUSIONS: Study interpretation is limited by the observational design. Recording of NSAID use may have been incomplete. Our study demonstrates that NSAID use is not associated with increased COVID-19 severity, all-cause mortality, invasive ventilation, AKI, or ECMO in COVID-19 inpatients. A conservative interpretation in light of the quantitative bias analysis is that there is no evidence that NSAID use is associated with risk of increased severity or the other measured outcomes. Our results confirm and extend analogous findings in previous observational studies using a large cohort of patients drawn from 38 centers in a nationally representative multicenter database.


Assuntos
Injúria Renal Aguda , COVID-19 , Anti-Inflamatórios não Esteroides/efeitos adversos , Teste para COVID-19 , Estudos de Coortes , Humanos , Pandemias , Estudos Retrospectivos
5.
J Biomed Inform ; 127: 104002, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35077901

RESUMO

OBJECTIVE: The large-scale collection of observational data and digital technologies could help curb the COVID-19 pandemic. However, the coexistence of multiple Common Data Models (CDMs) and the lack of data extract, transform, and load (ETL) tool between different CDMs causes potential interoperability issue between different data systems. The objective of this study is to design, develop, and evaluate an ETL tool that transforms the PCORnet CDM format data into the OMOP CDM. METHODS: We developed an open-source ETL tool to facilitate the data conversion from the PCORnet CDM and the OMOP CDM. The ETL tool was evaluated using a dataset with 1000 patients randomly selected from the PCORnet CDM at Mayo Clinic. Information loss, data mapping accuracy, and gap analysis approaches were conducted to assess the performance of the ETL tool. We designed an experiment to conduct a real-world COVID-19 surveillance task to assess the feasibility of the ETL tool. We also assessed the capacity of the ETL tool for the COVID-19 data surveillance using data collection criteria of the MN EHR Consortium COVID-19 project. RESULTS: After the ETL process, all the records of 1000 patients from 18 PCORnet CDM tables were successfully transformed into 12 OMOP CDM tables. The information loss for all the concept mapping was less than 0.61%. The string mapping process for the unit concepts lost 2.84% records. Almost all the fields in the manual mapping process achieved 0% information loss, except the specialty concept mapping. Moreover, the mapping accuracy for all the fields were 100%. The COVID-19 surveillance task collected almost the same set of cases (99.3% overlaps) from the original PCORnet CDM and target OMOP CDM separately. Finally, all the data elements for MN EHR Consortium COVID-19 project could be captured from both the PCORnet CDM and the OMOP CDM. CONCLUSION: We demonstrated that our ETL tool could satisfy the data conversion requirements between the PCORnet CDM and the OMOP CDM. The outcome of the work would facilitate the data retrieval, communication, sharing, and analysis between different institutions for not only COVID-19 related project, but also other real-world evidence-based observational studies.


Assuntos
COVID-19 , COVID-19/epidemiologia , Bases de Dados Factuais , Registros Eletrônicos de Saúde , Humanos , Armazenamento e Recuperação da Informação , Pandemias , SARS-CoV-2
6.
J Biomed Inform ; 134: 104201, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-36089199

RESUMO

BACKGROUND: Knowledge graphs (KGs) play a key role to enable explainable artificial intelligence (AI) applications in healthcare. Constructing clinical knowledge graphs (CKGs) against heterogeneous electronic health records (EHRs) has been desired by the research and healthcare AI communities. From the standardization perspective, community-based standards such as the Fast Healthcare Interoperability Resources (FHIR) and the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) are increasingly used to represent and standardize EHR data for clinical data analytics, however, the potential of such a standard on building CKG has not been well investigated. OBJECTIVE: To develop and evaluate methods and tools that expose the OMOP CDM-based clinical data repositories into virtual clinical KGs that are compliant with FHIR Resource Description Framework (RDF) specification. METHODS: We developed a system called FHIR-Ontop-OMOP to generate virtual clinical KGs from the OMOP relational databases. We leveraged an OMOP CDM-based Medical Information Mart for Intensive Care (MIMIC-III) data repository to evaluate the FHIR-Ontop-OMOP system in terms of the faithfulness of data transformation and the conformance of the generated CKGs to the FHIR RDF specification. RESULTS: A beta version of the system has been released. A total of more than 100 data element mappings from 11 OMOP CDM clinical data, health system and vocabulary tables were implemented in the system, covering 11 FHIR resources. The generated virtual CKG from MIMIC-III contains 46,520 instances of FHIR Patient, 716,595 instances of Condition, 1,063,525 instances of Procedure, 24,934,751 instances of MedicationStatement, 365,181,104 instances of Observations, and 4,779,672 instances of CodeableConcept. Patient counts identified by five pairs of SQL (over the MIMIC database) and SPARQL (over the virtual CKG) queries were identical, ensuring the faithfulness of the data transformation. Generated CKG in RDF triples for 100 patients were fully conformant with the FHIR RDF specification. CONCLUSION: The FHIR-Ontop-OMOP system can expose OMOP database as a FHIR-compliant RDF graph. It provides a meaningful use case demonstrating the potentials that can be enabled by the interoperability between FHIR and OMOP CDM. Generated clinical KGs in FHIR RDF provide a semantic foundation to enable explainable AI applications in healthcare.


Assuntos
Inteligência Artificial , Reconhecimento Automatizado de Padrão , Data Warehousing , Atenção à Saúde , Registros Eletrônicos de Saúde , Humanos
7.
BMC Med Inform Decis Mak ; 20(1): 53, 2020 03 11.
Artigo em Inglês | MEDLINE | ID: mdl-32160884

RESUMO

BACKGROUND: Informatics tools to support the integration and subsequent interrogation of spatiotemporal data such as clinical data and environmental exposures data are lacking. Such tools are needed to support research in environmental health and any biomedical field that is challenged by the need for integrated spatiotemporal data to examine individual-level determinants of health and disease. RESULTS: We have developed an open-source software application-FHIR PIT (Health Level 7 Fast Healthcare Interoperability Resources Patient data Integration Tool)-to enable studies on the impact of individual-level environmental exposures on health and disease. FHIR PIT was motivated by the need to integrate patient data derived from our institution's clinical warehouse with a variety of public data sources on environmental exposures and then openly expose the data via ICEES (Integrated Clinical and Environmental Exposures Service). FHIR PIT consists of transformation steps or building blocks that can be chained together to form a transformation and integration workflow. Several transformation steps are generic and thus can be reused. As such, new types of data can be incorporated into the modular FHIR PIT pipeline by simply reusing generic steps or adding new ones. We validated FHIR PIT in the context of a driving use case designed to investigate the impact of airborne pollutant exposures on asthma. Specifically, we replicated published findings demonstrating racial disparities in the impact of airborne pollutants on asthma exacerbations. CONCLUSIONS: While FHIR PIT was developed to support our driving use case on asthma, the software can be used to integrate any type and number of spatiotemporal data sources at a level of granularity that enables individual-level study. We expect FHIR PIT to facilitate research in environmental health and numerous other biomedical disciplines.


Assuntos
Registros Eletrônicos de Saúde , Exposição Ambiental , Interoperabilidade da Informação em Saúde/normas , Design de Software , Software , Nível Sete de Saúde , Humanos , Análise Espaço-Temporal , Integração de Sistemas , Fluxo de Trabalho
8.
J Biomed Inform ; 100: 103325, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31676459

RESUMO

This special communication describes activities, products, and lessons learned from a recent hackathon that was funded by the National Center for Advancing Translational Sciences via the Biomedical Data Translator program ('Translator'). Specifically, Translator team members self-organized and worked together to conceptualize and execute, over a five-day period, a multi-institutional clinical research study that aimed to examine, using open clinical data sources, relationships between sex, obesity, diabetes, and exposure to airborne fine particulate matter among patients with severe asthma. The goal was to develop a proof of concept that this new model of collaboration and data sharing could effectively produce meaningful scientific results and generate new scientific hypotheses. Three Translator Clinical Knowledge Sources, each of which provides open access (via Application Programming Interfaces) to data derived from the electronic health record systems of major academic institutions, served as the source of study data. Jupyter Python notebooks, shared in GitHub repositories, were used to call the knowledge sources and analyze and integrate the results. The results replicated established or suspected relationships between sex, obesity, diabetes, exposure to airborne fine particulate matter, and severe asthma. In addition, the results demonstrated specific differences across the three Translator Clinical Knowledge Sources, suggesting cohort- and/or environment-specific factors related to the services themselves or the catchment area from which each service derives patient data. Collectively, this special communication demonstrates the power and utility of intense, team-oriented hackathons and offers general technical, organizational, and scientific lessons learned.


Assuntos
Asma/fisiopatologia , Diabetes Mellitus/fisiopatologia , Exposição Ambiental , Armazenamento e Recuperação da Informação , Obesidade/fisiopatologia , Material Particulado/toxicidade , Fatores Sexuais , Asma/complicações , Feminino , Humanos , Masculino , Obesidade/complicações , Índice de Gravidade de Doença
9.
Am J Obstet Gynecol ; 218(6): 610.e1-610.e7, 2018 06.
Artigo em Inglês | MEDLINE | ID: mdl-29432754

RESUMO

BACKGROUND: Women with symptomatic uterine fibroids can report a myriad of symptoms, including pain, bleeding, infertility, and psychosocial sequelae. Optimizing fibroid research requires the ability to enroll populations of women with image-confirmed symptomatic uterine fibroids. OBJECTIVE: Our objective was to develop an electronic health record-based algorithm to identify women with symptomatic uterine fibroids for a comparative effectiveness study of medical or surgical treatments on quality-of-life measures. Using an iterative process and text-mining techniques, an effective computable phenotype algorithm, composed of demographics, and clinical and laboratory characteristics, was developed with reasonable performance. Such algorithms provide a feasible, efficient way to identify populations of women with symptomatic uterine fibroids for the conduct of large traditional or pragmatic trials and observational comparative effectiveness studies. Symptomatic uterine fibroids, due to menorrhagia, pelvic pain, bulk symptoms, or infertility, are a source of substantial morbidity for reproductive-age women. Comparing Treatment Options for Uterine Fibroids is a multisite registry study to compare the effectiveness of hormonal or surgical fibroid treatments on women's perceptions of their quality of life. Electronic health record-based algorithms are able to identify large numbers of women with fibroids, but additional work is needed to develop electronic health record algorithms that can identify women with symptomatic fibroids to optimize fibroid research. We sought to develop an efficient electronic health record-based algorithm that can identify women with symptomatic uterine fibroids in a large health care system for recruitment into large-scale observational and interventional research in fibroid management. STUDY DESIGN: We developed and assessed the accuracy of 3 algorithms to identify patients with symptomatic fibroids using an iterative approach. The data source was the Carolina Data Warehouse for Health, a repository for the health system's electronic health record data. In addition to International Classification of Diseases, Ninth Revision diagnosis and procedure codes and clinical characteristics, text data-mining software was used to derive information from imaging reports to confirm the presence of uterine fibroids. Results of each algorithm were compared with expert manual review to calculate the positive predictive values for each algorithm. RESULTS: Algorithm 1 was composed of the following criteria: (1) age 18-54 years; (2) either ≥1 International Classification of Diseases, Ninth Revision diagnosis codes for uterine fibroids or mention of fibroids using text-mined key words in imaging records or documents; and (3) no International Classification of Diseases, Ninth Revision or Current Procedural Terminology codes for hysterectomy and no reported history of hysterectomy. The positive predictive value was 47% (95% confidence interval 39-56%). Algorithm 2 required ≥2 International Classification of Diseases, Ninth Revision diagnosis codes for fibroids and positive text-mined key words and had a positive predictive value of 65% (95% confidence interval 50-79%). In algorithm 3, further refinements included ≥2 International Classification of Diseases, Ninth Revision diagnosis codes for fibroids on separate outpatient visit dates, the exclusion of women who had a positive pregnancy test within 3 months of their fibroid-related visit, and exclusion of incidentally detected fibroids during prenatal or emergency department visits. Algorithm 3 achieved a positive predictive value of 76% (95% confidence interval 71-81%). CONCLUSION: An electronic health record-based algorithm is capable of identifying cases of symptomatic uterine fibroids with moderate positive predictive value and may be an efficient approach for large-scale study recruitment.


Assuntos
Algoritmos , Registros Eletrônicos de Saúde , Leiomioma/fisiopatologia , Neoplasias Uterinas/fisiopatologia , Adolescente , Adulto , Pesquisa Biomédica , Current Procedural Terminology , Coleta de Dados/métodos , Feminino , Humanos , Infertilidade Feminina/etiologia , Infertilidade Feminina/fisiopatologia , Classificação Internacional de Doenças , Leiomioma/complicações , Menorragia/etiologia , Menorragia/fisiopatologia , Pessoa de Meia-Idade , Dor Pélvica/etiologia , Dor Pélvica/fisiopatologia , Fenótipo , Neoplasias Uterinas/complicações , Adulto Jovem
10.
Am J Hematol ; 90(8): 691-5, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-25963831

RESUMO

Red blood cell (RBC) alloimmunization is a significant clinical complication of sickle cell disease (SCD). It can lead to difficulty with cross-matching for future transfusions and may sometimes trigger life-threatening delayed hemolytic transfusion reactions. We conducted a retrospective study to explore the association of clinical complications and age of RBC with alloimmunization in patients with SCD followed at a single institution from 2005 to 2012. One hundred and sixty six patients with a total of 488 RBC transfusions were evaluated. Nineteen patients (11%) developed new alloantibodies following blood transfusions during the period of review. The median age of RBC units was 20 days (interquartile range: 14-27 days). RBC antibody formation was significantly associated with the age of RBC units (P = 0.002), with a hazard ratio of 3.5 (95% CI: 1.71-7.11) for a RBC unit that was 7 days old and 9.8 (95% CI: 2.66-35.97) for a unit that was 35 days old, 28 days after the blood transfusion. No association was observed between RBC alloimmunization and acute vaso-occlusive complications. Although increased echocardiography-derived tricuspid regurgitant jet velocity (TRV) was associated with the presence of RBC alloantibodies (P = 0.02), TRV was not significantly associated with alloimmunization when adjusted for patient age and number of transfused RBC units. Our study suggests that RBC antibody formation is significantly associated with older age of RBCs at the time of transfusion. Prospective studies in patients with SCD are required to confirm this finding.


Assuntos
Anemia Falciforme/imunologia , Autoimunidade , Transfusão de Eritrócitos , Isoanticorpos/biossíntese , Adolescente , Adulto , Fatores Etários , Idoso , Anemia Falciforme/patologia , Anemia Falciforme/terapia , Incompatibilidade de Grupos Sanguíneos , Senescência Celular , Criança , Pré-Escolar , Feminino , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Modelos de Riscos Proporcionais , Estudos Retrospectivos , Valva Tricúspide/imunologia , Valva Tricúspide/fisiopatologia
11.
Pediatr Diabetes ; 15(8): 573-84, 2014 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24913103

RESUMO

BACKGROUND: The performance of automated algorithms for childhood diabetes case ascertainment and type classification may differ by demographic characteristics. OBJECTIVE: This study evaluated the potential of administrative and electronic health record (EHR) data from a large academic care delivery system to conduct diabetes case ascertainment in youth according to type, age, and race/ethnicity. SUBJECTS: Of 57 767 children aged <20 yr as of 31 December 2011 seen at University of North Carolina Health Care System in 2011 were included. METHODS: Using an initial algorithm including billing data, patient problem lists, laboratory test results, and diabetes related medications between 1 July 2008 and 31 December 2011, presumptive cases were identified and validated by chart review. More refined algorithms were evaluated by type (type 1 vs. type 2), age (<10 vs. ≥10 yr) and race/ethnicity (non-Hispanic White vs. 'other'). Sensitivity, specificity, and positive predictive value were calculated and compared. RESULTS: The best algorithm for ascertainment of overall diabetes cases was billing data. The best type 1 algorithm was the ratio of the number of type 1 billing codes to the sum of type 1 and type 2 billing codes ≥0.5. A useful algorithm to ascertain youth with type 2 diabetes with 'other' race/ethnicity was identified. Considerable age and racial/ethnic differences were present in type-non-specific and type 2 algorithms. CONCLUSIONS: Administrative and EHR data may be used to identify cases of childhood diabetes (any type), and to identify type 1 cases. The performance of type 2 case ascertainment algorithms differed substantially by race/ethnicity.


Assuntos
Algoritmos , Diabetes Mellitus Tipo 1/classificação , Diabetes Mellitus Tipo 1/diagnóstico , Diabetes Mellitus Tipo 2/classificação , Diabetes Mellitus Tipo 2/diagnóstico , Registros Eletrônicos de Saúde , Adolescente , Adulto , Criança , Pré-Escolar , Diabetes Mellitus Tipo 1/epidemiologia , Diabetes Mellitus Tipo 2/epidemiologia , Registros Eletrônicos de Saúde/normas , Feminino , Humanos , Lactente , Recém-Nascido , Masculino , Programas de Rastreamento/métodos , Adulto Jovem
12.
JAMIA Open ; 7(3): ooae076, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39132679

RESUMO

Objectives: To provide a foundational methodology for differentiating comorbidity patterns in subphenotypes through investigation of a multi-site dementia patient dataset. Materials and Methods: Employing the National Clinical Cohort Collaborative Tenant Pilot (N3C Clinical) dataset, our approach integrates machine learning algorithms-logistic regression and eXtreme Gradient Boosting (XGBoost)-with a diagnostic hierarchical model for nuanced classification of dementia subtypes based on comorbidities and gender. The methodology is enhanced by multi-site EHR data, implementing a hybrid sampling strategy combining 65% Synthetic Minority Over-sampling Technique (SMOTE), 35% Random Under-Sampling (RUS), and Tomek Links for class imbalance. The hierarchical model further refines the analysis, allowing for layered understanding of disease patterns. Results: The study identified significant comorbidity patterns associated with diagnosis of Alzheimer's, Vascular, and Lewy Body dementia subtypes. The classification models achieved accuracies up to 69% for Alzheimer's/Vascular dementia and highlighted challenges in distinguishing Dementia with Lewy Bodies. The hierarchical model elucidates the complexity of diagnosing Dementia with Lewy Bodies and reveals the potential impact of regional clinical practices on dementia classification. Conclusion: Our methodology underscores the importance of leveraging multi-site datasets and tailored sampling techniques for dementia research. This framework holds promise for extending to other disease subtypes, offering a pathway to more nuanced and generalizable insights into dementia and its complex interplay with comorbid conditions. Discussion: This study underscores the critical role of multi-site data analyzes in understanding the relationship between comorbidities and disease subtypes. By utilizing diverse healthcare data, we emphasize the need to consider site-specific differences in clinical practices and patient demographics. Despite challenges like class imbalance and variability in EHR data, our findings highlight the essential contribution of multi-site data to developing accurate and generalizable models for disease classification.

13.
Commun Med (Lond) ; 4(1): 129, 2024 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-38992084

RESUMO

BACKGROUND: Although the COVID-19 pandemic has persisted for over 3 years, reinfections with SARS-CoV-2 are not well understood. We aim to characterize reinfection, understand development of Long COVID after reinfection, and compare severity of reinfection with initial infection. METHODS: We use an electronic health record study cohort of over 3 million patients from the National COVID Cohort Collaborative as part of the NIH Researching COVID to Enhance Recovery Initiative. We calculate summary statistics, effect sizes, and Kaplan-Meier curves to better understand COVID-19 reinfections. RESULTS: Here we validate previous findings of reinfection incidence (6.9%), the occurrence of most reinfections during the Omicron epoch, and evidence of multiple reinfections. We present findings that the proportion of Long COVID diagnoses is higher following initial infection than reinfection for infections in the same epoch. We report lower albumin levels leading up to reinfection and a statistically significant association of severity between initial infection and reinfection (chi-squared value: 25,697, p-value: <0.0001) with a medium effect size (Cramer's V: 0.20, DoF = 3). Individuals who experienced severe initial and first reinfection were older in age and at a higher mortality risk than those who had mild initial infection and reinfection. CONCLUSIONS: In a large patient cohort, we find that the severity of reinfection appears to be associated with the severity of initial infection and that Long COVID diagnoses appear to occur more often following initial infection than reinfection in the same epoch. Future research may build on these findings to better understand COVID-19 reinfections.


More than three years after the start of the COVID-19 pandemic, individuals are frequently reporting multiple COVID-19 infections. However, these reinfections remain poorly understood. Here, we investigate COVID-19 reinfections in a large electronic health record cohort of over 3 million patients. We use data summary techniques and statistical tests to characterize reinfections and their relationships with disease severity, biomarkers, and Long COVID. We find that individuals with severe initial infection are more likely to experience severe reinfection, that some protein levels are lower, leading to reinfection, and that a lower proportion of individuals are diagnosed with Long COVID following reinfection than initial infection. Our work highlights the prevalence and impact of reinfections and suggests the need for further research.

14.
medRxiv ; 2024 Jul 31.
Artigo em Inglês | MEDLINE | ID: mdl-38343863

RESUMO

Preventing and treating post-acute sequelae of SARS-CoV-2 infection (PASC), commonly known as Long COVID, has become a public health priority. In this study, we examined whether treatment with Paxlovid in the acute phase of COVID-19 helps prevent the onset of PASC. We used electronic health records from the National Covid Cohort Collaborative (N3C) to define a cohort of 426,352 patients who had COVID-19 since April 1, 2022, and were eligible for Paxlovid treatment due to risk for progression to severe COVID-19. We used the target trial emulation (TTE) framework to estimate the effect of Paxlovid treatment on PASC incidence. We estimated overall PASC incidence using a computable phenotype. We also measured the onset of novel cognitive, fatigue, and respiratory symptoms in the post-acute period. Paxlovid treatment did not have a significant effect on overall PASC incidence (relative risk [RR] = 0.98, 95% confidence interval [CI] 0.95-1.01). However, it had a protective effect on cognitive (RR = 0.90, 95% CI 0.84-0.96) and fatigue (RR = 0.95, 95% CI 0.91-0.98) symptom clusters, which suggests that the etiology of these symptoms may be more closely related to viral load than that of respiratory symptoms.

15.
NPJ Digit Med ; 7(1): 296, 2024 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-39433942

RESUMO

Post-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection. We found many conditions increased in COVID-19 patients compared to controls, and using a novel method to associate patients with clusters over time, we additionally found phenotypes specific to patient sex, age, wave of infection, and PASC diagnosis status. While many of these results reflect known PASC symptoms, the resolution provided by this unprecedented data scale suggests avenues for improved diagnostics and mechanistic understanding of this multifaceted disease.

16.
medRxiv ; 2024 Jun 11.
Artigo em Inglês | MEDLINE | ID: mdl-38947087

RESUMO

Post-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection. We found many conditions increased in COVID-19 patients compared to controls, and using a novel method to associate patients with clusters over time, we additionally found phenotypes specific to patient sex, age, wave of infection, and PASC diagnosis status. While many of these results reflect known PASC symptoms, the resolution provided by this unprecedented data scale suggests avenues for improved diagnostics and mechanistic understanding of this multifaceted disease.

17.
JMIR Med Inform ; 12: e49997, 2024 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-39250782

RESUMO

BACKGROUND: A wealth of clinically relevant information is only obtainable within unstructured clinical narratives, leading to great interest in clinical natural language processing (NLP). While a multitude of approaches to NLP exist, current algorithm development approaches have limitations that can slow the development process. These limitations are exacerbated when the task is emergent, as is the case currently for NLP extraction of signs and symptoms of COVID-19 and postacute sequelae of SARS-CoV-2 infection (PASC). OBJECTIVE: This study aims to highlight the current limitations of existing NLP algorithm development approaches that are exacerbated by NLP tasks surrounding emergent clinical concepts and to illustrate our approach to addressing these issues through the use case of developing an NLP system for the signs and symptoms of COVID-19 and PASC. METHODS: We used 2 preexisting studies on PASC as a baseline to determine a set of concepts that should be extracted by NLP. This concept list was then used in conjunction with the Unified Medical Language System to autonomously generate an expanded lexicon to weakly annotate a training set, which was then reviewed by a human expert to generate a fine-tuned NLP algorithm. The annotations from a fully human-annotated test set were then compared with NLP results from the fine-tuned algorithm. The NLP algorithm was then deployed to 10 additional sites that were also running our NLP infrastructure. Of these 10 sites, 5 were used to conduct a federated evaluation of the NLP algorithm. RESULTS: An NLP algorithm consisting of 12,234 unique normalized text strings corresponding to 2366 unique concepts was developed to extract COVID-19 or PASC signs and symptoms. An unweighted mean dictionary coverage of 77.8% was found for the 5 sites. CONCLUSIONS: The evolutionary and time-critical nature of the PASC NLP task significantly complicates existing approaches to NLP algorithm development. In this work, we present a hybrid approach using the Open Health Natural Language Processing Toolkit aimed at addressing these needs with a dictionary-based weak labeling step that minimizes the need for additional expert annotation while still preserving the fine-tuning capabilities of expert involvement.

18.
EBioMedicine ; 108: 105333, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39321500

RESUMO

BACKGROUND: While many patients seem to recover from SARS-CoV-2 infections, many patients report experiencing SARS-CoV-2 symptoms for weeks or months after their acute COVID-19 ends, even developing new symptoms weeks after infection. These long-term effects are called post-acute sequelae of SARS-CoV-2 (PASC) or, more commonly, Long COVID. The overall prevalence of Long COVID is currently unknown, and tools are needed to help identify patients at risk for developing long COVID. METHODS: A working group of the Rapid Acceleration of Diagnostics-radical (RADx-rad) program, comprised of individuals from various NIH institutes and centers, in collaboration with REsearching COVID to Enhance Recovery (RECOVER) developed and organized the Long COVID Computational Challenge (L3C), a community challenge aimed at incentivizing the broader scientific community to develop interpretable and accurate methods for identifying patients at risk of developing Long COVID. From August 2022 to December 2022, participants developed Long COVID risk prediction algorithms using the National COVID Cohort Collaborative (N3C) data enclave, a harmonized data repository from over 75 healthcare institutions from across the United States (U.S.). FINDINGS: Over the course of the challenge, 74 teams designed and built 35 Long COVID prediction models using the N3C data enclave. The top 10 teams all scored above a 0.80 Area Under the Receiver Operator Curve (AUROC) with the highest scoring model achieving a mean AUROC of 0.895. Included in the top submission was a visualization dashboard that built timelines for each patient, updating the risk of a patient developing Long COVID in response to clinical events. INTERPRETATION: As a result of L3C, federal reviewers identified multiple machine learning models that can be used to identify patients at risk for developing Long COVID. Many of the teams used approaches in their submissions which can be applied to future clinical prediction questions. FUNDING: Research reported in this RADx® Rad publication was supported by the National Institutes of Health. Timothy Bergquist, Johanna Loomba, and Emily Pfaff were supported by Axle Subcontract: NCATS-STSS-P00438.


Assuntos
COVID-19 , Aprendizado de Máquina , SARS-CoV-2 , Humanos , COVID-19/epidemiologia , SARS-CoV-2/isolamento & purificação , Estados Unidos/epidemiologia , Algoritmos , Síndrome de COVID-19 Pós-Aguda , Estudos de Coortes , Crowdsourcing
19.
Otol Neurotol Open ; 4(2): e051, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38919767

RESUMO

Objective: Determine the incidence of vestibular disorders in patients with SARS-CoV-2 compared to the control population. Study Design: Retrospective. Setting: Clinical data in the National COVID Cohort Collaborative database (N3C). Methods: Deidentified patient data from the National COVID Cohort Collaborative database (N3C) were queried based on variant peak prevalence (untyped, alpha, delta, omicron 21K, and omicron 23A) from covariants.org to retrospectively analyze the incidence of vestibular disorders in patients with SARS-CoV-2 compared to control population, consisting of patients without documented evidence of COVID infection during the same period. Results: Patients testing positive for COVID-19 were significantly more likely to have a vestibular disorder compared to the control population. Compared to control patients, the odds ratio of vestibular disorders was significantly elevated in patients with untyped (odds ratio [OR], 2.39; confidence intervals [CI], 2.29-2.50; P < 0.001), alpha (OR, 3.63; CI, 3.48-3.78; P < 0.001), delta (OR, 3.03; CI, 2.94-3.12; P < 0.001), omicron 21K variant (OR, 2.97; CI, 2.90-3.04; P < 0.001), and omicron 23A variant (OR, 8.80; CI, 8.35-9.27; P < 0.001). Conclusions: The incidence of vestibular disorders differed between COVID-19 variants and was significantly elevated in COVID-19-positive patients compared to the control population. These findings have implications for patient counseling and further research is needed to discern the long-term effects of these findings.

20.
medRxiv ; 2023 Feb 04.
Artigo em Inglês | MEDLINE | ID: mdl-36778264

RESUMO

Importance: Identifying individuals with a higher risk of developing severe COVID-19 outcomes will inform targeted or more intensive clinical monitoring and management. Objective: To examine, using data from the National COVID Cohort Collaborative (N3C), whether patients with pre-existing autoimmune disease (AID) diagnosis and/or immunosuppressant (IS) exposure are at a higher risk of developing severe COVID-19 outcomes. Design setting and participants: A retrospective cohort of 2,453,799 individuals diagnosed with COVID-19 between January 1 st , 2020, and June 30 th , 2022, was created from the N3C data enclave, which comprises data of 15,231,849 patients from 75 USA data partners. Patients were stratified as those with/without a pre-existing diagnosis of AID and/or those with/without exposure to IS prior to COVID-19. Main outcomes and measures: Two outcomes of COVID-19 severity, derived from the World Health Organization severity score, were defined, namely life-threatening disease and hospitalization. Odds ratios (ORs) with 95% confidence intervals (CIs) were calculated using logistic regression models with and without adjustment for demographics (age, BMI, gender, race, ethnicity, smoking status), and comorbidities (cardiovascular disease, dementia, pulmonary disease, liver disease, type 2 diabetes mellitus, kidney disease, cancer, and HIV infection). Results: In total, 2,453,799 (16.11% of the N3C cohort) adults (age> 18 years) were diagnosed with COVID-19, of which 191,520 (7.81%) had a prior AID diagnosis, and 278,095 (11.33%) had a prior IS exposure. Logistic regression models adjusted for demographic factors and comorbidities demonstrated that individuals with a prior AID (OR = 1.13, 95% CI 1.09 - 1.17; p =2.43E-13), prior exposure to IS (OR= 1.27, 95% CI 1.24 - 1.30; p =3.66E-74), or both (OR= 1.35, 95% CI 1.29 - 1.40; p =7.50E-49) were more likely to have a life-threatening COVID-19 disease. These results were confirmed after adjusting for exposure to antivirals and vaccination in a cohort subset with COVID-19 diagnosis dates after December 2021 (AID OR = 1.18, 95% CI 1.02 - 1.36; p =2.46E-02; IS OR= 1.60, 95% CI 1.41 - 1.80; p =5.11E-14; AID+IS OR= 1.93, 95% CI 1.62 - 2.30; p =1.68E-13). These results were consistent when evaluating hospitalization as the outcome and also when stratifying by race and sex. Finally, a sensitivity analysis evaluating specific IS revealed that TNF inhibitors were protective against life-threatening disease (OR = 0.80, 95% CI 0.66-0.96; p =1.66E-2) and hospitalization (OR = 0.80, 95% CI 0.73 - 0.89; p =1.06E-05). Conclusions and Relevance: Patients with pre-existing AID, exposure to IS, or both are more likely to have a life-threatening disease or hospitalization. These patients may thus require tailored monitoring and preventative measures to minimize negative consequences of COVID-19.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA