Your browser doesn't support javascript.
loading
Multi-ancestry genome- and phenome-wide association studies of diverticular disease in electronic health records with natural language processing enriched phenotyping algorithm.
Joo, Yoonjung Yoonie; Pacheco, Jennifer A; Thompson, William K; Rasmussen-Torvik, Laura J; Rasmussen, Luke V; Lin, Frederick T J; Andrade, Mariza de; Borthwick, Kenneth M; Bottinger, Erwin; Cagan, Andrew; Carrell, David S; Denny, Joshua C; Ellis, Stephen B; Gottesman, Omri; Linneman, James G; Pathak, Jyotishman; Peissig, Peggy L; Shang, Ning; Tromp, Gerard; Veerappan, Annapoorani; Smith, Maureen E; Chisholm, Rex L; Gawron, Andrew J; Hayes, M Geoffrey; Kho, Abel N.
Afiliação
  • Joo YY; Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Pacheco JA; Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Thompson WK; Center for Health Information Partnerships, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Rasmussen-Torvik LJ; Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Rasmussen LV; Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Lin FTJ; Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Andrade M; College of Medicine, Mayo Clinic, Rochester, MN, United States of America.
  • Borthwick KM; Geisinger, Danville, PA, United States of America.
  • Bottinger E; Icahn School of Medicine at Mount Sinai, New York, NY, United States of America.
  • Cagan A; Partners Healthcare, Charlestown, MA, United States of America.
  • Carrell DS; Kaiser Permanente Washington Health Research Institute, Seattle, Washington, United States of America.
  • Denny JC; Departments of Biomedical Informatics and Medicine, Vanderbilt University, Nashville, TN, United States of America.
  • Ellis SB; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, United States of America.
  • Gottesman O; The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, United States of America.
  • Linneman JG; Office of Research Computing and Analytics, Marshfield Clinic Research Institute, Marshfield, WI, United States of America.
  • Pathak J; Department of Healthcare Policy and Research, Weill Cornell Medical College, New York, NY, United States of America.
  • Peissig PL; Center for Precision Medicine Research, Marshfield Clinic Research Institute, Marshfield, WI, United States of America.
  • Shang N; Department of Biomedical Informatics, Columbia University, New York, NY, United States of America.
  • Tromp G; Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Faculty of Medicine and Health Sciences, Stellenbosch University, Stellenbosch, South Africa.
  • Veerappan A; Department of Medicine, Gastroenterology, Duke University, Durham, NC, United States of America.
  • Smith ME; Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Chisholm RL; Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Gawron AJ; Division of Gastroenterology, Hepatology & Nutrition, University of Utah, Salt Lake City, UT, United States of America.
  • Hayes MG; Department of Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
  • Kho AN; Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States of America.
PLoS One ; 18(5): e0283553, 2023.
Article em En | MEDLINE | ID: mdl-37196047
ABSTRACT

OBJECTIVE:

Diverticular disease (DD) is one of the most prevalent conditions encountered by gastroenterologists, affecting ~50% of Americans before the age of 60. Our aim was to identify genetic risk variants and clinical phenotypes associated with DD, leveraging multiple electronic health record (EHR) data sources of 91,166 multi-ancestry participants with a Natural Language Processing (NLP) technique. MATERIALS AND

METHODS:

We developed a NLP-enriched phenotyping algorithm that incorporated colonoscopy or abdominal imaging reports to identify patients with diverticulosis and diverticulitis from multicenter EHRs. We performed genome-wide association studies (GWAS) of DD in European, African and multi-ancestry participants, followed by phenome-wide association studies (PheWAS) of the risk variants to identify their potential comorbid/pleiotropic effects in clinical phenotypes.

RESULTS:

Our developed algorithm showed a significant improvement in patient classification performance for DD analysis (algorithm PPVs ≥ 0.94), with up to a 3.5 fold increase in terms of the number of identified patients than the traditional method. Ancestry-stratified analyses of diverticulosis and diverticulitis of the identified subjects replicated the well-established associations between ARHGAP15 loci with DD, showing overall intensified GWAS signals in diverticulitis patients compared to diverticulosis patients. Our PheWAS analyses identified significant associations between the DD GWAS variants and circulatory system, genitourinary, and neoplastic EHR phenotypes.

DISCUSSION:

As the first multi-ancestry GWAS-PheWAS study, we showcased that heterogenous EHR data can be mapped through an integrative analytical pipeline and reveal significant genotype-phenotype associations with clinical interpretation.

CONCLUSION:

A systematic framework to process unstructured EHR data with NLP could advance a deep and scalable phenotyping for better patient identification and facilitate etiological investigation of a disease with multilayered data.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Divertículo / Diverticulite / Doenças Diverticulares Tipo de estudo: Clinical_trials / Prognostic_studies / Qualitative_research / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Divertículo / Diverticulite / Doenças Diverticulares Tipo de estudo: Clinical_trials / Prognostic_studies / Qualitative_research / Risk_factors_studies Limite: Humans Idioma: En Ano de publicação: 2023 Tipo de documento: Article