Your browser doesn't support javascript.
loading
Automated Coding of Job Descriptions From a General Population Study: Overview of Existing Tools, Their Application and Comparison.
Wan, Wenxin; Ge, Calvin B; Friesen, Melissa C; Locke, Sarah J; Russ, Daniel E; Burstyn, Igor; Baker, Christopher J O; Adisesh, Anil; Lan, Qing; Rothman, Nathaniel; Huss, Anke; van Tongeren, Martie; Vermeulen, Roel; Peters, Susan.
Afiliação
  • Wan W; Department Population Health Sciences, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands.
  • Ge CB; Department Population Health Sciences, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands.
  • Friesen MC; Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
  • Locke SJ; Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
  • Russ DE; Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
  • Burstyn I; Department of Environmental and Occupational Health, Drexel University, Dornsife School of Public Health, Philadelphia, PA, USA.
  • Baker CJO; Department of Computer Science, Faculty of Science, Applied Science and Engineering, University of New Brunswick, Saint John, NB, Canada.
  • Adisesh A; Division of Occupational Medicine, Department of Medicine, University of Toronto, Toronto, ON, Canada.
  • Lan Q; Department of Medicine, Dalhousie Medicine New Brunswick, Saint John, NB, Canada.
  • Rothman N; Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH, DHHS, Rockville, MD, USA.
  • Huss A; Occupational and Environmental Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
  • van Tongeren M; Department Population Health Sciences, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands.
  • Vermeulen R; Centre for Occupational and Environmental Health, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Manchester, UK.
  • Peters S; Department Population Health Sciences, Institute for Risk Assessment Sciences, Utrecht University, Utrecht, The Netherlands.
Ann Work Expo Health ; 67(5): 663-672, 2023 06 06.
Article em En | MEDLINE | ID: mdl-36734402
ABSTRACT

OBJECTIVES:

Automatic job coding tools were developed to reduce the laborious task of manually assigning job codes based on free-text job descriptions in census and survey data sources, including large occupational health studies. The objective of this study is to provide a case study of comparative performance of job coding and JEM (Job-Exposure Matrix)-assigned exposures agreement using existing coding tools.

METHODS:

We compared three automatic job coding tools [AUTONOC, CASCOT (Computer-Assisted Structured Coding Tool), and LabourR], which were selected based on availability, coding of English free-text into coding systems closely related to the 1988 version of the International Standard Classification of Occupations (ISCO-88), and capability to perform batch coding. We used manually coded job histories from the AsiaLymph case-control study that were translated into English prior to auto-coding to assess their performance. We applied two general population JEMs to assess agreement at exposure level. Percent agreement and PABAK (Prevalence-Adjusted Bias-Adjusted Kappa) were used to compare the agreement of results from manual coders and automatic coding tools.

RESULTS:

The coding per cent agreement among the three tools ranged from 17.7 to 26.0% for exact matches at the most detailed 4-digit ISCO-88 level. The agreement was better at a more general level of job coding (e.g. 43.8-58.1% in 1-digit ISCO-88), and in exposure assignments (median values of PABAK coefficient ranging 0.69-0.78 across 12 JEM-assigned exposures). Based on our testing data, CASCOT was found to outperform others in terms of better agreement in both job coding (26% 4-digit agreement) and exposure assignment (median kappa 0.61).

CONCLUSIONS:

In this study, we observed that agreement on job coding was generally low for the three tools but noted a higher degree of agreement in assigned exposures. The results indicate the need for study-specific evaluations prior to their automatic use in general population studies, as well as improvements in the evaluated automatic coding tools.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Exposição Ocupacional / Descrição de Cargo Tipo de estudo: Observational_studies / Qualitative_research / Risk_factors_studies Limite: Humans Idioma: En Revista: Ann Work Expo Health Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Holanda

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Exposição Ocupacional / Descrição de Cargo Tipo de estudo: Observational_studies / Qualitative_research / Risk_factors_studies Limite: Humans Idioma: En Revista: Ann Work Expo Health Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Holanda