Your browser doesn't support javascript.
loading
Machine learning-based donor permission extraction from informed consent documents.
Zhang, Meng; Sankaranarayanapillai, Madhuri; Du, Jingcheng; Xiang, Yang; Manion, Frank J; Harris, Marcelline R; Stansbury, Cooper; Pham, Huy Anh; Tao, Cui.
Afiliação
  • Zhang M; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Sankaranarayanapillai M; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Du J; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Xiang Y; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Manion FJ; School of Nursing, University of Michigan, Ann Arbor, MI, 48104, USA.
  • Harris MR; School of Nursing, University of Michigan, Ann Arbor, MI, 48104, USA.
  • Stansbury C; School of Nursing, University of Michigan, Ann Arbor, MI, 48104, USA.
  • Pham HA; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA.
  • Tao C; McWilliam School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, 77030, USA. tao.cui@mayo.edu.
BMC Bioinformatics ; 24(Suppl 3): 477, 2023 Dec 15.
Article em En | MEDLINE | ID: mdl-38102593
ABSTRACT

BACKGROUND:

With more clinical trials are offering optional participation in the collection of bio-specimens for biobanking comes the increasing complexity of requirements of informed consent forms. The aim of this study is to develop an automatic natural language processing (NLP) tool to annotate informed consent documents to promote biorepository data regulation, sharing, and decision support. We collected informed consent documents from several publicly available sources, then manually annotated them, covering sentences containing permission information about the sharing of either bio-specimens or donor data, or conducting genetic research or future research using bio-specimens or donor data.

RESULTS:

We evaluated a variety of machine learning algorithms including random forest (RF) and support vector machine (SVM) for the automatic identification of these sentences. 120 informed consent documents containing 29,204 sentences were annotated, of which 1250 sentences (4.28%) provide answers to a permission question. A support vector machine (SVM) model achieved a F-1 score of 0.95 on classifying the sentences when using a gold standard, which is a prefiltered corpus containing all relevant sentences.

CONCLUSIONS:

This study provides the feasibility of using machine learning tools to classify permission-related sentences in informed consent documents.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bancos de Espécimes Biológicos / Termos de Consentimento Idioma: En Revista: BMC Bioinformatics Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Bancos de Espécimes Biológicos / Termos de Consentimento Idioma: En Revista: BMC Bioinformatics Ano de publicação: 2023 Tipo de documento: Article