Your browser doesn't support javascript.
loading
Considerations for the Use of Machine Learning Extracted Real-World Data to Support Evidence Generation: A Research-Centric Evaluation Framework.
Estevez, Melissa; Benedum, Corey M; Jiang, Chengsheng; Cohen, Aaron B; Phadke, Sharang; Sarkar, Somnath; Bozkurt, Selen.
Afiliação
  • Estevez M; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
  • Benedum CM; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
  • Jiang C; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
  • Cohen AB; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
  • Phadke S; Department of Medicine, NYU Grossman School of Medicine, New York, NY 10016, USA.
  • Sarkar S; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
  • Bozkurt S; Flatiron Health, Inc., 233 Spring Street, New York, NY 10013, USA.
Cancers (Basel) ; 14(13)2022 Jun 22.
Article em En | MEDLINE | ID: mdl-35804834
ABSTRACT
A vast amount of real-world data, such as pathology reports and clinical notes, are captured as unstructured text in electronic health records (EHRs). However, this information is both difficult and costly to extract through human abstraction, especially when scaling to large datasets is needed. Fortunately, Natural Language Processing (NLP) and Machine Learning (ML) techniques provide promising solutions for a variety of information extraction tasks such as identifying a group of patients who have a specific diagnosis, share common characteristics, or show progression of a disease. However, using these ML-extracted data for research still introduces unique challenges in assessing validity and generalizability to different cohorts of interest. In order to enable effective and accurate use of ML-extracted real-world data (RWD) to support research and real-world evidence generation, we propose a research-centric evaluation framework for model developers, ML-extracted data users and other RWD stakeholders. This framework covers the fundamentals of evaluating RWD produced using ML methods to maximize the use of EHR data for research purposes.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article