Effective and efficient active learning for deep learning-based tissue image analysis.

Meirelles, André L S; Kurc, Tahsin; Kong, Jun; Ferreira, Renato; Saltz, Joel; Teodoro, George

Meirelles, André L S; Kurc, Tahsin; Kong, Jun; Ferreira, Renato; Saltz, Joel; Teodoro, George.

Afiliación

Meirelles ALS; Department of Computer Science, University of Brasília, Brasília 70910-900, Brazil.
Kurc T; Biomedical Informatics Department, Stony Brook University, Stony Brook, NY 11794-8322, USA.
Kong J; Department of Mathematics and Statistics and Computer Science, Georgia State University, Atlanta, GA 30302-4110, USA.
Ferreira R; Department of Computer Science, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Brazil.
Saltz J; Biomedical Informatics Department, Stony Brook University, Stony Brook, NY 11794-8322, USA.
Teodoro G; Department of Computer Science, University of Brasília, Brasília 70910-900, Brazil.

Bioinformatics ; 39(4)2023 04 03.

Article en En | MEDLINE | ID: mdl-36943380

ABSTRACT

ABSTRACT

MOTIVATION Deep learning attained excellent results in digital pathology recently. A challenge with its use is that high quality, representative training datasets are required to build robust models. Data annotation in the domain is labor intensive and demands substantial time commitment from expert pathologists. Active learning (AL) is a strategy to minimize annotation. The goal is to select samples from the pool of unlabeled data for annotation that improves model accuracy. However, AL is a very compute demanding approach. The benefits for model learning may vary according to the strategy used, and it may be hard for a domain specialist to fine tune the solution without an integrated interface.

RESULTS:

We developed a framework that includes a friendly user interface along with run-time optimizations to reduce annotation and execution time in AL in digital pathology. Our solution implements several AL strategies along with our diversity-aware data acquisition (DADA) acquisition function, which enforces data diversity to improve the prediction performance of a model. In this work, we employed a model simplification strategy [Network Auto-Reduction (NAR)] that significantly improves AL execution time when coupled with DADA. NAR produces less compute demanding models, which replace the target models during the AL process to reduce processing demands. An evaluation with a tumor-infiltrating lymphocytes classification application shows that (i) DADA attains superior performance compared to state-of-the-art AL strategies for different convolutional neural networks (CNNs), (ii) NAR improves the AL execution time by up to 4.3×, and (iii) target models trained with patches/data selected by the NAR reduced versions achieve similar or superior classification quality to using target CNNs for data selection. AVAILABILITY AND IMPLEMENTATION Source code https//github.com/alsmeirelles/DADA.

Asunto(s)

Aprendizaje Profundo; Redes Neurales de la Computación; Programas Informáticos; Procesamiento de Imagen Asistido por Computador; Curaduría de Datos

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Profundo Tipo de estudio: Prognostic_studies Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2023 Tipo del documento: Article País de afiliación: Brasil

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google