RESUMO
Validation metrics are key for tracking scientific progress and bridging the current chasm between artificial intelligence research and its translation into practice. However, increasing evidence shows that, particularly in image analysis, metrics are often chosen inadequately. Although taking into account the individual strengths, weaknesses and limitations of validation metrics is a critical prerequisite to making educated choices, the relevant knowledge is currently scattered and poorly accessible to individual researchers. Based on a multistage Delphi process conducted by a multidisciplinary expert consortium as well as extensive community feedback, the present work provides a reliable and comprehensive common point of access to information on pitfalls related to validation metrics in image analysis. Although focused on biomedical image analysis, the addressed pitfalls generalize across application domains and are categorized according to a newly created, domain-agnostic taxonomy. The work serves to enhance global comprehension of a key topic in image analysis validation.
Assuntos
Inteligência ArtificialRESUMO
Increasing evidence shows that flaws in machine learning (ML) algorithm validation are an underestimated global problem. In biomedical image analysis, chosen performance metrics often do not reflect the domain interest, and thus fail to adequately measure scientific progress and hinder translation of ML techniques into practice. To overcome this, we created Metrics Reloaded, a comprehensive framework guiding researchers in the problem-aware selection of metrics. Developed by a large international consortium in a multistage Delphi process, it is based on the novel concept of a problem fingerprint-a structured representation of the given problem that captures all aspects that are relevant for metric selection, from the domain interest to the properties of the target structure(s), dataset and algorithm output. On the basis of the problem fingerprint, users are guided through the process of choosing and applying appropriate validation metrics while being made aware of potential pitfalls. Metrics Reloaded targets image analysis problems that can be interpreted as classification tasks at image, object or pixel level, namely image-level classification, object detection, semantic segmentation and instance segmentation tasks. To improve the user experience, we implemented the framework in the Metrics Reloaded online tool. Following the convergence of ML methodology across application domains, Metrics Reloaded fosters the convergence of validation methodology. Its applicability is demonstrated for various biomedical use cases.
Assuntos
Algoritmos , Processamento de Imagem Assistida por Computador , Aprendizado de Máquina , SemânticaRESUMO
Ultrasound (US) has gained popularity as a guidance modality for percutaneous needle insertions because it is widely available and non-ionizing. However, coordinating scanning and needle insertion still requires significant experience. Current assistance solutions utilize optical or electromagnetic tracking (EMT) technology directly integrated into the US device or probe. This results in specialized devices or introduces additional hardware, limiting the ergonomics of both the scanning and insertion process. We developed the first ultrasound (US) navigation solution designed to be used as a non-permanent accessory for existing US devices while maintaining the ergonomics during the scanning process. A miniaturized EMT source is reversibly attached to the US probe, temporarily creating a combined modality that provides real-time anatomical imaging and instrument tracking at the same time. Studies performed with 11 clinical operators show that the proposed navigation solution can guide needle insertions with a targeting accuracy of about 5 mm, which is comparable to existing approaches and unaffected by repeated attachment and detachment of the miniaturized tracking solution. The assistance proved particularly helpful for non-expert users and needle insertions performed outside of the US plane. The small size and reversible attachability of the proposed navigation solution promises streamlined integration into the clinical workflow and widespread access to US navigated punctures.