Rechercher | Portail Régional BVS

Gamified Crowdsourcing as a Novel Approach to Lung Ultrasound Data Set Labeling: Prospective Analysis.

Duggan, Nicole M; Jin, Mike; Duran Mendicuti, Maria Alejandra; Hallisey, Stephen; Bernier, Denie; Selame, Lauren A; Asgari-Targhi, Ameneh; Fischetti, Chanel E; Lucassen, Ruben; Samir, Anthony E; Duhaime, Erik; Kapur, Tina; Goldsmith, Andrew J.

J Med Internet Res ; 26: e51397, 2024 Jul 04.

Article de Anglais | MEDLINE | ID: mdl-38963923

RÉSUMÉ

BACKGROUND: Machine learning (ML) models can yield faster and more accurate medical diagnoses; however, developing ML models is limited by a lack of high-quality labeled training data. Crowdsourced labeling is a potential solution but can be constrained by concerns about label quality. OBJECTIVE: This study aims to examine whether a gamified crowdsourcing platform with continuous performance assessment, user feedback, and performance-based incentives could produce expert-quality labels on medical imaging data. METHODS: In this diagnostic comparison study, 2384 lung ultrasound clips were retrospectively collected from 203 emergency department patients. A total of 6 lung ultrasound experts classified 393 of these clips as having no B-lines, one or more discrete B-lines, or confluent B-lines to create 2 sets of reference standard data sets (195 training clips and 198 test clips). Sets were respectively used to (1) train users on a gamified crowdsourcing platform and (2) compare the concordance of the resulting crowd labels to the concordance of individual experts to reference standards. Crowd opinions were sourced from DiagnosUs (Centaur Labs) iOS app users over 8 days, filtered based on past performance, aggregated using majority rule, and analyzed for label concordance compared with a hold-out test set of expert-labeled clips. The primary outcome was comparing the labeling concordance of collated crowd opinions to trained experts in classifying B-lines on lung ultrasound clips. RESULTS: Our clinical data set included patients with a mean age of 60.0 (SD 19.0) years; 105 (51.7%) patients were female and 114 (56.1%) patients were White. Over the 195 training clips, the expert-consensus label distribution was 114 (58%) no B-lines, 56 (29%) discrete B-lines, and 25 (13%) confluent B-lines. Over the 198 test clips, expert-consensus label distribution was 138 (70%) no B-lines, 36 (18%) discrete B-lines, and 24 (12%) confluent B-lines. In total, 99,238 opinions were collected from 426 unique users. On a test set of 198 clips, the mean labeling concordance of individual experts relative to the reference standard was 85.0% (SE 2.0), compared with 87.9% crowdsourced label concordance (P=.15). When individual experts' opinions were compared with reference standard labels created by majority vote excluding their own opinion, crowd concordance was higher than the mean concordance of individual experts to reference standards (87.4% vs 80.8%, SE 1.6 for expert concordance; P<.001). Clips with discrete B-lines had the most disagreement from both the crowd consensus and individual experts with the expert consensus. Using randomly sampled subsets of crowd opinions, 7 quality-filtered opinions were sufficient to achieve near the maximum crowd concordance. CONCLUSIONS: Crowdsourced labels for B-line classification on lung ultrasound clips via a gamified approach achieved expert-level accuracy. This suggests a strategic role for gamified crowdsourcing in efficiently generating labeled image data sets for training ML systems.

Sujet(s)

Externalisation ouverte , Poumon , Échographie , Externalisation ouverte/méthodes , Humains , Échographie/méthodes , Échographie/normes , Poumon/imagerie diagnostique , Études prospectives , Femelle , Mâle , Apprentissage machine , Adulte , Adulte d'âge moyen , Études rétrospectives

Deep Learning for Detection and Localization of B-Lines in Lung Ultrasound.

Lucassen, Ruben T; Jafari, Mohammad H; Duggan, Nicole M; Jowkar, Nick; Mehrtash, Alireza; Fischetti, Chanel; Bernier, Denie; Prentice, Kira; Duhaime, Erik P; Jin, Mike; Abolmaesumi, Purang; Heslinga, Friso G; Veta, Mitko; Duran-Mendicuti, Maria A; Frisken, Sarah; Shyn, Paul B; Golby, Alexandra J; Boyer, Edward; Wells, William M; Goldsmith, Andrew J; Kapur, Tina.

IEEE J Biomed Health Inform ; 27(9): 4352-4361, 2023 09.

Article de Anglais | MEDLINE | ID: mdl-37276107

RÉSUMÉ

Lung ultrasound (LUS) is an important imaging modality used by emergency physicians to assess pulmonary congestion at the patient bedside. B-line artifacts in LUS videos are key findings associated with pulmonary congestion. Not only can the interpretation of LUS be challenging for novice operators, but visual quantification of B-lines remains subject to observer variability. In this work, we investigate the strengths and weaknesses of multiple deep learning approaches for automated B-line detection and localization in LUS videos. We curate and publish, BEDLUS, a new ultrasound dataset comprising 1,419 videos from 113 patients with a total of 15,755 expert-annotated B-lines. Based on this dataset, we present a benchmark of established deep learning methods applied to the task of B-line detection. To pave the way for interpretable quantification of B-lines, we propose a novel "single-point" approach to B-line localization using only the point of origin. Our results show that (a) the area under the receiver operating characteristic curve ranges from 0.864 to 0.955 for the benchmarked detection methods, (b) within this range, the best performance is achieved by models that leverage multiple successive frames as input, and (c) the proposed single-point approach for B-line localization reaches an F 1-score of 0.65, performing on par with the inter-observer agreement. The dataset and developed methods can facilitate further biomedical research on automated interpretation of lung ultrasound with the potential to expand the clinical utility.

Sujet(s)

Apprentissage profond , Oedème pulmonaire , Humains , Poumon/imagerie diagnostique , Échographie/méthodes , Oedème pulmonaire/diagnostic , Thorax

Comparison of pulmonary congestion severity using artificial intelligence-assisted scoring versus clinical experts: A secondary analysis of BLUSHED-AHF.

Goldsmith, Andrew J; Jin, Mike; Lucassen, Ruben; Duggan, Nicole M; Harrison, Nicholas E; Wells, William; Ehrman, Robert R; Ferre, Robinson; Gargani, Luna; Noble, Vicki; Levy, Phil; Lane, Katie; Li, Xiaochun; Collins, Sean; Pang, Peter; Kapur, Tina; Russell, Frances M.

Eur J Heart Fail ; 25(7): 1166-1169, 2023 07.

Article de Anglais | MEDLINE | ID: mdl-37218619

RÉSUMÉ

AIM: Acute decompensated heart failure (ADHF) is the leading cause of cardiovascular hospitalizations in the United States. Detecting B-lines through lung ultrasound (LUS) can enhance clinicians' prognostic and diagnostic capabilities. Artificial intelligence/machine learning (AI/ML)-based automated guidance systems may allow novice users to apply LUS to clinical care. We investigated whether an AI/ML automated LUS congestion score correlates with expert's interpretations of B-line quantification from an external patient dataset. METHODS AND RESULTS: This was a secondary analysis from the BLUSHED-AHF study which investigated the effect of LUS-guided therapy on patients with ADHF. In BLUSHED-AHF, LUS was performed and B-lines were quantified by ultrasound operators. Two experts then separately quantified the number of B-lines per ultrasound video clip recorded. Here, an AI/ML-based lung congestion score (LCS) was calculated for all LUS clips from BLUSHED-AHF. Spearman correlation was computed between LCS and counts from each of the original three raters. A total of 3858 LUS clips were analysed on 130 patients. The LCS demonstrated good agreement with the two experts' B-line quantification score (r = 0.894, 0.882). Both experts' B-line quantification scores had significantly better agreement with the LCS than they did with the ultrasound operator's score (p < 0.005, p < 0.001). CONCLUSION: Artificial intelligence/machine learning-based LCS correlated with expert-level B-line quantification. Future studies are needed to determine whether automated tools may assist novice users in LUS interpretation.

Sujet(s)

Défaillance cardiaque , Oedème pulmonaire , Humains , Intelligence artificielle , Défaillance cardiaque/imagerie diagnostique , Défaillance cardiaque/complications , Poumon/imagerie diagnostique , Oedème pulmonaire/imagerie diagnostique , Oedème pulmonaire/étiologie , Échographie/méthodes

Corneal pachymetry by AS-OCT after Descemet's membrane endothelial keratoplasty.

Heslinga, Friso G; Lucassen, Ruben T; van den Berg, Myrthe A; van der Hoek, Luuk; Pluim, Josien P W; Cabrerizo, Javier; Alberti, Mark; Veta, Mitko.

Sci Rep ; 11(1): 13976, 2021 07 07.

Article de Anglais | MEDLINE | ID: mdl-34234179

RÉSUMÉ

Corneal thickness (pachymetry) maps can be used to monitor restoration of corneal endothelial function, for example after Descemet's membrane endothelial keratoplasty (DMEK). Automated delineation of the corneal interfaces in anterior segment optical coherence tomography (AS-OCT) can be challenging for corneas that are irregularly shaped due to pathology, or as a consequence of surgery, leading to incorrect thickness measurements. In this research, deep learning is used to automatically delineate the corneal interfaces and measure corneal thickness with high accuracy in post-DMEK AS-OCT B-scans. Three different deep learning strategies were developed based on 960 B-scans from 50 patients. On an independent test set of 320 B-scans, corneal thickness could be measured with an error of 13.98 to 15.50 µm for the central 9 mm range, which is less than 3% of the average corneal thickness. The accurate thickness measurements were used to construct detailed pachymetry maps. Moreover, follow-up scans could be registered based on anatomical landmarks to obtain differential pachymetry maps. These maps may enable a more comprehensive understanding of the restoration of the endothelial function after DMEK, where thickness often varies throughout different regions of the cornea, and subsequently contribute to a standardized postoperative regime.

Sujet(s)

Pachymétrie cornéenne , Lame limitante postérieure/imagerie diagnostique , Lame limitante postérieure/chirurgie , Tomographie par cohérence optique , Pachymétrie cornéenne/méthodes , Kératoplastie endothéliale automatisée par le stripping de Descemet , Humains

RÉSUMÉ

Sujet(s)

RÉSUMÉ

Sujet(s)

RÉSUMÉ

Sujet(s)

RÉSUMÉ

Sujet(s)

ENVOYER À:

SÉLECTION CITATIONS

DÉTAIL DE RECHERCHE