Your browser doesn't support javascript.
loading
Optimizing skin disease diagnosis: harnessing online community data with contrastive learning and clustering techniques.
Shen, Yue; Li, Huanyu; Sun, Can; Ji, Hongtao; Zhang, Daojun; Hu, Kun; Tang, Yiqi; Chen, Yu; Wei, Zikun; Lv, Junwei.
Affiliation
  • Shen Y; Simulation of Complex Systems Lab, Department of Human and Engineered Environmental Studies, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
  • Li H; Shanghai Beforteen AI Lab, Shanghai, China.
  • Sun C; Institution of Aix-marseille, Wuhan University of Technology WHUT, Wuhan City, China.
  • Ji H; Shanghai Business School No. 6333, Oriental Meigu Avenue, Shanghai, China.
  • Zhang D; The third affiliated hospital of CQMU, Chongqing, China.
  • Hu K; Shanghai Beforteen AI Lab, Shanghai, China.
  • Tang Y; Shanghai Beforteen AI Lab, Shanghai, China.
  • Chen Y; Simulation of Complex Systems Lab, Department of Human and Engineered Environmental Studies, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
  • Wei Z; Shanghai Beforteen AI Lab, Shanghai, China. 694994343@qq.com.
  • Lv J; Shanghai Beforteen AI Lab, Shanghai, China. lvjunwei@joyingmed.com.
NPJ Digit Med ; 7(1): 28, 2024 Feb 08.
Article in En | MEDLINE | ID: mdl-38332257
ABSTRACT
Skin diseases pose significant challenges in China. Internet health forums offer a platform for millions of users to discuss skin diseases and share images for early intervention, leaving large amount of valuable dermatology images. However, data quality and annotation challenges limit the potential of these resources for developing diagnostic models. In this study, we proposed a deep-learning model that utilized unannotated dermatology images from diverse online sources. We adopted a contrastive learning approach to learn general representations from unlabeled images and fine-tuned the model on coarsely annotated images from Internet forums. Our model classified 22 common skin diseases. To improve annotation quality, we used a clustering method with a small set of standardized validation images. We tested the model on images collected by 33 experienced dermatologists from 15 tertiary hospitals and achieved a 45.05% top-1 accuracy, outperforming the published baseline model by 3%. Accuracy increased with additional validation images, reaching 49.64% with 50 images per category. Our model also demonstrated transferability to new tasks, such as detecting monkeypox, with a 61.76% top-1 accuracy using only 50 additional images in the training process. We also tested our model on benchmark datasets to show the generalization ability. Our findings highlight the potential of unannotated images from online forums for future dermatology applications and demonstrate the effectiveness of our model for early diagnosis and potential outbreak mitigation.

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Diagnostic_studies / Prognostic_studies / Screening_studies Language: En Journal: NPJ Digit Med Year: 2024 Document type: Article Affiliation country: Japan Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Type of study: Diagnostic_studies / Prognostic_studies / Screening_studies Language: En Journal: NPJ Digit Med Year: 2024 Document type: Article Affiliation country: Japan Country of publication: United kingdom