RESUMEN
BACKGROUND: Application of deep learning to diagnostic dermatology has been the subject of numerous studies, with some reporting skin lesion classification performance on curated datasets comparable to that of experienced dermatologists. Most skin disease images encountered in clinical settings are macroscopic, without dermoscopic information, and exhibit considerable variability. Further research is necessary to determine the generalisability of deep learning algorithms across populations and acquisition settings. OBJECTIVES: We assessed the extent to which deep learning can generalise to non-dermoscopic datasets acquired at the primary-secondary care interface in the National Health Service (NHS). We explored how to obtain clinically satisfactory performance on non-standardised, real-world local data without availability of large diagnostically labelled local datasets. We measured the impact of pre-training deep learning algorithms on external, public-domain datasets. METHODS: Diagnostic macroscopic image datasets were created from previous referrals from primary to secondary care. These included 2213 images referred from primary care practitioners in NHS Tayside and 1510 images from NHS Forth Valley acquired by medical photographers. Two further datasets with identical diagnostic labels were obtained from public domain sources, namely the International Skin Imaging Collaboration (ISIC) dermoscopic dataset and the SD-260 non-dermoscopic dataset. Deep learning algorithms, specifically SWIN transformers and an EfficientNets, were trained using data from each of these datasets. Algorithms were also fine-tuned on images from the NHS datasets after pre-training on different data combinations, including the larger public domain datasets. ROC curves and area under such curves (AUC) were used to assess performance. RESULTS: SWIN transformers tested on Forth Valley data had AUCs of 0.85 and 0.89 when trained on SD-260 and Forth Valley data, respectively. Training on SD-260 followed by fine-tuning of Forth Valley data gave an AUC of 0.91. Similar effects of pre-training and tuning on local data were observed using Tayside data, and EfficientNets. Pre-training on the larger dermoscopic image dataset (ISIC-2019) provided no additional benefit. CONCLUSIONS: Pre-training on public macroscopic images, followed by tuning to local data, gave promising results. Further improvements are needed to afford deployment in real clinical pathways. Larger datasets local to the target domain might be expected to yield further improved performance.
RESUMEN
BACKGROUND: The COVID-19 pandemic necessitated a change in the provision of outpatient care in dermatology. OBJECTIVE: A novel, asynchronous, digital consultation platform was codeveloped with 2 National Health Service dermatology teams to improve access and enhance choice in outpatient care. METHODS: The rollout of the platform was accelerated during the initial COVID-19 lockdown, and its wider use across 2 Scottish health boards was retrospectively evaluated. Integrated with the hospital booking system and electronic patient record, the platform provides an alternative to face-to-face consultations, using information and images submitted by the patients. RESULTS: In total, 297 new patient consultations and 108 return patient consultations were assessed, and 80% (324/405) of the images submitted were of satisfactory quality. The consultations were, on average, 3 minutes shorter than equivalent face-to-face interactions, and a total of 5758 km of patient travel was avoided. Outcomes included web-based reviews (66/405, 16.3%), face-to-face reviews (190/405, 46.9%), biopsies (46/405, 11.4%), discharge (89/405, 22%), and other treatments or investigations (14/405, 3.5%). High levels of patient satisfaction (92/112, 82.1%) were reported. CONCLUSIONS: Digital dermatology assessments are now included in the choices for consultation types that are available to patients, helping to augment service capacity during pandemic recovery.