RESUMO
Automated region of interest detection in histopathological image analysis is a challenging and important topic with tremendous potential impact on clinical practice. The deep learning methods used in computational pathology may help us to reduce costs and increase the speed and accuracy of cancer diagnosis. We started with the UNC Melanocytic Tumor Dataset cohort which contains 160 hematoxylin and eosin whole slide images of primary melanoma (86) and nevi (74). We randomly assigned 80% (134) as a training set and built an in-house deep learning method to allow for classification, at the slide level, of nevi and melanoma. The proposed method performed well on the other 20% (26) test dataset; the accuracy of the slide classification task was 92.3% and our model also performed well in terms of predicting the region of interest annotated by the pathologists, showing excellent performance of our model on melanocytic skin tumors. Even though we tested the experiments on a skin tumor dataset, our work could also be extended to other medical image detection problems to benefit the clinical evaluation and diagnosis of different tumors.
RESUMO
Cluster analysis has proved to be an invaluable tool for the exploratory and unsupervised analysis of high-dimensional datasets. Among methods for clustering, hierarchical approaches have enjoyed substantial popularity in genomics and other fields for their ability to simultaneously uncover multiple layers of clustering structure. A critical and challenging question in cluster analysis is whether the identified clusters represent important underlying structure or are artifacts of natural sampling variation. Few approaches have been proposed for addressing this problem in the context of hierarchical clustering, for which the problem is further complicated by the natural tree structure of the partition, and the multiplicity of tests required to parse the layers of nested clusters. In this article, we propose a Monte Carlo based approach for testing statistical significance in hierarchical clustering which addresses these issues. The approach is implemented as a sequential testing procedure guaranteeing control of the family-wise error rate. Theoretical justification is provided for our approach, and its power to detect true clustering structure is illustrated through several simulation studies and applications to two cancer gene expression datasets.
Assuntos
Análise por Conglomerados , Algoritmos , Genômica , HumanosRESUMO
AIMS: We applied digital image analysis techniques to study selected types of melanocytic lesions. METHODS AND RESULTS: We used advanced digital image analysis to compare melanocytic lesions as follows: (i) melanoma to nevi, (ii) melanoma subtypes to nevi, (iii) severely dysplastic nevi to other nevi and (iv) melanoma to severely dysplastic nevi. We were successful in differentiating melanoma from nevi [receiver operating characteristic area (ROC) 0.95] using image-derived features, among which those related to nuclear size and shape and distance between nuclei were most important. Dividing melanoma into subtypes, even greater separation was obtained (ROC area 0.98 for superficial spreading melanoma; 0.95 for lentigo maligna melanoma; and 0.99 for unclassified). Severely dysplastic nevi were best differentiated from conventional and mildly dysplastic nevi by differences in cellular staining qualities (ROC area 0.84). We found that melanomas were separated from severely dysplastic nevi by features related to shape and staining qualities (ROC area 0.95). All comparisons were statistically significant (P < 0.0001). CONCLUSIONS: We offer a unique perspective into the evaluation of melanocytic lesions and demonstrate a technological application with increasing prevalence, and with potential use as an adjunct to traditional diagnosis in the future.