TY - JOUR
T1 - Skin lesions of face and scalp – Classification by a market-approved convolutional neural network in comparison with 64 dermatologists
AU - Haenssle, Holger Andreas
AU - Winkler, Julia Katharina
AU - Fink, Christine
AU - Toberer, Ferdinand
AU - Enk, Alexander
AU - Stolz, Wilhelm
AU - Deinlein, Teresa
AU - Hofmann-Wellenhof, Rainer
AU - Kittler, Harald
AU - Tschandl, Philipp
AU - Rosendahl, Cliff
AU - Lallas, Aimilios
AU - Blum, Andreas
AU - Abassi, Mohamed Souhayel
AU - Thomas, Luc
AU - Tromme, Isabelle
AU - Rosenberger, Albert
AU - Reader study level-I and level-II Groups Christina Alt
A2 - Bachelerie, Marie
A2 - Bajaj, Sonali
A2 - Balcere, Alise
A2 - Baricault, Sophie
A2 - Barthaux, Clément
A2 - Beckenbauer, Yvonne
A2 - Bertlich, Ines
A2 - Bouthenet, Marie France
A2 - Brassat, Sophie
A2 - Buck, Philipp Marcel
A2 - Buder-Bakhaya, Kristina
A2 - Cappelletti, Maria Letizia
A2 - Chabbert, Cécile
A2 - De Labarthe, Julie
A2 - DeCoster, Eveline
A2 - Dobler, Michèle
A2 - Dumon, Daphnée
A2 - Emmert, Steffen
A2 - Gachon-Buffet, Julie
A2 - Gusarov, Mikhail
A2 - Hartmann, Franziska
A2 - Hartmann, Julia
A2 - Herrmann, Anke
A2 - Hoorens, Isabelle
A2 - Hulstaert, Eva
A2 - Karls, Raimonds
A2 - Kolonte, Andreea
A2 - Kromer, Christian
A2 - Le Blanc Vasseux, Céline
A2 - Levy-Roy, Annabelle
A2 - Majenka, Pawel
A2 - Marc, Marine
A2 - Bourret, Veronique Martin
N1 - Funding Information:
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: A Blum received honoraria and/or travel expenses from Heine Optotechnik GmbH and FotoFinder Systems GmbH. C Fink received travel expenses from Magnosco GmbH. HA Haenssle received honoraria and/or travel expenses from companies involved in the development of devices for skin cancer screening: Scibase AB, FotoFinder Systems GmbH, Heine Optotechnik GmbH, Magnosco GmbH. P Tschandl has received honoraria from Silverchair, and an unrestricted research grant from MetaOptima Technology Inc.
Funding Information:
This research project received funding from a public, non-for-profit domain, namely the Skin Cancer Council Germany ( Nationale Versorgungskonferenz Hautkrebs (NVKH e.V.)), www.nvkh.de/ .
Publisher Copyright:
© 2020 Elsevier Ltd
Copyright:
Copyright 2020 Elsevier B.V., All rights reserved.
PY - 2021/2/1
Y1 - 2021/2/1
N2 - Background: The clinical differentiation of face and scalp lesions (FSLs) is challenging even for trained dermatologists. Studies comparing the diagnostic performance of a convolutional neural network (CNN) with dermatologists in FSL are lacking. Methods: A market-approved CNN (Moleanalyzer-Pro, FotoFinder Systems) was used for binary classifications of 100 dermoscopic images of FSL. The same lesions were used in a two-level reader study including 64 dermatologists (level I: dermoscopy only; level II: dermoscopy, clinical close-up images, textual information). Primary endpoints were the CNN's sensitivity and specificity in comparison with the dermatologists' management decisions in level II. Generalizability of the CNN results was tested by using four additional external data sets. Results: The CNN's sensitivity, specificity and ROC AUC were 96.2% [87.0%–98.9%], 68.8% [54.7%–80.1%] and 0.929 [0.880–0.978], respectively. In level II, the dermatologists' management decisions showed a mean sensitivity of 84.2% [82.2%–86.2%] and specificity of 69.4% [66.0%–72.8%]. When fixing the CNN's specificity at the dermatologists' mean specificity (69.4%), the CNN's sensitivity (96.2% [87.0%–98.9%]) was significantly higher than that of dermatologists (84.2% [82.2%–86.2%]; p < 0.001). Dermatologists of all training levels were outperformed by the CNN (all p < 0.001). In confirmation, the CNN's accuracy (83.0%) was significantly higher than dermatologists' accuracies in level II management decisions (all p < 0.001). The CNN's performance was largely confirmed in three additional external data sets but particularly showed a reduced specificity in one Australian data set including FSL on severely sun-damaged skin. Conclusions: When applied as an assistant system, the CNN's higher sensitivity at an equivalent specificity may result in an improved early detection of face and scalp skin cancers.
AB - Background: The clinical differentiation of face and scalp lesions (FSLs) is challenging even for trained dermatologists. Studies comparing the diagnostic performance of a convolutional neural network (CNN) with dermatologists in FSL are lacking. Methods: A market-approved CNN (Moleanalyzer-Pro, FotoFinder Systems) was used for binary classifications of 100 dermoscopic images of FSL. The same lesions were used in a two-level reader study including 64 dermatologists (level I: dermoscopy only; level II: dermoscopy, clinical close-up images, textual information). Primary endpoints were the CNN's sensitivity and specificity in comparison with the dermatologists' management decisions in level II. Generalizability of the CNN results was tested by using four additional external data sets. Results: The CNN's sensitivity, specificity and ROC AUC were 96.2% [87.0%–98.9%], 68.8% [54.7%–80.1%] and 0.929 [0.880–0.978], respectively. In level II, the dermatologists' management decisions showed a mean sensitivity of 84.2% [82.2%–86.2%] and specificity of 69.4% [66.0%–72.8%]. When fixing the CNN's specificity at the dermatologists' mean specificity (69.4%), the CNN's sensitivity (96.2% [87.0%–98.9%]) was significantly higher than that of dermatologists (84.2% [82.2%–86.2%]; p < 0.001). Dermatologists of all training levels were outperformed by the CNN (all p < 0.001). In confirmation, the CNN's accuracy (83.0%) was significantly higher than dermatologists' accuracies in level II management decisions (all p < 0.001). The CNN's performance was largely confirmed in three additional external data sets but particularly showed a reduced specificity in one Australian data set including FSL on severely sun-damaged skin. Conclusions: When applied as an assistant system, the CNN's higher sensitivity at an equivalent specificity may result in an improved early detection of face and scalp skin cancers.
KW - Actinic keratosis
KW - Basal cell carcinoma
KW - Deep learning
KW - Dermoscopy
KW - Lentigo maligna
KW - Melanoma
KW - Moleanalyzer-pro
KW - Neural network
KW - Seborrheic keratosis
KW - Skin cancer
KW - Solar lentigo
UR - http://www.scopus.com/inward/record.url?scp=85098207132&partnerID=8YFLogxK
U2 - 10.1016/j.ejca.2020.11.034
DO - 10.1016/j.ejca.2020.11.034
M3 - Article
AN - SCOPUS:85098207132
SN - 0959-8049
VL - 144
SP - 192
EP - 199
JO - European Journal of Cancer
JF - European Journal of Cancer
ER -