Abstract
AbstractApproaches for skin neoplasm diagnosis include physical exam, skin biopsy, lab tests of biopsy samples, and image analyses. These approaches often involve error-prone and time-consuming processes. Recent studies show that machine learning has promises to effectively classify skin images into different classes such as melanoma and melanocytic nevi. In this work, we investigate machine learning approaches to enhance the performance of computer-aided diagnosis (CADx) systems to diagnose skin diseases. In the proposed CADx system, generative adversarial network (GAN) is used to identify (and remove) fake images. Exploratory data analysis (EDA) is applied to normalize the original dataset for preventing model overfitting. Synthetic minority over-sampling technique (SMOTE) is employed to rectify class imbalances in the original dataset. To accurately classify skin images, the following four machine learning models are utilized: linear discriminant analysis (LDA), support vector machine (SVM), convolutional neural network (CNN), and an ensemble CNN-SVM. Experimental results using the HAM10000 dataset demonstrate the ability of the machine learning models to improve CADx performance in treating skin neoplasm. Initially, the LDA, SVM, CNN, and ensemble CNN-SVM show 49%, 72%, 77%, and 79% accuracy, respectively. After applying GAN, EDA, and SMOTE, the LDA, SVM, CNN, and ensemble CNN-SVM show 76%, 83%, 87%, and 94% accuracy, respectively. We plan to explore other machine learning models and datasets in our next endeavor.
Publisher
Cold Spring Harbor Laboratory