Skin Cancer Detection with ViTs

Date:

Skin cancer is one of the most common types of cancer worldwide, with millions of new cases diagnosed each year. Early detection is critical in improving patient outcomes, as it increases the chances of successful treatment. In recent years, the advancement of artificial intelligence (AI) techniques has opened new possibilities for detecting skin cancer at an early stage using automated methods. Specifically, Vision Transformers (ViTs) have emerged as a powerful tool in the field of medical imaging, providing state-of-the-art performance in tasks like image classification and segmentation. ViT The goal of this project is to explore the use of ViTs for the classification of skin cancer images. Using data from the ISIC 2024 Challenge, this project investigates the efficacy of ViTs in identifying high-risk cancerous lesions from high-resolution 3D Total Body Photography (3D-TBP) images. By comparing multiple variants of ViT models, this study aims to determine which architecture offers the best performance for skin cancer detection. The dataset used in this project is provided by the ISIC 2024 Challenge, which focuses on detecting skin cancer from high-resolution 3D-TBP images. These images pose a challenge due to their high resolution, requiring efficient model architectures that can handle large amounts of data while maintaining precision in prediction. The LeViT model achieved the highest accuracy (94%), while models like CaiT and Simple ViT also performed well. Token-to-Token ViT ran into memory issues, likely due to its high-resolution patching technique.

code

result

Modelreal scoreIs it Ploto?Best AccNumber of EpochsNumber of parapaper
Simple ViT101yes925053533698https://arxiv.org/abs/2205.01580
CaiT97yes9310120801282https://arxiv.org/abs/2103.17239
Vision Transformer for Small Datasets132yes925054528520https://arxiv.org/abs/2112.13492
Token to Token ViT113out of memory925019801116https://arxiv.org/abs/2101.11986
LeViT125yes9410017348972https://arxiv.org/abs/2104.01136
ats ViT113yes922552463080https://arxiv.org/abs/2111.15667
xcit122yes9225122141864https://arxiv.org/abs/2106.09681
vit_with_patch_merger118yes922577687272https://arxiv.org/abs/2202.12015