Model Description
SkinSAM is on the 12-layer ViT-b model, the mask decoder module of SAM is fine-tuned on a combined dataset of ISIC and PH2 skin lesion images and masks. SkinSAM was trained on an Nvidia Tesla A100 40GB GPU.
Some of the notable results taken:
ISIC Dataset:
- IOU 78.25%
- Pixel Accuracy 92.18%
- F1 Score 87.47%
PH2 Dataset:
- IOU 86.68%
- Pixel Accuracy 93.33%
- F1 Score 93.95%
- Downloads last month
- 48
Inference API (serverless) does not yet support transformers models for this pipeline type.