This is the capstone project for the Datasciene bootcamp of neuefische GmbH ffm-ds-23, where the task was to utilize the knowledge from the bootcamp to solve an everyday problem.
The teams were self organized and the topics were chosen by the trainees themselfes. Team DermaNerds formed from the passion of using neural networks in Lifescience and Healthcare applications and chose the HAM10000 ("Human Against Machine with 10000 training images") dataset for this project as a stellar example of image classification in medicine.
The HAM10000 dataset addresses the challenge of training neural networks for automated skin lesion diagnosis by providing a diverse collection of 10,015 dermatoscopic images from different populations, modalities, and diagnostic categories, making it a valuable resource for academic machine learning research in this field.
ResNet-50 is a deep convolutional neural network known for its superior accuracy, reduced overfitting, and ease of transfer learning. After comparing to other CNN architectures like VGG16, MobileNet and others ResNet50 showed the best performence on the dataset. All final evaluations were performed on three classes namely "benign", "malignant", "non-neoplastic and the classes were grouped via the metadata_engineering notebook.
Use the requirements file in this repo to create a new environment.
make setup
#or
pyenv local 3.11.3
python -m venv .venv
source .venv/bin/activate
pip install --upgrade pip
pip install -r requirements.txt
In this 4 week project the following was achieved: