Deep Learning in Computer Vision (deep-vision)
PyTorch / Tensorflow implementations of classic deep neural networks and training scripts for computer vision tasks. This is used to ease the learning curve for new DL practitioners by two principles: 1) Keep the coding style consistent accross all networks 2) Focus on the code readability and avoid obscure tricks. If you think my work is helpful, please ⭐star⭐ this repo. If you have any questions regarding the code, feel free create an issue.
The directory is categorized by model architecture, then further by framework. Some pretrained models, Jupyter notebook visuliazation script, and training logs are also provided for your reference.
Image Classification
- AlexNet
- VGG
- Inception (GoogLeNet)
- PyTorch
- Inception V1
- Inception V3
- ResNet
- PyTorch
- TensorFlow
- ResNet-50/152 V1
- ResNet-50 V2
- MobileNet
- LeNet
Object Detection
Generative Adversarial Network
Pose Estimation
Disclaimer
- This repo is mainly for study purpose. Hence I write the code in a readable and understandable way, but may not be scalable and reusable. I've also added comments and referrence for those catches I ran into during replication.
- I'm not a researcher so don't have that much of time to tune the training and achieve the best benchmark. If you are looking for pre-trained models for transfer learning, there're some good ones from PyTorch torchvision or TensorFlow slim.
Acknowledgement
Without the following resources I wouldn't be able to finish this project: