Note: this repo is currently under heavy development. It's not ready for general consumption. So, please refrain yourself from using it in production.
The goal of this project is to buid a single end-to-end deep learning model for more accurate and faster (near real-time) multi-object detection that can be train in single-pass of multiple different pieces:
These techniques and methods from various research papers will be implemented using PyTorch.
We will be using Pascal VOC2007 dataset.
# Select the script that you want to train for reproducing a results
./retina_ce_sgd_0.001.sh
# For the focal loss use ./retina_focal_sgd_0.0001.sh
You can see the details in trainer.py
# specify a directory for dataset to be downloaded into, else default is ~/data/
sh data/scripts/VOC2007.sh # <directory>
# specify a directory for dataset to be downloaded into, else default is ~/data/
sh data/scripts/VOC2012.sh # <directory>