Caffe-Data-Augmentation
The original repository for Caffe, developed by the Berkeley Vision and Learning Center (BVLC) and community contributors, is at (BVLC\caffe)
This project adds a data augmentation feature to caffe, augmenting the data in 9 several ways.
The ways in which the data is augmentated is explained here:
- Image Translation - a random shift in a x and y axis pixels of the entire image. The shift has uniform probability between -7 and 7.
- Image Rescailing - shrinking or enalrgin the image (before cropping) by a random unifrom factor between 0.8 and 1.2.
- Horizontal Flipping - flipping the image in the horizontal axis.
- Vertical Flipping - flipping the image in the vertical axis.
- Elastic Deformation with Random Interpolation - dislocate pixels and use OpenCV interpolations method randomly.
- Color Noising - adding a small independent noise to each color channel of the image.
- Brightness Noising - adding a small noise to the brightness of each pixel.
- Small Blurring - convolving the image with small random-sized blurring kernel.
- Single Random Transformation - choosing a transformation at random.
- Multiple Random Transformations - chooses each transofrmation with probability 1/7, such that the mean is one transformation for every image.
The desired transformation(s) is chosen by parameter transform_type within the prototxt file for the data layer. The value of the parameter corresponds to the transform schemes described above.
For example, transform_type=4 uses vertical flipping as it's transformation.
This project was developed by Shani Rehana, Baruch Epstein and Shahar Katz.
The latest version for this project is rc2.