This is a TensorFlow implementation of 'Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer', which generates stylized images at high resolutions such as 1024 pixels.
$ git clone https://github.com/fullfanta/multimodal_style_transfer.git
To train the network, I use the MS COCO dataset.
$ cd multimodal_style_transfer
$ bash get_coco.sh
For stylization, a pretrained VGG16 model is required.
$ bash get_vgg16.sh
Then training is SIMPLE.
$ python train.py
During training, you can view the generated images in TensorBoard.
$ tensorboard --logdir=summary
After training, freeze the model.
$ sh freeze.sh 10000
Then stylize an image with a frozen model.
$ python stylize.py --model=models/starry_night.pb --input_image=test_images/jolie.jpg
--model : path to the frozen model
--input_image : image file path to stylize
--hierarchical_short_edges : three short-edge lengths at which images are generated (default: 256, 512, 1024)
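To illustrate what a short-edge length means, here is a minimal sketch that computes the output size for each of the three default values while preserving the aspect ratio. The function names `short_edge_size` and `hierarchical_sizes` are hypothetical, and the rounding rule is an assumption; `stylize.py` may compute its sizes differently.

```python
def short_edge_size(width, height, short_edge):
    """Scale (width, height) so that the shorter side equals short_edge.

    Hypothetical helper for illustration only; the aspect ratio is
    preserved and the results are rounded to the nearest pixel.
    """
    shorter = min(width, height)
    new_width = max(1, round(width * short_edge / shorter))
    new_height = max(1, round(height * short_edge / shorter))
    return new_width, new_height


def hierarchical_sizes(width, height, short_edges=(256, 512, 1024)):
    """Return one (width, height) pair per requested short-edge length."""
    return [short_edge_size(width, height, s) for s in short_edges]


if __name__ == "__main__":
    # An 800x600 input: the 600-pixel side is the short edge.
    for size in hierarchical_sizes(800, 600):
        print(size)
```

For an 800x600 input, the short edge is the 600-pixel side, so the three outputs have heights of 256, 512, and 1024 pixels, with widths scaled by the same factor.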
Input | Output (256px) | Output (512px) | Output (1024px)
---|---|---|---
Angelina Jolie | | | |
Dinosaur | | | |
Ryan | | | |
Cheez | | | |
Herb | | | |