Tensorflow models for Anime character object detection.
We provide two pretrained models: One for face detection and one for figure segmentation. Download them at Google Drive!
Here is a full example script to classify faces in the sample images:
# Assume you have saved the pretrained models under "model/" directory.
python infer_from_image.py \
--input_images=samples/inputs/*
--output_path=samples/face_detection_outputs
--inference_graph=model/face_detection/frozen_inference_graph.pb
--visualize_inference=True
In general, use the script with the following format
python infer_from_image.py \
--input_images=/PATH/TO/IMAGES/* \
--output_path=/PATH/TO/OUTPUT/FOLDER/ \
--inference_graph=/PATH/TO/frozen_inference_graph.pb \
--visualize_inference=True
For figure segmentation model, please point --inference_graph
to the segmentation model and add the additional flag
--detect_masks=True
The Face Detection is trained on the outputs of animeface-2009 detector, which is under the MIT license. The main motivation for a tensorflow version is for a much faster GPU-based detection (5-10x speedup).
The Figure Segmentation model is trained on a manually annotated dataset, which you can download here. Note some images are NSFW. We release this dataset purely for research. Please use at your own risk.
To train the model, we overlayed segmented anime figures on top of pure background images to create an artificial dataset. We found this gives a decent performance. You can find pure background images in the Danbooru 2018 dataset. Please contact us if you'd like to use our pre-generated tfrecords.
Both model is trained using the Tensorflow Object Detection repo. Our repo is a simplified fork of that.
Danbooru2019 Figures is a dataset generated by Gwern who ran AniSeg on Danbooru2019 solo SFW images to generate 855k crops of close-up single-character whole-body/profile images. Check out a BIGGAN model trained on that dataset here.
Shameless self promotion of my TwinGAN model to turn people into anime characteres and cats!
Sketch coloring using PaintsTransfer and PaintsChainer.
Create anime portraits at Crypko and MakeGirlsMoe
The all-encompassing anime dataset Danbooru2017 by gwern.
My hand-curated sketch-colored image dataset.
Getting clean data is hard. The object detectors provided here can make that process easier. For example it can be used to crop faces from the Danbooru2017 dataset. Here we show some sample results of combining the two mdoels:
There is always a tradeoff between precision and recall. Please adjust the --min_score_thresh
flag accordingly.
Higher threshold means more accurate results but also more false negatives (higher chance to miss objects). Lower value gives more faces but also contains non-face objects.
Inference using CPU is slow. Please make sure you have a GPU/TPU.
If you decide to use our model and/or data, we ask you to kindly cite this repo. Have fun!