davidsandberg / facenet

Face recognition using Tensorflow
MIT License

Batch Size for Online Triplet Mining #1191

Open Neihtq opened 3 years ago

Neihtq commented 3 years ago

Hi,

I read through the official FaceNet paper, and it states that a batch size of 1800 is used for online triplet mining. This number seems quite high. I have access to an IBM Power instance with a 32 GB Nvidia Tesla V100 GPU, but a batch that large with LFW images is infeasible.

Is the triplet mining performed on the CPU? I tried to create embeddings for a single batch (of size 1800) on the aforementioned IBM instance, but my Jupyter notebook crashes - I assume the batch size is still too large.

My triplet mining uses batch-hard mining. How should I determine a good batch size?
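For reference, batch-hard mining (as in the question above) picks, for each anchor in the batch, the farthest same-identity sample and the closest different-identity sample. A minimal NumPy sketch of that selection step (function name is illustrative, not from this repo; it assumes every identity has at least two images in the batch):

```python
import numpy as np

def batch_hard_triplets(embeddings, labels):
    """For each anchor, return the index of its hardest positive
    (farthest same-label sample) and hardest negative (closest
    different-label sample) within the batch."""
    # Pairwise squared Euclidean distances, shape (N, N).
    sq = np.sum(embeddings ** 2, axis=1)
    dists = sq[:, None] - 2.0 * embeddings @ embeddings.T + sq[None, :]
    dists = np.maximum(dists, 0.0)

    same = labels[:, None] == labels[None, :]
    np.fill_diagonal(same, False)  # an anchor is not its own positive
    diff = labels[:, None] != labels[None, :]

    # Hardest positive: maximum distance among same-label pairs.
    pos = np.where(same, dists, -np.inf).argmax(axis=1)
    # Hardest negative: minimum distance among different-label pairs.
    neg = np.where(diff, dists, np.inf).argmin(axis=1)
    return pos, neg
```

Because only an N×N distance matrix over the *embeddings* is needed, the mining itself is cheap; the memory bottleneck is the forward/backward pass over the images, which is what limits the batch size in practice.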

Jakub-Svoboda commented 3 years ago

Hi, I don't have the answer for you, but I was puzzled by the same thing when reading the paper. I can fit at most 128 images into a single batch, and I don't think any GPU could hold all 1800 images per batch in memory as the paper describes. Therefore, I suspect that what they call a "mini-batch" is not an actual training batch sent to the GPU at one time, but rather a pool of images from which triplets are selected. I suspect they run through this pool in smaller batches and only then pick the semi-hard examples from the full set of 1800 images.
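That two-stage reading can be sketched as follows: embed the full 1800-image pool in GPU-sized chunks (forward pass only, no gradients retained), then select semi-hard triplets from the resulting embeddings. This is a hypothetical NumPy illustration, not this repo's actual code; `embed_fn`, the chunk size, and the margin `alpha` are all assumptions, and the semi-hard condition d(a,p) < d(a,n) < d(a,p) + α follows the paper's definition:

```python
import numpy as np

def embed_in_chunks(images, embed_fn, chunk_size=128):
    """Run the forward pass over a large selection pool in GPU-sized
    chunks; only the embeddings are kept, not the activations."""
    return np.concatenate(
        [embed_fn(images[i:i + chunk_size])
         for i in range(0, len(images), chunk_size)]
    )

def select_semihard(embeddings, labels, alpha=0.2):
    """Return (anchor, positive, negative) index triplets where the
    negative is farther than the positive but within margin alpha."""
    n = len(labels)
    # Pairwise Euclidean distances over the whole pool.
    d = np.linalg.norm(embeddings[:, None] - embeddings[None, :], axis=2)
    triplets = []
    for a in range(n):
        for p in range(n):
            if p == a or labels[p] != labels[a]:
                continue
            # Semi-hard negatives: d(a,p) < d(a,neg) < d(a,p) + alpha.
            mask = ((labels != labels[a])
                    & (d[a] > d[a, p])
                    & (d[a] < d[a, p] + alpha))
            for neg in np.flatnonzero(mask):
                triplets.append((a, p, neg))
    return triplets
```

Only the selected triplets then need to be re-run through the network in small training batches, which would explain how a "mini-batch" of 1800 fits a pipeline whose GPU batches are far smaller.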