Closed printerwhale closed 10 months ago
In my system log, it's written that I run out of memory. However I have 72Gio of memory and I have nothing else running on this instance.
Hi, it is hard to debug from my side. I have not faced this error before. I think 72G is enough for running inference on VoxCeleb2 length videos (7-8secs).
I suggest you pinpoint where in the inference python script the memory is going out of bounds and debug accordingly.
The error occurs in the face detection step. There is the command causing the error : predictions.extend(detector.get_detections_for_batch(np.array(images[i:i + batch_size])))
face_det_batch_size is 64
Seems to work with face_det_batch_size = 32
Here is my settings
real_video_root='dataset/VoxCeleb2/vox2_test_mp4/mp4/' model_path="checkpoints/e7.24.1.3_model260000_paper.pt" sample_path="output_dir" sample_mode="cross" # or "reconstruction" NUM_GPUS=1
GEN_FLAGS="--is_voxceleb2=False --generate_from_filelist=0 --video_path=../test.mp4 --audio_path=../test.mp3 --out_path=../ --save_orig=False --face_det_batch_size 64 --pads 0,0,0,0"
However it doesn't seem to work. I always get :
MPI.COMM_WORLD.Get_rank() 0 os.environ["CUDA_VISIBLE_DEVICES"] 0 Logging to d2l_gen creating model... scripts/inference.sh: line 40: 31738 Killed