AlfredXiangWu / LightCNN

A Light CNN for Deep Face Representation with Noisy Labels, TIFS 2018
https://arxiv.org/abs/1511.02683
MIT License
1.01k stars 166 forks source link

The loss value does not converge #12

Closed pthai1204 closed 6 years ago

pthai1204 commented 6 years ago

Hi @AlfredXiangWu ,

Thank you for your great work. I have tried your code training on your MS-Celeb list from scratch. First of all, I used learning rate of 0.001 and I got Nan value after a few iteration . So, I tried to running with smaller learning rate of 0.0001 but the loss value did not decrease and fluctuated after 80 epoches.

Here is my configuration:

for name, value in model.named_parameters(): if 'bias' in name: if 'fc2' in name: params += [{'params':value, 'lr': 10 args.lr, 'weight_decay': 0}] else: params += [{'params':value, 'lr': 2 args.lr, 'weight_decay': 0}] else: if 'fc2' in name: params += [{'params':value, 'lr': 10 args.lr}] else: params += [{'params':value, 'lr': 1 args.lr}]

Thanks, Hai

AlfredXiangWu commented 6 years ago

How do you align the face images for training? The data preparation is important for training.

Besides, do you shuffle the image list when training? The train.py script sets shuffle=False for the train loader because I shuffle the image list manually.