VisionLearningGroup / DAL

Domain agnostic learning with disentangled representations
145 stars 28 forks source link

Refactoring to reproduce paper results #4

Closed marcociccone closed 4 years ago

marcociccone commented 4 years ago

Hi, first of all congrats for your work and thanks for sharing the code :) I've read the paper and checked the code and I found few inconsistencies, so I thought it would be good to refactor your codebase to have a better understanding of all the pieces. I am going to ask you a few questions that I hope will help me and other researchers in building upon your great work.

Thank you for your help!

image

marcociccone commented 4 years ago

Hi @xcpeng , I'm not sure you'd want to merge my PR into your master. I'm still debugging it. I've found a couple of bugs, for sure the Mutual Information minimization was wrong. Do you have time to check the correctness of the implementation with me?

marcociccone commented 4 years ago

Few more details: I believe the code was minimizing the MI between (di, ci) and (ds, ci) instead of (di, ci) and (di, ds) as described in the paper. Fixing this here I'm not having the nan loss anymore, but the training is still very unstable.

I've integrated tensorboard so you can easily visualize the loss functions. Let me know what you think!

YiDongOuYang commented 4 years ago

@marcociccone Thank you very much! I don't think that paper is a qualified work since the author cannot explain the results and the misleading code. But I would like to appreciate for your efforts.