snwagh / falcon-public

Implementation of protocols in Falcon

Weird communication results #45

Open khoaguin opened 2 years ago

khoaguin commented 2 years ago

Hi, I am training LeNet on localhost for 15 iterations. The results are in the picture below: the reported total communication is 900 Mb, yet the communication for P0 alone is already 1673.28 MB. Is something wrong here?

image

snwagh commented 2 years ago

I think there is an issue with the communication accounting (and a separate issue with parallelization as well). Either a counter is wrapping around its bit width or something else is off. Can you please look into this further?
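
To make the wrap-around hypothesis concrete, here is a minimal sketch of how a byte counter stored in 32 bits can under-report once the accumulated traffic exceeds 2^32 bytes. It is not taken from the Falcon code base: the helper name `accumulate_bytes` and the per-iteration byte count are assumptions for illustration only, and whether Falcon's counters actually behave this way is unverified.

```cpp
#include <cstdint>

// Hypothetical helper, NOT part of Falcon: adds the same per-iteration byte
// count `iterations` times into a counter of type Counter and returns the
// final counter value. Unsigned overflow is well defined in C++ and wraps
// modulo 2^N, where N is the bit width of Counter.
template <typename Counter>
Counter accumulate_bytes(std::uint64_t bytes_per_iteration, int iterations) {
    Counter total = 0;
    for (int i = 0; i < iterations; ++i) {
        // The cast truncates to the counter width; the addition may wrap.
        total += static_cast<Counter>(bytes_per_iteration);
    }
    return total;
}
```

With an assumed 400,000,000 bytes per iteration, `accumulate_bytes<std::uint32_t>(400000000ULL, 15)` returns 1705032704, while the 64-bit version returns the true 6000000000. For a single iteration both widths agree, so a total that looks right for 1 iteration and wrong for 15 would fit this pattern. A quick check in the real code would be to look at the type of the variable that accumulates sent and received bytes.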

khoaguin commented 2 years ago

Yes, I am looking into it. For example, if I change the number of iterations to 1, the communication figures look normal again:

image