ixxi-dante / an2vec

Bringing node2vec and word2vec together for cool stuff
GNU General Public License v3.0

Full batch sensitivity analysis (adjacency reconstruction, feature reconstruction, and both) #6

Open wehlutyk opened 6 years ago

wehlutyk commented 6 years ago

Test the following parameters:

For each run, save:

jaklevab commented 6 years ago

Finished parameter exploration for adjacency reconstruction without feature reconstruction: see gcn-ae-explore.ipynb in branch issue-6-sensitivity-analysis.

wehlutyk commented 6 years ago

#18 adds the following parameters of interest: mini-batch size, length of random walks in the mini-batch, and both those quantities w.r.t. the network size and average community sizes. (I might take over for that if you don't want to explore them @jaklevab, as I keep adding parameters to this :p )
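
A rough sketch of how such a grid could be enumerated, in case it helps frame the exploration. The function name, the dictionary keys, and all the numeric values below are hypothetical and only illustrate expressing mini-batch size and random-walk length relative to the network size and the average community size; they are not the ranges used in the notebooks.

```python
from itertools import product


def build_grid(n_nodes, avg_community_size, batch_sizes, walk_lengths):
    """Enumerate (mini-batch size, walk length) pairs, hypothetical helper.

    Each entry also records the two quantities relative to the network
    size and to the average community size.
    """
    grid = []
    for batch_size, walk_length in product(batch_sizes, walk_lengths):
        if batch_size > n_nodes:
            # A mini-batch larger than the network makes no sense; skip it.
            continue
        grid.append({
            "batch_size": batch_size,
            "walk_length": walk_length,
            "batch_over_network": batch_size / n_nodes,
            "batch_over_community": batch_size / avg_community_size,
            "walk_over_community": walk_length / avg_community_size,
        })
    return grid


# Illustrative values only (assumptions, not taken from the experiments).
grid = build_grid(
    n_nodes=1000,
    avg_community_size=50,
    batch_sizes=[10, 100, 1000, 2000],
    walk_lengths=[5, 50],
)
for params in grid:
    print(params)
```

The entry with batch_size equal to n_nodes is the full-batch case, so it can double as the reference point for comparing the other settings.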

jaklevab commented 6 years ago

Ok so the minibatch sensitivity analysis gives the same result as the full batch when taking the full batch as minibatch size. Remains to be seen how the reconstruction loss behaves with the different parameters affecting the RW and the minibatch. @wehlutyk are you done with the minibatch parametrization?

wehlutyk commented 6 years ago

> Ok so the minibatch sensitivity analysis gives the same result as the full batch when taking the full batch as minibatch size.

Great!

> Remains to be seen how the reconstruction loss behaves with the different parameters affecting the RW and the minibatch. @wehlutyk are you done with the minibatch parametrization?

Well, I started, then realised that what I wanted to test would have taken 6 months to run, and decided I wasn't sure which were the right parameter ranges to choose. So I moved on to real data sets (#8) in order to know when it would be necessary (memory-wise) to use a mini-batch size that's not the full batch, which then led me to #21 because it's currently so slow on a ~10,000-node network.

So once I'm done with #21 (today or next week), doing #8 should show us the relevant parameter ranges we need to test for the mini-batch and RW (and should give more material for NetSci).

wehlutyk commented 6 years ago

(I started the minibatch parametrisation in #19)

wehlutyk commented 6 years ago

Ok, so we have a bit of a mess with the validation of minibatching and the sensitivity analyses. I'm reorganising: