ncsoft / argew

Implementation for "Node Embedding for Homophilous Graphs with ARGEW: Augmentation of Random walks by Graph Edge Weights"

context size in sampling #1

Open deweihu96 opened 1 year ago

deweihu96 commented 1 year ago

Hi, thanks for your great work. I've been trying to incorporate the augmentation code into my own workflow.

I'm a little bit confused by the `context_size` variable. Is it the same as the `window_size` variable in gensim word2vec?

In gensim word2vec, the actual sequence used for training spans up to 2*window_size + 1 words (the target word plus up to window_size words on each side).

But the code in `sampler.argew` suggests that the length of the sequences used for training is `context_size`: `sequences = rw[:, j:j + self.context_size]`.
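To make the question concrete, here is a minimal sketch of what that slicing does, using plain Python lists instead of a torch tensor; `split_walk` is a hypothetical helper mirroring `rw[:, j:j + self.context_size]` applied for every valid offset `j`:

```python
def split_walk(walk, context_size):
    """Return all contiguous subsequences of length `context_size`."""
    return [walk[j:j + context_size]
            for j in range(len(walk) - context_size + 1)]

walk = [0, 4, 7, 2, 9]          # one random walk of length 5
print(split_walk(walk, 3))
# → [[0, 4, 7], [4, 7, 2], [7, 2, 9]]
```

So a walk of length L yields L - context_size + 1 training subsequences, each of length exactly `context_size`.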

danieljunhee commented 1 year ago

@deweihu96 Thank you for your interest in our work.

It seems the two variables you mentioned are conceptually the same (i.e., how many surrounding words/nodes to use) but differ in how each package implements them.

In gensim, the factor of 2 probably arises because the implementation directly extracts words both before and after the target word.

On the other hand, our work follows the pytorch-geometric package's node2vec implementation: each random walk is split into subsequences of length `context_size`, and within each subsequence the initial node pairs up with each of the remaining nodes to form a positive example.
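The pairing scheme described above can be sketched as follows; this is an illustrative reconstruction in plain Python (the hypothetical `positive_pairs` helper is not the package's actual code), not the torch-based implementation itself:

```python
def positive_pairs(walk, context_size):
    """For every length-`context_size` window of the walk, pair the
    window's first node with each remaining node in that window."""
    pairs = []
    for j in range(len(walk) - context_size + 1):
        window = walk[j:j + context_size]
        start = window[0]
        # the initial node forms a positive example with every other
        # node in the subsequence
        pairs.extend((start, other) for other in window[1:])
    return pairs

walk = [0, 4, 7, 2]
print(positive_pairs(walk, 3))
# → [(0, 4), (0, 7), (4, 7), (4, 2)]
```

So, unlike gensim's symmetric window around a center word, here each window contributes context_size - 1 positive pairs, all anchored on the window's first node.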