xgfs / deepwalk-c

DeepWalk implementation in C++
MIT License
99 stars 16 forks source link

how to evaluate of the correctness of the results? #10

Open chi2liu opened 5 years ago

chi2liu commented 5 years ago

we want to evaluate DeepWalk-c on the BlogCatalog dataset used in the DeepWalk paper. using the scripts from https://github.com/phanein/deepwalk. but we meet problem when construct the ouput vectors. how to get the vertex number from the csr?

xgfs commented 5 years ago

See #7

chi2liu commented 5 years ago

after convertion, in csr, node ids are from 1 to 10312? to get the original node ids, we need to sort the original node ids?

for example, if original node ids after sorted are from 4,8,13,...10234 in csr format ,node ids 1 is actual 4 node ids 2 is actual 8 node ids 10312 is actual 10234?

is that right?

xgfs commented 5 years ago

In CSR node IDs are from 0 to 10312.

I am not sure why do you have some "original" node IDs, blogcatalog was distributed as a matrix originally; the mapping should be 1-to-1.