williamleif / GraphSAGE

Representation learning on large graphs using stochastic graph convolutions.
Other
3.43k stars 843 forks source link

How to get the source data of citation and reddit? #82

Open ZhiliangYao opened 5 years ago

ZhiliangYao commented 5 years ago

Hi, Thanks for your code.

I'm doing research related to dynamic graphs. Datasets with information including node attributes, node labels and dynamic links are needed. In the SNAP website, I canot find datasets meet my needs. There seems no download link in https://pushshift.io/ where reddit dataset sourced from. Is there any way I can obtain those datasets?

Thanks in advance.

RexYing commented 5 years ago

You can also find the datasets at http://snap.stanford.edu/graphsage/ In the dataset section, we listed the Reddit and PPI datasets.

ZhiliangYao commented 5 years ago

Thank you for your reply.

However, the preprocessed datasets at http://snap.stanford.edu/graphsage/ lacks time information. It seems that the source PPI dataset lacks time information as well. And there seems no download link in https://pushshift.io/ where Reddit dataset sourced from.

I just wonder if there is any other way to obtain the source Reddit and PPI data with time information?

RexYing commented 5 years ago

I think the PPI dataset does not have time information. These interactions are biochemical effects of proteins on other proteins common to many animals of the same species.

ZhiliangYao commented 5 years ago

I apologize for my spelling mistakes. I would like to obtain the source Reddit and Citation data with time information actually. Can you please email them to me if you save the original datas. My email address is zhiliang_yao@126.com. Your help will be greatly appreciated.