snap-stanford / ogb

Benchmark datasets, data loaders, and evaluators for graph machine learning
https://ogb.stanford.edu
MIT License
1.89k stars 397 forks source link

Raw Text for ogbn-products #414

Closed HaniItani closed 10 months ago

HaniItani commented 1 year ago

Hello @weihua916,

Do you, by any chance, have the raw text Amazon dataset from which ogbn-products was generated? In particular, I'm interested to map graph node features to text descriptions and labels to product text labels. I'm looking for something similar to the raw text files you provide for ogbn-arxiv and ogbn-papers100M. I did not find it in the dataset linked in Cluster-GCN. I also tried to download Amazon3M from here, but I could not establish the mapping.

Any help or guidance will be much appreciated. Best regards, Hani

weihua916 commented 1 year ago

Hi Hani, thanks for reaching out. I do not have raw text but we provided the mapping directory that contains a file that maps each node into ASIN (amazon' product's unique ID). It can be used together with the Cluster-GCN's Amazon3M dataset to establish the mapping. Let me know if you have further questions.

xiehuanyi commented 4 months ago

Hi, weihua916. I have found an empty label in the labelidx2productcategory.csv.gz and a label #508510. What do these two labels mean?

截屏2024-02-28 19 24 03