Closed tingwl0122 closed 2 years ago
I think this is mainly due to data type?
For OpenKE datasets, the code is stuck at reading the first key: node_name
, where it supposes to be lots of characters;
while for datasets like cora, the information is mainly tensor or csr_matrix.
I will have a look at that! Thank you for reporting this issue.
I have reproduced the error. The problem is that
After discussion, we decide to skip non-numeric attributes. I am opening a new PR.
Got it! I'll test this after that. Thank you!
Hi, I am not sure whether this issue was resolved? I tried to run tests on these OpenKE datasets today, but the same problem still occurred.
Hi @tingwl0122 ! Could you please try this new PR on your machine?
Sure. I'll try this in a second.
Describe the bug I cannot read in downloaded .npz files for some OpenKE datasets at
/GLB-Repo/datasets
, such asFB13
andWN18RR
. I didn't test all of the OpenKE datasets, but I could successfully run codes forcora
,citeseer
, andPubMed
.(A PR #86 is created to update dataset preparation in
/GLB-Repo/glb/tags.py
)To Reproduce Run
python3 tags.py --metadata FB13--task task
at/GLB-Repo/glb
Expected behavior
Comments The error occurs at
_dfs_read_file
ingraph.py
.array = file_reader.get(path, d.get("key"), device)
cannot merge path, keys and device name successfully.