Closed MikeDoes closed 3 years ago
Hi! This is not an error. We index different node types differently. In other words, 0-th drug is different from 0-th function.
You can think of (node_type, idx)
to specify a distinct entity in the biomedical KG.
Ok, thanks now it's much clearer
It appears that the same index values have multiple different types.
For example, if we take head index value 0, then search for all the instances in the training set.
The head types have the following distribution: function 23 protein 2 disease 1 drug 1
Could you confirm that this is indeed an error in the dataset?
See Jupyter Notebook Here: https://colab.research.google.com/drive/1pUWrZVLve4Ohc3w3ZmsPYAIy55_T4NIC?usp=sharing