gnn4dr / DRKG

A knowledge graph and a set of tools for drug repurposing
Apache License 2.0
577 stars 155 forks source link

Gene with no id inside the file "drkg.tsv" #32

Open canbax opened 2 years ago

canbax commented 2 years ago

Thanks for all your effort. Great content!

Inside the file drkg.tsv lines 4123208-4123214 contain genes without id info. You might want to delete them or fix them. You can find with regex search for Gene::\t

image

Also, I see 22 more genes with no id. You can find them with regex search for Gene::\n. For example, you can see them on lines 4122083, 4138986, 4145818, etc...

canbax commented 2 years ago

In case you are interested, I created a kind of extended version of your data https://github.com/canbax/derman Basically it also contains some data properties.