sec2pri / mapping_preprocessing

reducing the Wikidata output files #13

Closed tabbassidaloii closed 1 year ago

tabbassidaloii commented 1 year ago

@DeniseSl22, I suggest removing the URI prefix before the IDs to reduce the file size. For example, instead of http://www.wikidata.org/entity/Q209355,http://www.wikidata.org/entity/Q72471241, use Q209355,Q72471241.

DeniseSl22 commented 1 year ago

Yes, I can do that in the query, but it might increase the time the query takes to run, so I need to check that we don't hit a timeout.

tabbassidaloii commented 1 year ago

Maybe adding a bash script that strips the prefixes after downloading and before saving would be better. I can do that.
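
A minimal sketch of what such a post-download step might look like, assuming the prefix is always the literal string http://www.wikidata.org/entity/ shown in the example above. The function name and the file names in the comments are hypothetical, not taken from the repository.

```shell
#!/usr/bin/env bash

# Hypothetical helper: read lines on stdin and remove every occurrence of the
# Wikidata entity URI prefix, leaving only the bare Q-identifiers.
strip_wikidata_prefix() {
  # '|' is used as the sed delimiter so the slashes in the URI need no escaping.
  sed 's|http://www\.wikidata\.org/entity/||g'
}

# Possible usage after downloading and before saving (placeholder file names):
#   strip_wikidata_prefix < wikidata_raw.csv > wikidata_clean.csv
```

Because the cleanup runs on the downloaded file, the query itself stays unchanged, so it should not add to the query's running time.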