code-openness / Data

The pre-processing and formatting of the data to setup the Wikidata instance
0 stars 0 forks source link

Sanitise the names of Authors and Editors in the data #13

Closed AbdBarho closed 5 years ago

AbdBarho commented 5 years ago

a) shorten first names, so that all of them have one format. b) solve the problem with wrongly encoded characters c) (possibly) fill missing info in rows where "et. al." exists