k-int / gokb-phase1

Original GOKb repo - Moving to https://github.com/openlibraryenvironment/gokb
http://www.gokb.org
Other
11 stars 5 forks source link

Automated data manipulation on package deposit #502

Open kristenwilson opened 8 years ago

kristenwilson commented 8 years ago

The new “Package Deposit” mechanism for loading data into GOKb is intended to be an automated process which can operate without user intervention. This means that data manipulation and corrections that would have previously happened in OpenRefine need to built into the Package Deposit process.

While currently all data manipulation required is carried out in OpenRefine, the types of data manipulation that need to be carried out when bringing data into GOKb vary, and the solutions when using the Package Deposit mechanism will need to be tailored to the type of data manipulation required.

Full spec available here: https://docs.google.com/document/d/1koqr5GDwtUgSaw5G0vGqATdQP2wUUwD-U0jkvTAR7gY/edit?ts=572cad31