added parser script - Githubissues

PR as per https://gitlab.com/cdli/framework/issues/157

Extracting Sumerian language data (both translated and untranslated) is working fine. Extracting data based on genre is not working as expected probably due to the time complexity involved in the approach used. Working on new approach. Opening this draft PR to this ascertain that the requirements are being satisfied and this is what was supposed to be done.

To run the script, first merge the two catalogue files as mentioned in the readme and then type: python3 parser.py

Feedback welcome. :)

cdli-gh / data

added parser script #58