hochschule-darmstadt-UAS / ddk-artbrowser

Exploring the world of arts using open data
http://openartbrowser.org/
MIT License
1 stars 1 forks source link

3/first xml importer #70

Closed mauamy closed 3 years ago

mauamy commented 3 years ago

This is the first version of the new ETL process for the ddk artbrowser (see #3). It mainly consists of a xml-importer which parses the lido xml files and creates json files containing the required objects as output. These output files are then uploaded to elasticsearch with the elasticsearch_uploader.py script, which is a lightly modified version of the elasticsearch_helper.py from the openartbrowser project.