Admin UI enhancements to make it easier to track progress and track errors (e.g.403s or 404s at source) with batch uploads
Additional support for handling duplicates aimed at improving the performance of batch uploads so that duplicates URLs are recognised and stored (but we don't duplicate the storage of the actual image artefacts).
Clean up of /ws/exportDataset CSV to remove non printable characters from postgres COPY output (this was causing an issue * in pipelines loading)
INaturalist migration script (all INaturalist image URLs have changed)
/ws/exportDataset
CSV to remove non printable characters from postgres COPY output (this was causing an issue * in pipelines loading)Includes database changes and export script changes. DB change - additional field
is_duplicate_of
added toimage
table. See export scripts here: https://github.com/AtlasOfLivingAustralia/ala-install/blob/27_pipelines_spark_hadoop/ansible/roles/image-service/templates/sql/image-service-export.sql