This pull request adds features to the image service to support pipelines-based ingestion.
See AtlasOfLivingAustralia/la-pipelines#6
This pull request includes the following:
Async web services for uploading zipped AVRO files for bulk ingestion of images from pipelines, backed by a persistent queue
Services to check status of an upload (to allow for external polling)
Admin UI for monitoring the progress of ingestion
Formalisation of the metadata fields supported for update, with the addition of standard Dublin Core terms in use by GBIF for multimedia (e.g. audience, contributor, publisher, created, type)
Export web services to allow for fast retrieval of a list of images stored for a dataset/dataResourceUid
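The upload-then-poll flow described above could be driven from a client roughly as follows. This is a minimal sketch, not the service's actual API: the status names (`QUEUED`, `LOADING`, `COMPLETE`, `FAILED`) and the `fetch_status` callable are assumptions for illustration. In practice `fetch_status` would wrap an HTTP call to the upload status service.

```python
import time
from typing import Callable

# Hypothetical terminal states; the real service may use different names.
TERMINAL_STATES = {"COMPLETE", "FAILED"}


def poll_until_done(
    fetch_status: Callable[[], str],
    interval_seconds: float = 5.0,
    max_attempts: int = 120,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch_status repeatedly until it returns a terminal state.

    Raises TimeoutError if no terminal state is seen within max_attempts.
    """
    for _ in range(max_attempts):
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        # Wait before asking again so the service is not hammered.
        sleep(interval_seconds)
    raise TimeoutError(f"upload not finished after {max_attempts} polls")
```

Passing `sleep` as a parameter keeps the loop easy to exercise without real delays.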
Admin UI
Through the UI, an administrator can:
Enable/disable processing of uploads
Reload an AVRO file
Clear the current queue
Additional features
The following can now be adjusted via the setting table:
Number of background threads used for thumbnailing
Number of threads used to download images from a data publisher
Throttling of download threads
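To illustrate how a configurable thread count and throttle might interact, here is a minimal sketch. It is not the image service's implementation: the setting names (`download.threads`, `download.throttle.ms`), their defaults, and the `download` callable are all hypothetical.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, List

# Hypothetical setting names and defaults, for illustration only.
DEFAULT_SETTINGS = {"download.threads": 3, "download.throttle.ms": 0}


def download_all(
    urls: Iterable[str],
    download: Callable[[str], bytes],
    settings: Dict[str, int] = DEFAULT_SETTINGS,
) -> List[bytes]:
    """Download every URL with a bounded pool, pausing after each download."""
    threads = int(settings["download.threads"])
    throttle_seconds = int(settings["download.throttle.ms"]) / 1000.0

    def throttled(url: str) -> bytes:
        data = download(url)
        if throttle_seconds > 0:
            # Pause this worker so the data publisher is not overloaded.
            time.sleep(throttle_seconds)
        return data

    with ThreadPoolExecutor(max_workers=threads) as pool:
        # Executor.map returns results in the same order as the input URLs.
        return list(pool.map(throttled, urls))
```

Reading both values from one settings mapping mirrors the idea of adjusting them at runtime without a code change.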
Other changes:
Removal of staging logic as the UI was removed several versions ago.
Purge job for batch file uploads that removes completed jobs from the database after a number of days (the number of days is configurable in the database).
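The purge rule can be sketched as a simple age filter. This is an assumption-laden illustration rather than the actual job: the `BatchUpload` record shape, the `COMPLETE` status name, and the `retention_days` parameter are hypothetical stand-ins for whatever the database stores.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Optional


@dataclass
class BatchUpload:
    # Hypothetical record shape; the real table's columns may differ.
    id: int
    status: str
    date_completed: Optional[datetime]


def purge_completed(
    uploads: List[BatchUpload], retention_days: int, now: datetime
) -> List[BatchUpload]:
    """Return the uploads that survive a purge.

    An upload is dropped only if it is complete and finished before the cutoff.
    """
    cutoff = now - timedelta(days=retention_days)
    return [
        u
        for u in uploads
        if not (
            u.status == "COMPLETE"
            and u.date_completed is not None
            and u.date_completed < cutoff
        )
    ]
```

Unfinished uploads are kept regardless of age, which matches the description of removing only completed jobs.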
![image](https://user-images.githubusercontent.com/1750790/100705115-c717d300-33fa-11eb-96c8-9a7e6ad75a38.png)

![image](https://user-images.githubusercontent.com/1750790/100705150-d39c2b80-33fa-11eb-9b8e-e2410277419e.png)

![image](https://user-images.githubusercontent.com/1750790/100705186-e7e02880-33fa-11eb-8efc-857e88405e4b.png)

![image](https://user-images.githubusercontent.com/1750790/100705207-f0386380-33fa-11eb-88ef-3378ae3ae6b3.png)

![image](https://user-images.githubusercontent.com/1750790/100705232-fa5a6200-33fa-11eb-9565-e9749cc2a57d.png)