We need to cross-reference the IDs available in the grid UI (i.e. in Elasticsearch) before reaping loads of files from S3. Elasticsearch has no direct way to export every ID, so it has to be done with a query, which is most efficient as a 'scan and scroll' (see https://stackoverflow.com/a/30855670). This adds a script to do just that and write the IDs to a file, for example a CSV file for upload to AWS Athena (see #4111).
Seems to work nicely for TEST (finished in a couple of mins)...
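
For readers unfamiliar with the pattern, here is a minimal sketch of a 'scan and scroll' ID export in Python. It is not the script added in this PR. It assumes the official `elasticsearch` Python client is installed, and the function names, the `id` column header, the cluster URL and the index name are all hypothetical.

```python
import csv
from typing import Iterable, Iterator


def iter_ids(hits: Iterable[dict]) -> Iterator[str]:
    """Yield the document ID from each raw Elasticsearch hit."""
    for hit in hits:
        yield hit["_id"]


def write_ids_csv(ids: Iterable[str], out_file) -> int:
    """Write one ID per row under an 'id' header; return the number of IDs written."""
    writer = csv.writer(out_file)
    writer.writerow(["id"])  # hypothetical column name, adjust to the Athena table
    count = 0
    for doc_id in ids:
        writer.writerow([doc_id])
        count += 1
    return count


def scan_all_hits(es_url: str, index: str) -> Iterable[dict]:
    """Stream every hit in `index` using the client's scan helper (scroll API)."""
    # Imported lazily so the CSV helpers above work without the client installed.
    from elasticsearch import Elasticsearch
    from elasticsearch.helpers import scan

    client = Elasticsearch(es_url)
    # "_source": False skips the document bodies, since only "_id" is needed.
    query = {"query": {"match_all": {}}, "_source": False}
    return scan(client, index=index, query=query, size=1000)


def export_ids(es_url: str, index: str, path: str) -> int:
    """Scroll through the whole index and write all IDs to a CSV file at `path`."""
    with open(path, "w", newline="") as out_file:
        return write_ids_csv(iter_ids(scan_all_hits(es_url, index)), out_file)
```

Something like `export_ids("http://localhost:9200", "images", "ids.csv")` would then produce the file, where both the URL and the index name are placeholders. Streaming hits straight to the writer keeps memory use flat regardless of index size.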