terascope / teraslice

Scalable data processing pipelines in JavaScript
https://terascope.github.io/teraslice/
Apache License 2.0

e2e tests shared asset_directory can cause test failure. #3567

Closed: busma13 closed 5 months ago

busma13 commented 5 months ago

Adding functionality to store assets in external storage such as S3 instead of in Elasticsearch (see #3563) has surfaced a race condition in the e2e tests.

busma13 commented 5 months ago

There are a few ways that we may be able to resolve this issue:

busma13 commented 5 months ago

I decided the best approach was to add a function to the teraslice-harness that returns an array of IDs belonging to the "base assets" that the e2e tests autoload. recover-spec calls this function and assigns the return value to jobSpec.assets, ensuring that job validation only checks the assets that were autoloaded.
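
For anyone reading later, here is a rough sketch of that idea, not the code that was merged. The names `AssetRecord`, `JobSpec`, `BASE_ASSET_NAMES`, `getBaseAssetIds`, and `pinJobToBaseAssets` are all hypothetical, and the list of base asset names is only an illustrative guess. The real harness function and its signature may differ.

```typescript
// Hypothetical shapes; the real teraslice-harness and client types may differ.
interface AssetRecord {
    id: string;
    name: string;
    version: string;
}

interface JobSpec {
    name: string;
    assets?: string[];
}

// Assumption: names of the "base assets" the e2e tests autoload (illustrative values).
const BASE_ASSET_NAMES: string[] = ['standard', 'elasticsearch', 'kafka'];

/**
 * Return the IDs of the autoloaded base assets only, ignoring any other
 * assets that other tests may have placed in the shared asset directory.
 */
function getBaseAssetIds(
    allAssets: AssetRecord[],
    baseNames: string[] = BASE_ASSET_NAMES
): string[] {
    const wanted = new Set(baseNames);
    return allAssets
        .filter((asset) => wanted.has(asset.name))
        .map((asset) => asset.id);
}

/**
 * Return a copy of the job spec whose `assets` field lists only base asset
 * IDs, so job validation never looks at assets uploaded by other tests.
 */
function pinJobToBaseAssets(jobSpec: JobSpec, allAssets: AssetRecord[]): JobSpec {
    return { ...jobSpec, assets: getBaseAssetIds(allAssets) };
}
```

In a spec such as recover-spec, the job would be submitted with the pinned spec rather than the original one. Referencing assets by ID instead of by name means a half-written asset from a concurrent test cannot be picked up during validation.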