aws / sagemaker-feature-store-spark

Apache License 2.0

[Fix] Fix the issue of getFailedStreamIngestionDataFrame which would cause re-ingestion #9

Closed can-sun closed 1 year ago

can-sun commented 1 year ago

Issue #, if available: Fix the issue of getFailedStreamIngestionDataFrame which would cause re-ingestion

Description of changes: The DataFrame returned by getFailedStreamIngestionDataFrame should be cached. Otherwise, Spark recomputes it every time a Spark action is called on it, which re-runs the ingestion.
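To illustrate the recomputation problem described above, here is a minimal toy model in plain Python. It is not the Spark API and not the code changed in this PR: `LazyFrame`, `ingest_and_report_failures`, and the sample records are all hypothetical names made up for the sketch. It only assumes that the frame is evaluated lazily and that computing it has a side effect (the ingestion).

```python
class LazyFrame:
    """Toy stand-in for a lazily evaluated DataFrame (NOT the Spark API)."""

    def __init__(self, compute):
        self._compute = compute        # function that produces the rows
        self._cache_enabled = False
        self._cached_rows = None       # filled on the first action after cache()

    def cache(self):
        # Like a lazy cache: only marks the frame, nothing is computed yet.
        self._cache_enabled = True
        return self

    def _rows(self):
        if self._cache_enabled and self._cached_rows is not None:
            return self._cached_rows   # reuse the earlier result
        rows = self._compute()         # (re)run the whole computation
        if self._cache_enabled:
            self._cached_rows = rows
        return rows

    # Two "actions" that each force evaluation.
    def count(self):
        return len(self._rows())

    def collect(self):
        return list(self._rows())


ingest_calls = []  # records how many times the side effect ran


def ingest_and_report_failures():
    """Hypothetical side-effecting step: 'ingest' records, return the failed ones."""
    ingest_calls.append(1)
    records = [{"id": 1, "ok": True}, {"id": 2, "ok": False}]
    return [r for r in records if not r["ok"]]


# Without caching, every action re-runs the ingestion.
uncached = LazyFrame(ingest_and_report_failures)
uncached.count()
uncached.collect()
uncached_runs = len(ingest_calls)

# With caching, the ingestion runs once and later actions reuse the result.
ingest_calls.clear()
cached = LazyFrame(ingest_and_report_failures).cache()
cached.count()
cached.collect()
cached_runs = len(ingest_calls)

print(uncached_runs, cached_runs)
```

In this sketch the uncached frame runs the ingestion once per action, while the cached frame runs it only on the first action.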

Testing done:

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

General

Tests

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.