aws / sagemaker-feature-store-spark

Apache License 2.0

[FEAT] Expose Validate DataFrame Schema API #2

Open huaziyao opened 2 years ago

huaziyao commented 2 years ago

Issue #, if available:

Description of changes: The validation logic currently exists only as part of data ingestion. This change separates out the validation logic and exposes it through the Scala and Python interfaces.

Data type conversion was previously performed as part of validation. This change moves it into its own function.
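To illustrate the separation described above, here is a minimal sketch in plain Python. It is not the code in this PR and not the library's API: the function names `validate_schema` and `convert_types`, the dictionary-based schema representation, and the type compatibility tables are all assumptions made for illustration. The only detail taken from Feature Store itself is the set of feature types (Integral, Fractional, String).

```python
from typing import Dict, List

# Assumed compatibility table (hypothetical): which column types are
# acceptable for each Feature Store feature type.
COMPATIBLE_TYPES = {
    "Integral": {"tinyint", "smallint", "int", "bigint"},
    "Fractional": {"float", "double"},
    "String": {"string"},
}

# Assumed target column type for each feature type (hypothetical).
TARGET_TYPE = {"Integral": "bigint", "Fractional": "double", "String": "string"}


def validate_schema(columns: Dict[str, str], features: Dict[str, str]) -> List[str]:
    """Check column names and types against feature definitions.

    Returns a list of problems; an empty list means the schema is valid.
    This function only inspects the schema and never modifies it.
    """
    errors = []
    for name, col_type in columns.items():
        if name not in features:
            errors.append(f"column '{name}' is not a defined feature")
        elif col_type not in COMPATIBLE_TYPES.get(features[name], set()):
            errors.append(
                f"column '{name}' has type '{col_type}', "
                f"which is incompatible with feature type '{features[name]}'"
            )
    return errors


def convert_types(columns: Dict[str, str], features: Dict[str, str]) -> Dict[str, str]:
    """Return a new schema with each column cast to its target type.

    Kept separate from validation; assumes validate_schema returned no errors.
    """
    return {name: TARGET_TYPE[features[name]] for name in columns}
```

With this split, a caller can validate a schema without triggering ingestion or any casting, and run the conversion step only once validation reports no errors.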

Unit and integration tests are added.

Testing done:

Merge Checklist

Put an x in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your pull request.

General

Tests

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.