Describe the bug
The IDGenerationStrategy setting enceladus.recordId.generation.strategy, with values uuid, stableHashId, and none (uuid being the default), is available in Enceladus and is expected to apply to both Standardization and Conformance.
The default (uuid) works as expected.
However, when a non-default value (e.g. stableHashId) is set, it appears to be respected only in the Conformance phase. Standardization still runs with the default uuid strategy, effectively ignoring the setting.
To Reproduce
Steps to reproduce the behavior OR commands run:
Set enceladus.recordId.generation.strategy=stableHashId in config or via -D.
Run std & conf.
Observe that UUIDs are generated in the Std phase instead of the expected stableHashId (integer) values.
(Note that std/conf do not overwrite an existing enceladus_record_id column.)
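To make the last step easier to check, the values read from the enceladus_record_id column can be classified as UUID-shaped or integer-shaped. The following is only a minimal sketch: RecordIdKind and its methods are hypothetical helpers, not part of Enceladus, and it assumes the column values have already been collected as strings.

```java
import java.util.UUID;

// Hypothetical helper, not part of Enceladus: classifies a value taken from
// the enceladus_record_id column so the reproduction result is easy to read.
final class RecordIdKind {

    // Returns "uuid", "integer", or "other" for a single column value.
    static String classify(String value) {
        if (value == null) {
            return "other";
        }
        String v = value.trim();
        if (isInteger(v)) {
            return "integer";
        }
        if (isUuid(v)) {
            return "uuid";
        }
        return "other";
    }

    // True if the value parses as a signed integer that fits in 64 bits.
    static boolean isInteger(String v) {
        try {
            Long.parseLong(v);
            return true;
        } catch (NumberFormatException e) {
            return false;
        }
    }

    // True if the value is a UUID in the canonical 36-character form.
    static boolean isUuid(String v) {
        if (v.length() != 36) {
            return false;
        }
        try {
            UUID.fromString(v);
            return true;
        } catch (IllegalArgumentException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // A value as the uuid strategy would produce it (assumed shape).
        System.out.println(classify(UUID.randomUUID().toString()));
        // An integer value, as reported for stableHashId in this issue.
        System.out.println(classify("-1234567890"));
    }
}
```

If the bug is present, values sampled from the Standardization output would classify as "uuid" even though stableHashId was requested.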
Expected behavior
A non-default enceladus.recordId.generation.strategy setting should be respected in both the std & conf phases.
Additional context
This regression may have been introduced when the spark-data-standardization library was extracted. Thus consider:
remove any leftover code duplication (RecordIdGeneration)
add tests to prevent such a regression in the future