Closed yruslan closed 5 months ago
Currently, we have 2 options for schema transformation:
.option("schema_retention_policy", "keep_original") .option("schema_retention_policy", "collapse_root")
Field names in mainframe copybooks are usually unique, even if they are part of nested structs. Cobrix can remove all nesting until an array or a primitive is encountered.
Add a feature to collapse structs or the output data.
A simple example if applicable.
Solution Ideas
.option("schema_retention_policy", "collapse_struct")
that unstructs on-fly. OR
SparkUtils
Background
Currently, we have 2 options for schema transformation:
Field names in mainframe copybooks are usually unique, even if they are part of nested structs. Cobrix can remove all nesting until an array or a primitive is encountered.
Feature
Add a feature to collapse structs or the output data.
Example [Optional]
A simple example if applicable.
Proposed Solution [Optional]
Solution Ideas
that unstructs on-fly. OR
SparkUtils
that unstructs as a post-processing.