Closed joemirizio closed 3 years ago
Parameter is ignored at schema generation.
Parquet schema generation should skip partition attributes in list partitionValueAttributes
if skipPartitionAttributes
is true.
resolved in v2.2.4 https://github.com/IBMStreams/streamsx.objectstorage/releases/tag/v2.2.4
As a user I want to write partitioned data to S3 without the data containing the partition fields.
I am able to write partitioned parquet data to S3 using the
S3ObjectStorageSink
, but it contains the partition attributes. SettingskipPartitionAttributes
seems to have no effect, and the fields are still writtenTest Case
To reproduce, I created a simple SPL example.
This correctly creates a file at
s3://mybucket/streams-test/year=2000/month=1/day=1/obs-0.parquet
.Viewing the parquet file with
parquet-tools
, the partition attributes (year
,month
,day
) are still present in the data.Is this argument not working as intended or am I misunderstanding its purpose?