CREATE EXTERNAL TABLE IF NOT EXISTS {table_name} (
Records ARRAY<STRUCT<
eventVersion STRING,
userIdentity STRUCT<
type:STRING,
principalId:STRING,
arn:STRING,
....
skipping index Definition
CREATE SKIPPING INDEX ON {table_name} (
`Records.userIdentity.principalId` BLOOM_FILTER,
`Records.userIdentity.accountId` BLOOM_FILTER,
`Records.userIdentity.userName` BLOOM_FILTER,
`Records.sourceIPAddress` BLOOM_FILTER,
`Records.eventId` BLOOM_FILTER,
`Records.userIdentity.type` VALUE_SET,
`Records.eventName` VALUE_SET,
`Records.eventType` VALUE_SET,
`Records.awsRegion` VALUE_SET
) WITH (
...
)
How can one reproduce the bug?
Steps to reproduce the behavior:
Create a table as shown above
Create a skipping index as shown above
Spark returns Error
What is the expected behavior?
Skipping index should be able to work on top of such Array based indices
Utilize LATERAL VIEW explode({Array Field}) in some way
What is the bug? When defining a skipping index on top of an array based data structure - the create skipping index fails:
Data Definition
Table Definition
skipping index Definition
How can one reproduce the bug? Steps to reproduce the behavior:
What is the expected behavior? Skipping index should be able to work on top of such Array based indices Utilize
LATERAL VIEW explode({Array Field})
in some way