Closed gortiz closed 1 year ago
I also tried to remove/create an inverted index in airlineStats
, specifically in the Origin column, and it doesn't seem to break anything, so maybe the problem is only related to the github events table.
Per the exception message, I don't think it is related to the index change. The exception is thrown from RealtimeToOfflineSegmentsTask, which expects both REALTIME and OFFLINE table to exist. We should just remove this task from the table config
I had a look at this, from the description it looks like there are 2 issues
QuickStartBase.DEFAULT_STREAM_TABLE_DIRECTORIES
the githubEvents
table entry alone is under examples/minions/stream
while other tables are sitting under examples/stream
.examples/stream/githubEvents
but it does not follow the file name convention such as examples/stream/<TABLE_NAME>/<TABLE_NAME>_realtime_table_config.json
and also has a different schema compared to the current githubEvents
table.examples/minions/stream/githubEvents
is referenced at other places too so removing the task from the table config may not be the right way to handle thisexamples/stream/githubEvents
to examples/stream/pullRequestMergedEvents
in accordance to the naming convention and create a new examples/stream/githubEvents
which is a copy of examples/minions/stream/githubEvents
without the task in table config.githubEvents
table is from 2021 and the table config has retention set to 1 year so reload is deleting all the segmentsThe data in githubEvents table is from 2021 and the table config has retention set to 1 year so reload is deleting all the segments
Very good catch! That would explain why we only see this behavior in this test and not in production and honestly it will make me sleep better :)
RealtimeQuickStart seems to fail when indexes are changed (either created, updated or removed).
In order to reproduce it:
githubEvents
and either remove the included inverted index or add a new one.The behavior is not 100% consistent. Sometimes it is needed to retry.
I've been abled to replicate this with both master(866c796bd56cf846b654f29f024f3e610557b2c7) and with release
release-0.12.1
. When done in master, the following log is printed:This log is not printed in 0.12.1.
Before change:
After inverted index is removed:
Also all older segments get deleted and new ones are created. The backup of the old ones are still indexed, while the new ones are not.