Open Alex-Ikanow opened 8 years ago
The problem is that the ES-hadoop/spark logic doesn't try to figure out if indexes are timesliced except by their suffix .. if course in practice we're guaranteed that _
So should be safe to add a number-of-digit-at-the-end check to the TimeSliceUtils code
From Caleb:
for a bucket that is comprised of two indexes (
ttm_master__2b5fd691353a
andttm_master__2b5fd691353a_1
)I think the problem is when the tmin/tmax are specified, it tries to filter on date, my guess is that it's picking up the segment id as a date and then ignoring it
Ah looks like
candidateTimedDirectories
inTimeSliceDirUtils
inaleph2_core_shared_library
doesn't expect the segment id (which is a purely ES construct)