Open mattpodolak opened 3 years ago
A potential cause could be how the database is queried during time slicing. The oldest item utc_timestamp
is used as a before
field when generating subsequent timeslices. Pushshift queries the database using gt
and lt
for the after
and before
timestamps.
If multiple items have the same exact same utc_timestamp
but are not all returned in a single query (due to 100 item limit), we might expect that the items may not be returned in subsequent timeslices.
Test Query:
Results:
Discovered in #12