pushshift / api

Pushshift API
1.29k stars 109 forks source link

question about the delay between API response and the actual data on Reddit #74

Open regel opened 3 years ago

regel commented 3 years ago

query: $ date Mer 24 fév 2021 08:32:11 CET $ curl -s "https://api.pushshift.io/reddit/search/submission/?subreddit=xyz&num_comments=>100" | less

response: .... { "created_utc": 1614003387, "retrieved_on": 1614003637, ....

$ date -r 1614003637 Lun 22 fév 2021 15:20:37 CET

This item in Pushshift DB is 2 days old and consequently the number of comments, awards infos, etc, are no longer accurate.

In this example, I noticed a delay of 1.5 days between Pushshift API response data and the actual data on Reddit. How often is the data updated.

Are popular items updated at regular interval in Pushshift database?

ellie-me commented 3 years ago

I'm trying to search for content in the last 24 hours and it always returns 0 in total_results. How often is the schedule?