pushshift / api

Pushshift API

API. Search. Submissions older than November 3, 2022 are not available (search by Subreddits) #132

Open smirnovstr-paul opened 1 year ago

smirnovstr-paul commented 1 year ago

Searching submissions by subreddit returns nothing older than November 3, 2022. The two queries below differ only in their "before" timestamp:

https://api.pushshift.io/reddit/search/submission/?metadata=true&size=500&subreddit=binance&before=1667505777


https://api.pushshift.io/reddit/search/submission/?metadata=true&size=500&subreddit=binance&before=1667505780


Sorting by creation date does not work. The search returns no submissions older than November 3, 2022, on any subreddit.
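For anyone trying to reproduce this, here is a minimal Python sketch that rebuilds the two query URLs above and decodes their "before" values into UTC dates. The helper names `build_search_url` and `epoch_to_utc` are made up for illustration; only the endpoint and parameters are taken from the URLs in this report. It makes no network request, since the endpoint may not respond as it did when this was filed.

```python
from datetime import datetime, timezone
from urllib.parse import urlencode

# Endpoint copied from the URLs in this report.
BASE_URL = "https://api.pushshift.io/reddit/search/submission/"


def build_search_url(subreddit: str, before: int, size: int = 500) -> str:
    """Rebuild the query string used in the report (hypothetical helper)."""
    params = {
        "metadata": "true",
        "size": size,
        "subreddit": subreddit,
        "before": before,  # Unix epoch seconds
    }
    return BASE_URL + "?" + urlencode(params)


def epoch_to_utc(timestamp: int) -> datetime:
    """Convert Unix epoch seconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(timestamp, tz=timezone.utc)


if __name__ == "__main__":
    for before in (1667505777, 1667505780):
        print(build_search_url("binance", before))
        print("  before =", epoch_to_utc(before).isoformat())
```

The two timestamps decode to 2022-11-03 20:02:57 UTC and 2022-11-03 20:03:00 UTC, so the queries are only three seconds apart.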

abhranil26 commented 1 year ago

Yes, even older user submissions don't load anymore.

datanizing commented 1 year ago

Same for me 😭. Also, the API is now only available over HTTP, where it used to be HTTPS.

If there is anything to do where I can help, I would be happy to contribute.

jannat5134 commented 1 year ago

Do you know how to fix it? Or is there any other API for getting historical Reddit data?

RaedShabbir commented 1 year ago

Any word on when this will be resolved?

Temporary solution here: https://www.reddit.com/r/pushshift/comments/107pm28/does_anybody_have_reddit_web_scraping_alternatives/

ranbix666 commented 1 year ago

Same for me! Can someone please take a look at this issue? It works fine for comments.

JacobGeoGeek commented 1 year ago

Is there any news on this problem? Also, does the Reddit API allow devs to extract the oldest posts from a subreddit?

rbjakab commented 1 year ago

I'm afraid this problem is still active. Any news on it? Does anybody know what causes it, and is anybody working on it?

RaedShabbir commented 1 year ago

https://www.reddit.com/r/pushshift/comments/zkggt0/update_on_colo_switchover_bug_fixes_reindexing/

Caused by server switch-over. I'd advise you to follow the instructions here for a temporary workaround.

Copied below for anybody else that comes across this issue.

Link to Reddit Archives Torrent

Link to extraction code for multi-processing:
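The linked extraction code is not reproduced here. As a rough idea of what filtering the archives for one subreddit could look like, here is a minimal sketch. It assumes the dump has already been decompressed into newline-delimited JSON, one submission per line, with `subreddit` and `created_utc` fields. The function name `filter_submissions` is hypothetical, and treating "before" as strictly earlier is a choice made for this sketch.

```python
import json
from typing import Iterable, Iterator


def filter_submissions(
    lines: Iterable[str], subreddit: str, before: int
) -> Iterator[dict]:
    """Yield records from `subreddit` created strictly before `before`.

    Assumes one JSON object per line with `subreddit` and `created_utc`
    fields (an assumption about the dump format, not confirmed here).
    """
    wanted = subreddit.lower()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines rather than abort the scan
        if not isinstance(record, dict):
            continue
        if str(record.get("subreddit", "")).lower() != wanted:
            continue
        try:
            # Tolerate the timestamp being stored as a number or a string.
            created = int(float(record.get("created_utc")))
        except (TypeError, ValueError):
            continue
        if created < before:
            yield record


if __name__ == "__main__":
    # Made-up sample records, for illustration only.
    sample = [
        '{"id": "a1", "subreddit": "binance", "created_utc": 1667000000}',
        '{"id": "b2", "subreddit": "binance", "created_utc": "1668000000"}',
        '{"id": "c3", "subreddit": "other", "created_utc": 1667000000}',
    ]
    for post in filter_submissions(sample, "binance", before=1667505777):
        print(post["id"])
```

Because it processes one line at a time, the same function can be fed an open file handle or a decompression stream instead of a list.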

rbjakab commented 1 year ago

@RaedShabbir thanks, but that won't work for me since I don't have 2 TB of spare space on my laptop.