dmarx / psaw

Python Pushshift.io API Wrapper (for comment/submission search)
BSD 2-Clause "Simplified" License
361 stars 53 forks source link

Statistics about total number of comments (over time) #83

Open wuqui opened 4 years ago

wuqui commented 4 years ago

I want to retrieve the total number of comments posted on Reddit over time. I've discovered the amazing aggs parameter, am I right to assume that it returns these numbers if I use it without specifying any search parameters?

gen = api.search_comments(
    aggs = 'created_utc',
    frequency = 'month'
)

Plotting the results looks somewhat plausible, I guess:

stats

Any ideas about the drop after the turn of the year by any chance? Changes to the API?

PS: Thanks for this great package!