Open njtierney opened 7 years ago
Haven't looked at the data yet, but I'm so keen, thanks @maelle! I think what would help is shifting them all to the same relative timescale so they all start at t0. For that we're going to need the date each package was published. Or as a proxy we could use the date of the first star.
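A minimal sketch of that shift, assuming the star data comes as one row per (package, date, count) and using each package's earliest row as its t0. The function name `align_to_t0`, the toy package names, and the numbers are all made up for illustration, not taken from the real data.

```python
from datetime import date

def align_to_t0(records):
    """Convert (package, date, count) rows to (package, days_since_first_row, count).

    The earliest date seen for each package is treated as its t0, which is
    only a proxy for the true publication date.
    """
    first_seen = {}
    for package, day, _ in records:
        if package not in first_seen or day < first_seen[package]:
            first_seen[package] = day
    return [
        (package, (day - first_seen[package]).days, count)
        for package, day, count in records
    ]

# Hypothetical toy data: two packages first starred more than a year apart.
stars = [
    ("pkgA", date(2016, 1, 10), 1),
    ("pkgA", date(2016, 1, 15), 3),
    ("pkgB", date(2017, 6, 1), 2),
    ("pkgB", date(2017, 6, 4), 5),
]
aligned = align_to_t0(stars)
```

After this, both packages start at day 0, so their curves can be overlaid on one axis.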
Yeah it's hard to compare them!
I added the CRAN downloads, took 5 minutes :grin: I used the earliest date from the GitHub stars as the start date for all CRAN downloads.
I'm not sure how to get the date published, maybe from the GitHub API, or maybe from the first timepoint at which the package has more than 0 CRAN downloads from RStudio... Not from the CRAN page for the package anyway, since it only gives the date of the latest version. But I'll let you think about this :wink:
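For the "first timepoint with more than 0 downloads" idea, here is a rough sketch. It assumes the daily download counts are already in hand as (date, count) pairs. `first_download_date` is a hypothetical helper, and the sample numbers are invented.

```python
from datetime import date

def first_download_date(daily_downloads):
    """Return the earliest date with a positive download count, or None.

    This is only a proxy for the publication date: a package could have been
    published before its first logged download.
    """
    positive_days = [day for day, count in daily_downloads if count > 0]
    return min(positive_days) if positive_days else None

# Invented example: zero downloads until 3 March.
downloads = [
    (date(2016, 3, 1), 0),
    (date(2016, 3, 2), 0),
    (date(2016, 3, 3), 4),
    (date(2016, 3, 4), 7),
]
proxy_published = first_download_date(downloads)
```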
Also, I know we want to look into forecasting, but my passion for aberration detection makes me wonder if we could link peaks in the number of stars or downloads to specific events (I'd guess package releases or new versions of R). :angel: But this might be out of scope for a post about forecasting popularity.
Perhaps it might be good to separate out the package column into package and author?
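If the package column holds GitHub-style `author/package` strings (that's an assumption about the data, I haven't checked), the split could look roughly like this. `split_repo` and the example value are hypothetical.

```python
def split_repo(slug):
    """Split an 'author/package' string into (author, package).

    Assumes a GitHub-style slug. If there is no slash, the author is
    returned as None and the whole string is treated as the package name.
    """
    author, sep, package = slug.partition("/")
    if not sep:
        return None, slug
    return author, package

# Hypothetical value, not taken from the real data.
author, package = split_repo("someauthor/somepkg")
```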
Also, visualising this is harder than I thought.