JuliaComputing / JuliaHub-Feedback

Public repo for filing JuliaHub issues

Package new user counts are misleading overcounts. #146

Open LilithHafner opened 2 years ago

LilithHafner commented 2 years ago

The counts are labeled "XXX new users in the last 30 days", but they seem to be computed as the sum of the unique addresses seen each day. The label implies they should be the number of previously unseen users (or IP addresses) over the 30-day span. With the current methodology, a user who downloads a package once per day from the same IP address is counted as "30 new users in the last 30 days", even if they are not a new user at all.
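To make the discrepancy concrete, here is a minimal sketch in Python. It is not JuliaHub's actual pipeline: the log format (a list of `(day, address)` pairs), the function names, and the example addresses are all assumptions made for illustration. It compares the suspected computation (summing daily unique addresses) with two alternatives: distinct addresses over the window, and addresses never seen before the window.

```python
from datetime import date, timedelta


def sum_of_daily_uniques(log, window):
    """Suspected current method: add up the distinct addresses seen on each day."""
    per_day = {}
    for day, addr in log:
        if day in window:
            per_day.setdefault(day, set()).add(addr)
    return sum(len(addrs) for addrs in per_day.values())


def unique_in_window(log, window):
    """Distinct addresses seen at least once during the window."""
    return len({addr for day, addr in log if day in window})


def new_in_window(log, window):
    """Distinct addresses seen in the window that never appeared before it."""
    start = min(window)
    seen_before = {addr for day, addr in log if day < start}
    in_window = {addr for day, addr in log if day in window}
    return len(in_window - seen_before)


# Hypothetical data: a 30-day window starting 2022-06-16.
start = date(2022, 6, 16)
window = {start + timedelta(days=i) for i in range(30)}

# One address downloads every day for 60 days (30 before the window, 30 inside it).
log = [(start + timedelta(days=i), "203.0.113.7") for i in range(-30, 30)]
# A second address appears for the first time on one day inside the window.
log.append((start + timedelta(days=10), "198.51.100.2"))

print(sum_of_daily_uniques(log, window))  # 31
print(unique_in_window(log, window))      # 2
print(new_in_window(log, window))         # 1
```

In this made-up log, the daily sum reports 31 even though only two addresses were active and only one of them is new. Note that the third function needs history from before the window, which daily per-day counts alone cannot supply.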

[Screenshot: Screen Shot 2022-07-15 at 9 31 56 AM]

StefanKarpinski commented 2 years ago

Yeah, you can't just add up the new users per day to get the new users for the month, so that should absolutely be fixed. However, I'm not sure we provide that information in a rollup, and I don't particularly want to add a whole new set of rollups, especially because the way we're generating this data is currently costing us a bunch of JuliaLang money. I also think the new user stats here are not terribly informative. I suspect that the non-CI download rank over the past year, aggregated by week, would be more informative.

StefanKarpinski commented 2 years ago

Bottom line:

Thanks for reporting, @LilithHafner