ContinuumIO / anaconda-package-data

Conda package download data
Creative Commons Attribution 4.0 International
98 stars 36 forks source link

Include `.conda` packages #45

Open jakirkham opened 1 year ago

jakirkham commented 1 year ago

It would be helpful to include both .conda & .tar.bz2 packages. Particularly as more of the former and less of the latter are produced. May also help to track these separately to track the transition to the newer format

jezdez commented 1 month ago

@jakirkham FTR, this has been prioritized and get more attention again

jakirkham commented 1 month ago

Thanks Jannis! 🙏

Please let us know if you need more info from us or need us to test anything 🙂

wolfv commented 1 week ago

The 1970 issues were actually issues in our code. Sorry about that!

wolfv commented 1 week ago

We just fixed things on our end, but it appaears that the pipeline to produce this data is not really working anymore?

The latest data is 2024-06...

jezdez commented 6 days ago

Huh, I'd check with @cappadona about it, he was working on an analysis

cappadona commented 6 days ago

Hi all. We've been running some analysis on the dataset in response to everyone's feedback and will share our findings when this is complete.

In the interim, responding to some of the recent questions in this thread...


@wolfv

We just fixed things on our end, but it appaears that the pipeline to produce this data is not really working anymore?

The latest data is 2024-06...

The latest data available in the s3 bucket is for 2024-05, which was made available in June. We have temporarily paused publishing new data until we complete the QA.

The 1970 issues were actually issues in our code. Sorry about that!

Thank you. This is one issue that we haven't been able to reproduce.


@jakirkham @phwuil @nicrie

Notably:

False alarm -- addressed by Wolf

This is the main focus of our QA effort and we're tentatively planning to replace data beginning in 2022-06 to address the undercounting.

Temporarily paused publishing new data (see my response above)

We still need to dig into the download counter displayed on anaconda.org. I will also comment on each of those issues.