ecosyste-ms / ost

A curated list of open technology projects to sustain a stable climate, energy supply, biodiversity and natural resources, based on data from https://opensustain.tech
https://ost.ecosyste.ms
GNU Affero General Public License v3.0
6 stars 1 forks source link

Missing projects in the download statistics #113

Open Ly0n opened 9 months ago

Ly0n commented 9 months ago

I compared the plots we did about Python dependence last year with the new dataset. Here some projects that are missing in the statistics that should be visible:

  1. https://pypi.org/project/xarray/ --> is listed but the download numbers are not correct: https://pypistats.org/packages/xarray
  2. https://pypi.org/project/plantcv/
  3. https://pypi.org/project/codecarbon/
  4. https://pypistats.org/packages/tsam
  5. https://github.com/powsybl/pypowsybl
  6. https://pypi.org/project/climetlab/
  7. https://pypi.org/project/intake-esm/
  8. https://github.com/Unidata/siphon
  9. https://github.com/oemof/oemof

I will also randomly go through the other programming languages filter out those with the most stars.

andrew commented 9 months ago

Most of those should be fixed once they next sync, although two won't fix themselves:

That's because there isn't a strong link between the package metadata on pypi and the repository on GitHub, will need to do some extra work to parse the github urls from the readmes of those projects to discover the repository url.

andrew commented 9 months ago

I've added some extra repository detection https://github.com/ecosyste-ms/packages/commit/62604e9c9da2ef7ad83c213f51b5ecf1c416cd44 which should find repositories for those last two packages.

Ly0n commented 9 months ago

I rebuild the statistics. Fantastic work @andrew . The only Python project missing at the moment is https://github.com/powsybl/pypowsybl.

andrew commented 9 months ago

@Ly0n https://github.com/powsybl/pypowsybl isn't listed in the readme: https://raw.githubusercontent.com/protontypes/open-sustainable-technology/main/README.md