sethmlarson / pypi-data

Data about packages and maintainers on PyPI
Apache License 2.0
122 stars 8 forks source link

Add `classifiers` table #29

Closed edgarrmondragon closed 3 months ago

edgarrmondragon commented 4 months ago

Closes https://github.com/sethmlarson/pypi-data/issues/28

edgarrmondragon commented 4 months ago

This is an example query

select name, count(distinct package_name) from classifiers where name like 'Programming Language :: Python :: 3%' group by 1 order by 2 desc;
name count(distinct package_name)
Programming Language :: Python :: 3 254803
Programming Language :: Python :: 3.8 99431
Programming Language :: Python :: 3.9 93880
Programming Language :: Python :: 3.7 86250
Programming Language :: Python :: 3.10 80917
Programming Language :: Python :: 3.6 78359
Programming Language :: Python :: 3.11 60005
Programming Language :: Python :: 3.5 46703
Programming Language :: Python :: 3.4 35560
Programming Language :: Python :: 3 :: Only 29212
Programming Language :: Python :: 3.12 26282
Programming Language :: Python :: 3.3 15959
Programming Language :: Python :: 3.2 5643
Programming Language :: Python :: 3.1 1498
Programming Language :: Python :: 3.13 1404
Programming Language :: Python :: 3.0 1083
jonathan-s commented 4 months ago

Also interested in getting this into the dataset.

jonathan-s commented 4 months ago

Also code looks good to me!

edgarrmondragon commented 3 months ago

Ping @sethmlarson :)