Dataset summaries: concept counts

lexibank / pylexibank

The python curation library for lexibank

Apache License 2.0


Dataset summaries: concept counts #231

Closed chrzyki closed 1 year ago

chrzyki commented 3 years ago

For a number of datasets (particularly those with multiple concept lists, e.g. https://github.com/lexibank/backstromnorthernpakistan), the concept summary statistics suggest more concepts than are actually used in the dataset. Is this something we want to fix (e.g. by reporting only the count of set(CONCEPTICON_ID) or similar)?

SimonGreenhill commented 3 years ago

I think there are two counts: set(CONCEPTICON_ID) and set(parameters.csv::NAME)
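
To make the difference between the two counts concrete, here is a rough sketch, not the pylexibank implementation. It assumes parameters.csv has columns called `Name` and `Concepticon_ID` (pass other names if the dataset differs). `concept_counts` is a hypothetical helper and the sample rows are made up for illustration.

```python
import csv
import io


def concept_counts(parameters_csv_text, name_col="Name", concepticon_col="Concepticon_ID"):
    """Count rows, distinct concept names, and distinct non-empty Concepticon IDs.

    The column names are assumptions about the layout of parameters.csv.
    """
    rows = list(csv.DictReader(io.StringIO(parameters_csv_text)))
    # Distinct concept labels, ignoring empty cells.
    names = {row[name_col] for row in rows if row.get(name_col)}
    # Distinct Concepticon mappings; unmapped concepts have an empty cell.
    concepticon_ids = {row[concepticon_col] for row in rows if row.get(concepticon_col)}
    return {
        "rows": len(rows),
        "names": len(names),
        "concepticon_ids": len(concepticon_ids),
    }


# Made-up example: two concept lists both contain "hand", one concept is unmapped.
sample = """ID,Name,Concepticon_ID
1_hand,hand,1277
2_hand,hand,1277
3_arm,arm,1673
4_forearm,forearm,
"""

counts = concept_counts(sample)
print(counts)
```

With rows like these, the plain row count is higher than either set-based count, which is the kind of inflation described above for datasets with multiple concept lists.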

xrotwang commented 1 year ago

addressed by https://github.com/lexibank/pylexibank/commit/6075738ab3452095267c7443915f5416e7ff5589