marda-alliance / metadata_extractors_schema

Archive of MaRDA Metadata Extractors Schema. See datatractor/schema for the current repository.
https://github.com/datatractor/schema
MIT License
6 stars 1 forks source link

`Extractor:attributes:license:spdx` validate entries #24

Closed PeterKraus closed 4 months ago

PeterKraus commented 1 year ago

PR #21 introduced the option to specify licenses of Extractors using the SPDX license identifiers. However, we're not checking whether the supplied string is an actual identifier.

Option one is to check against a release of the SPDX data "statically".

Option two would be to probe an up-to-date online SPDX Identifier database. These seem to be queryable using the following format:

https://spdx.org/licenses/${identifier}.html

and will return a 404 if the ${identifier} is not a valid SPDX license identifier. We could leverage this and validate the provided entries against the live data.

ml-evs commented 10 months ago

Decided that this is not worth it for now --- could add a git submodule for e.g., https://github.com/spdx/license-list-data but for now we will leave validation to the registry.