marda-alliance / metadata_extractors_schema

Archive of MaRDA Metadata Extractors Schema. See datatractor/schema for the current repository.
https://github.com/datatractor/schema
MIT License

Add `associated_file_extensions` to `FileType` schema #48

Closed: ml-evs closed this 8 months ago

ml-evs commented 8 months ago

Currently, even when using the API and registry in tandem, you still have to choose yourself which extractor to run.

This PR implements the `associated_file_extensions` field for `FileType`, which can be used to provide an initial hint of which extractor to run. The hint should be used carefully: a file extension does not guarantee a format, so extractors should fail gracefully when run on file types they do not understand.
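As a rough illustration of how a client might use the new field, here is a minimal sketch of extension-based matching. The registry entries, their IDs, the leading-dot convention for extensions, and the `candidate_file_types` helper are all assumptions made for this example; they are not part of the schema or the API.

```python
from pathlib import Path

# Hypothetical FileType entries; the IDs and extensions are illustrative only,
# and the leading-dot convention is an assumption, not taken from the schema.
FILE_TYPES = [
    {"id": "example-csv", "associated_file_extensions": [".csv"]},
    {"id": "example-archive", "associated_file_extensions": [".tar.gz", ".tgz"]},
    {"id": "example-table", "associated_file_extensions": [".csv", ".tsv"]},
]


def candidate_file_types(path, file_types=FILE_TYPES):
    """Return the IDs of file types whose associated extensions match the file name.

    The result is only a hint: several file types may share an extension,
    and a matching extension does not prove the file has that format.
    """
    name = Path(path).name.lower()
    return [
        ft["id"]
        for ft in file_types
        # Suffix matching (rather than Path.suffix) handles multi-part
        # extensions such as ".tar.gz"; a missing field means no match.
        if any(
            name.endswith(ext.lower())
            for ext in ft.get("associated_file_extensions", [])
        )
    ]
```

Because more than one file type can claim the same extension, the helper returns a list of candidates rather than a single answer. A caller would still need to try the corresponding extractors in turn and handle failures.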

Related to #45, and takes ideas from #9.