openpreserve / fido

Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.
http://openpreservation.org/technology/products/fido/
Other
147 stars 47 forks source link

API for identifying single files #94

Open bitsgalore opened 7 years ago

bitsgalore commented 7 years ago

It would really be useful if Fido's identification could be invoked from other Python scripts. Even though this is possible, Fido's lack of an API makes this unnecessairily difficult. See e.g. the examples below from the OPF Wiki:

http://wiki.opf-labs.org/display/KB/FIDO+Python+workflow+implementation+tips

Proper API functions would greatly simplify this. (E.g. something along the same lines as the checkOneFile function in jpylyzer)

(Ran into this while investigating this:https://github.com/KBNLresearch/omSipCreator/issues/23)

carlwilson commented 4 years ago

Looking at about 5d effort. @carlwilson to act as a reviewer for API design documentation.

carlwilson commented 1 year ago

Hackathon 2023 Review: Selected for first set of activities. @replaceafill perhaps you would take a look at this as you were already assigned?