clowder-framework / extractors-s2orc-pdf2text

Extractor to convert pdf to text
Apache License 2.0
1 stars 0 forks source link

6 fix filename json output #7

Closed minump closed 1 year ago

minump commented 1 year ago

The output file names are fixed. Pyclowder now uploads the TEI.XML file and JSON file outputs as "input_filename".tei.xml and "input_filename".json to the same dataset. Better logging is done.

minump commented 1 year ago

Updated changelog. Updated branch to match main. Ready for review @lmarini