KBNLresearch / pdfbatchqa

Apache License 2.0
0 stars 0 forks source link

Manage XML output file size #8

Open bitsgalore opened 1 day ago

bitsgalore commented 1 day ago

Especially with ExifTool output, the XML files van get bulky very quickly. Some ideas on how to manage this:

  1. Split output across multiple XML files, with user-defined upper limit on no. of reported PDFs per file. Then use numerical suffix like this:

    • pdfbatchqa001.xml
    • pdfbatchqa002.xml
    • ...
    • pdfbatchqa999.xml
  2. Support user-defined formatting string that controls reported ExifTool elements