exponential-decay / demystify

Engine for analysis of Siegfried export files and DROID CSV. The tool has three purposes: to break the export into its components and store them within a SQLite database; to create additional columns to augment the output where useful; and to query the SQLite database, outputting results in a readable form useful for analysis by researchers and archivists within digital preservation departments in memory institutions. The tool will find duplicates, unidentified files, blacklisted objects, character encoding issues, and more.
http://www.openplanetsfoundation.org/blogs/2014-06-03-analysis-engine-droid-csv-export
zlib License

Output integration test needed #64

Open ross-spencer opened 2 years ago

ross-spencer commented 2 years ago

The Python 3 work is close to being finished. String handling in the output is not ideal, but it works; a refactor and a rethink of how the output is produced should happen in the near future. From this perspective, integration tests should ensure that data is output without error and perhaps make some basic content checks, e.g. for specific headers. Primarily, though, Unicode/string handling errors are the biggest risk here, so making sure they aren't triggered across the depth of test data we've created will make a difference.
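As a rough illustration of the kind of check described above, here is a minimal sketch in Python 3. Everything named in it is an assumption made for the example: `render_report` is a hypothetical stand-in for the real output routine (it is not demystify's API), the `Summary Statistics` header is invented, and the sample rows are made up rather than taken from the project's test data. The idea is simply to write the report to a UTF-8 stream, confirm nothing raises, and then do a basic header and content check.

```python
import io


def render_report(rows, stream):
    """Hypothetical stand-in for the tool's output routine (not the real API).

    Writes an invented header line followed by one tab-separated line per file.
    """
    stream.write("Summary Statistics\n")
    for name, puid in rows:
        stream.write(f"{name}\t{puid}\n")


def run_output_check(rows, expected_header="Summary Statistics"):
    """Render to a UTF-8 byte stream and check that the result round-trips."""
    raw = io.BytesIO()
    # newline="\n" keeps line endings the same on every platform.
    text = io.TextIOWrapper(raw, encoding="utf-8", newline="\n")
    render_report(rows, text)  # an encoding problem would raise here...
    text.flush()
    decoded = raw.getvalue().decode("utf-8")  # ...or here
    # Basic content checks: the header comes first and every name survives.
    assert decoded.splitlines()[0] == expected_header
    for name, _ in rows:
        assert name in decoded
    return decoded


# Illustrative rows only; a real test would draw on the existing test data.
SAMPLE_ROWS = [
    ("café.txt", "x-fmt/111"),
    ("文件.pdf", "fmt/18"),
    ("emoji-🎉.bin", "UNKNOWN"),
]

output = run_output_check(SAMPLE_ROWS)
```

Writing through `io.TextIOWrapper` over a `BytesIO` forces a real encode step, which is where string handling problems would tend to surface, without touching the filesystem or depending on the console's encoding.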