This should close #17 and close #14. For #17, we add a cache dictionary so that the metadata (`string_payload`) for a record is not rebuilt when a record with the same digest hash has already been processed. The difference is noticeable on WARC files in which several records share the same digest hash: the whole process takes less time. For #14, we add a `return` to the `metadata_sidecar` method. I also added assertions to the tests to cover both changes.
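For reviewers, here is a minimal sketch of the caching idea, not the actual code in this PR. It assumes the cache is keyed by the record's digest hash; `build_string_payload` and `payloads_for_records` are hypothetical names used only for illustration, and the payload contents are made up.

```python
import json


def build_string_payload(digest, content):
    # Hypothetical stand-in for the expensive metadata step;
    # the real string_payload is built differently.
    return json.dumps({"digest": digest, "length": len(content)})


def payloads_for_records(records):
    """Build one payload per record, reusing cached results.

    `records` is an iterable of (digest, content) pairs (assumed shape).
    Returns the payloads plus the number of times the expensive step ran.
    """
    cache = {}   # digest hash -> previously built string_payload
    builds = 0
    payloads = []
    for digest, content in records:
        if digest not in cache:
            # First time this digest is seen: do the work once.
            cache[digest] = build_string_payload(digest, content)
            builds += 1
        payloads.append(cache[digest])
    return payloads, builds


records = [
    ("sha1:AAA", b"hello"),
    ("sha1:BBB", b"world!"),
    ("sha1:AAA", b"hello"),  # duplicate digest, served from the cache
]
payloads, builds = payloads_for_records(records)
```

With three records and two distinct digests, the expensive step runs twice instead of three times, which is where the time saving on duplicate-heavy WARC files would come from.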