uktrade / stream-read-xbrl

Python package to parse Companies House accounts data in a streaming way
https://stream-read-xbrl.docs.trade.gov.uk/
MIT License
17 stars 2 forks source link

feat: save both start and end date when syncing to S3 #112

Closed michalc closed 1 year ago

michalc commented 1 year ago

This is to avoid missing files, for example a monthly file and a daily file on the last day of the month.

It's not guaranteed that a montly file is exactly all of its daily files, so we have to ingest all of them and then put the onus on client code to deduplicate.