Open frabcus opened 11 years ago
Some big data sets are distributed as multiple CSV i.e. Science museum catalogue: http://api.sciencemuseum.org.uk/documentation/collections/ Or the Companies House data: http://download.companieshouse.gov.uk/en_output.html
It would be useful to have a tool to append data to a dataset with reporting on numbers of repeated lines (if any)
Another use case would be for datasets collected as Excel (for example) and progressively updated and uploaded)
See https://github.com/scraperwiki/tool-requests/issues/12#issuecomment-18794417
Some big data sets are distributed as multiple CSV i.e. Science museum catalogue: http://api.sciencemuseum.org.uk/documentation/collections/ Or the Companies House data: http://download.companieshouse.gov.uk/en_output.html
It would be useful to have a tool to append data to a dataset with reporting on numbers of repeated lines (if any)
Another use case would be for datasets collected as Excel (for example) and progressively updated and uploaded)
See https://github.com/scraperwiki/tool-requests/issues/12#issuecomment-18794417