openknowledge-archive / dpm-py

DEPRECATED Data package manager written in Python
http://frictionlessdata.io/tools/
MIT License
8 stars 6 forks source link

Cannot upload non-tabular data as data validation fails #89

Closed subhankarb closed 7 years ago

subhankarb commented 7 years ago

We can't publish non-tabular data because we do tabular data validation of all resources (irrespective of type) on every publish.

More info: We validate the data prior to uploading to data package registry. Previously we focused on publishing csv data to DPR. But recently we tried geojson data and found a bug because dpmpy is throwing an error that geojson is invalid when it auto checks its validity [because it is not tabular data].

Ref: #88

Tasks

Acceptance Criteria:

Analysis

Options:

  1. Turn off default validation and make it an optional switch
  2. Skip data validation for any non-tabular data (tabular data = OR(format csv, filename ends in .csv, mediatype=text/csv)
  3. Validation option for geojson data [NO]

We recommend option 1 or 2 as we do not need data validation for geojson data (right now). For now we will just implement option 2 and can revisit option 1 later if needed.