CivicSpleen / ambry

A comprehensive data package manager
BSD 2-Clause "Simplified" License
4 stars 5 forks source link

Export to CKAN #131

Closed ericbusboom closed 8 years ago

ericbusboom commented 8 years ago

Implement a connection to CKAN, http://ckan.org, to push Ambry packages to CKAN, based on the access and source metadata.

[ This issue isn't really detailed enough to complete the task, so give it a best shot and we'll deal with details later. ]

CKAN has these fundamental organizational concepts:

Ambry bundles are mapped to CKAN datasets. Ambry partitions are mapped to resources. When a bundle is published to CKAN:

There should be some additional Ambry configuration for uploading to CKAN:

Don't publish bundles for these access values:

Publish publicly, with no access restrictions. The organization is based on the identity.source. :

Publish to any logged in CKAN user. The organization is based on the identiy.source. :

Publish only to the organization of the user who owns the data library that the publications are being made from. The organization is the organization configured by the user. :

It would probably be easiest to install CKAN using docker: http://extensions.ckan.org/extension/datacats/

nmb10 commented 8 years ago

Done. Requires CKAN setup as described here - https://github.com/CivicKnowledge/ambry/tree/develop#ckan-export

Found problems:

  1. API released on http://demo.ckan.org does not delete datasets - it marks them as deleted. So publishing dataset after deleting fails with 'Url already in use' error. Newer versions of the API have dataset_purge method for complete deleting.
  2. Couldn't make user_role_bulk_update to work. Now it's individual role update for each user of organization.

Exporting dataset:

ambry ckan_export <dataset_vid>
ericbusboom commented 8 years ago

On Nov 4, 2015, at 6:11 AM, Kazbek notifications@github.com wrote:

Done. Requires CKAN setup as described here - https://github.com/CivicKnowledge/ambry/tree/develop#ckan-export

Found problems:

  1. API released on http://demo.ckan.org does not delete datasets - it marks them as deleted. So publishing dataset after deleting fails with 'Url already in use' error. Newer versions of the API have dataset_purge method for complete deleting.

    • Couldn't make user_role_bulk_update to work. Now it's individual role update for each user of organization.

Excellent, thanks. I’ll have you work on the census data packages next, but I’ll need some time to describe what to do. We’ll create a new git repo for them, since there will be a lot of them ( probably about 50 )

eric.


Eric Busboom (619) 363 2607
eric@busboom.org http://www.busboom.org