ckan / ckanext-harvest

Remote harvesting extension for CKAN

I cannot understand how to use this extension, please share me more tips #530

Open fishfree opened 1 year ago

fishfree commented 1 year ago

Questions are as below:

  1. How do I confirm that a third-party CKAN site is harvestable, for example data.gov.uk? I don't know what to enter in the URL field on the /harvest/new page. Is the URL of any CKAN site harvestable?
  2. How do I find out the configuration, as described here, of a third-party CKAN site such as data.gov.uk?
  3. How do I make my own CKAN site harvestable? Is that a built-in core feature, or is it enabled by this extension?

Thank you very much!

martabracho commented 10 months ago

I'll try to answer:

  1. To check whether a CKAN site is harvestable, append "api/3/action/package_search" to its URL. In your example, "https://ckan.publishing.service.gov.uk/api/3/action/package_search" returns a JSON response listing the datasets. In the URL field, enter only the site root: "https://ckan.publishing.service.gov.uk".
  2. I don't have much experience with the configuration options; the only one I have gotten to work is "organizations_filter_include": ["organization-name"].
  3. To make your CKAN harvestable, I think you need to install the DCAT plugin.
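
For point 2, here is a minimal sketch of how that configuration could be produced as JSON text to paste into the harvest source's configuration field. The helper name `build_harvest_config` is hypothetical, and "organization-name" is the placeholder from the answer above, not a real organization; only the `organizations_filter_include` key comes from this thread.

```python
import json


def build_harvest_config(include_orgs):
    """Return JSON text for the harvest source configuration field.

    Hypothetical helper: only the "organizations_filter_include" key is
    taken from the thread; everything else here is illustrative.
    """
    orgs = list(include_orgs)
    if not orgs:
        raise ValueError("expected at least one organization name")
    return json.dumps({"organizations_filter_include": orgs}, indent=2)


# "organization-name" is a placeholder; replace it with the name (slug)
# of an organization on the remote site.
print(build_harvest_config(["organization-name"]))
```

Generating the text with `json.dumps` rather than typing it by hand is just a way to avoid quoting mistakes; the field itself only needs the resulting JSON object.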

metaodi commented 10 months ago

There are different kinds of harvesters, so for a default CKAN site to be harvestable you only need a publicly available API (specifically the package_search endpoint).

So you only need the dcat extension if you want to use the DCAT harvester.
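
The package_search check described in this thread can also be scripted. This is a rough sketch using only the Python standard library; the function names are made up for illustration, and the test it applies (a response with "success": true and a "result" object containing "count") is an assumption about what a usable response looks like, not something the extension itself verifies this way.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen


def package_search_url(site_url, rows=0):
    """Build the package_search URL from a CKAN site root."""
    base = site_url.rstrip("/") + "/"
    # rows=0 asks for the dataset count only, keeping the response small.
    return urljoin(base, "api/3/action/package_search") + f"?rows={rows}"


def looks_harvestable(payload):
    """Return True if the payload looks like a successful package_search reply."""
    result = payload.get("result")
    return (
        payload.get("success") is True
        and isinstance(result, dict)
        and "count" in result
    )


def check_site(site_url, timeout=10.0):
    """Fetch package_search from the site and inspect it (needs network access)."""
    with urlopen(package_search_url(site_url), timeout=timeout) as resp:
        return looks_harvestable(json.load(resp))
```

Calling `check_site("https://ckan.publishing.service.gov.uk")` would query the site from the example above. It raises an error from `urllib` if the API is not publicly reachable, which is itself a sign that the site cannot be harvested through this endpoint.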