ckan / ckanext-harvest

Remote harvesting extension for CKAN
130 stars 203 forks source link

Is there a way to filter/harvest only "datasets" type from portals that have other types of files? #513

Open raobregon opened 1 year ago

raobregon commented 1 year ago

Hello, I am searching for a way to only harvest datasets (type) from the following ckan portal : https://suche.transparenz.hamburg.de/?type=dataset&q=&sort=publishing_date+desc%2Ctitle_sort+asc&esq_not_all_versions=true

Does anybody has been able to harvest datasets with other filters than the regularly proposed by the harvester extension (organizations_filter_include, organizations_filter_exclude, groups_filter_include, groups_filter_exclude)? or do you know any solutions to filter datasets coming from harvested portals other than CKAN?

tino097 commented 1 year ago

@raobregon there are custom harvesters where you could use custom filters but keep in mind that you need to create mapping from source to CKAN metadata fields