ckan / ckanext-harvest

Remote harvesting extension for CKAN
130 stars 203 forks source link

Gather stage failed #294

Open davidheuckrodt opened 7 years ago

davidheuckrodt commented 7 years ago

Hey.

I try to get data in my ckan-instance by using the ckanext-harvest extension. I have to use a proxy so I already put this settings in my the /etc/environment, apache.wsgi and ckanharvester.py files.

There was a problem before, which I'am was able to solve. But now I get this error message in the gather_consumer.log

2017-07-05 09:21:29,158 DEBUG [ckanext.harvest.queue] Received harvest job id: fb70cddf-4d37-4690-9b66-d60363aeb3e4

2017-07-05 09:21:29,163 DEBUG [ckanext.harvest.harvesters.ckanharvester] In CKANHarvester gather_stage (https://opendata.swiss) 2017-07-05 09:21:29,165 DEBUG [ckanext.harvest.harvesters.ckanharvester] Last error-free job: None

2017-07-05 09:21:29,165 DEBUG [ckanext.harvest.harvesters.ckanharvester] Searching for CKAN datassets: https://opendata.swiss/api/3/action/package_search?sort=id+asc&start=0&rows=100

2017-07-05 09:23:36,446 INFO [ckanext.harvest.harvesters.ckanharvester] Searching for all datasets gave an error: Error sending request to search remote CKAN instance https://opendata.swiss using URL u'https://opendata.swiss/api/3/action/package_search?sort=id+asc&start=0&rows=100'. Error: URL error: [Errno 110] Connection timed out

2017-07-05 09:23:36,462 INFO [ckanext.harvest.model] Unable to search remote CKAN for datasets:EError sending request to search remote CKAN instance https://opendata.swiss using URL u'https://opendata.swiss/api/3/action/package_search?sort=id+asc&start=0&rows=100'. Error: URL error: [Errno 110] Connection timed out url:https://opendata.swissterms:[]

2017-07-05 09:23:36,471 ERROR [ckanext.harvest.queue] Gather stage failed

Can anyone help me with this issue?

thenets commented 7 years ago

Hi, @davidheuckrodt. Can you try the same havest process using a non HTTPS connection? I'm having some issues with harvest from HTTPS portals too.

metaodi commented 6 years ago

@davidheuckrodt errors like these ("Connection timed out") are most likely a problem of the server rather then the client.