ckan / ckanext-archiver

Archive CKAN resources
MIT License

Error with the download: 'ascii' codec can't encode character #74

Open abdelrahman146 opened 4 years ago

abdelrahman146 commented 4 years ago

Hello,

I'm developing a Spanish CKAN instance (version 2.8.3) on Ubuntu 16.04. The datasets are harvested from other Spanish CKAN platforms, and all the dataset information and titles are "ascii" encoded. The CKAN archiver gives me this error for all harvested documents:

 Link is broken
- Error with the download: 'ascii' codec can't encode character u'\xf3' in position 39: ordinal not in range(128)
This resource has failed 76 times in a row since it first failed: Noviembre 21, 2019
We do not have a past record of it working since the first check: Noviembre 21, 2019
Link checked: Enero 26, 2020

No cached copy available

Here is the archiver priority log:

2020-01-26 06:33:36,903 INFO  [rq.worker] ckan:default:priority: ckanext.archiver.tasks.update_package('/etc/ckan/default/production.ini', u'5bafb6c1-a2c0-455c-85fb-dc28bd7a987e') (fc0c08f$
2020-01-26 06:33:36,904 INFO  [ckan.lib.jobs] Worker rq:worker:opendata.4792 starts job fc0c08f9-3146-4e0a-8f28-0ed99302b1b5 from queue "priority"
2020-01-26 06:33:37,238 DEBUG [ckanext.harvest.model] Harvest tables already exist
2020-01-26 06:33:37,738 DEBUG [ckanext.harvest.model] Harvest tables already exist
2020-01-26 06:33:37,876 INFO  [ckanext.archiver.tasks] Starting update_package task: package_id=u'5bafb6c1-a2c0-455c-85fb-dc28bd7a987e' queue=bulk
2020-01-26 06:33:38,270 DEBUG [ckanext.harvest.model] Harvest tables already exist
2020-01-26 06:33:38,451 INFO  [ckanext.archiver.tasks] Attempting to download resource: http://geoserver.villanuevadelaserena.es:80/geoserver/LG3_WS_MapPublish_public/ows?service=WMS&reque$
2020-01-26 06:33:38,455 INFO  [ckanext.archiver.tasks] GET error: Download error - DownloadException("Error with the download: 'ascii' codec can't encode character u'\\xf3' in position 39:$
2020-01-26 06:33:38,456 INFO  [ckanext.archiver.tasks] API <function wms_1_3_request at 0x7fca26daecf8> error: DownloadException("Error with the download: 'ascii' codec can't encode charac$
2020-01-26 06:33:38,458 INFO  [ckanext.archiver.tasks] API <function wms_1_1_1_request at 0x7fca26daed70> error: DownloadException("Error with the download: 'ascii' codec can't encode char$
2020-01-26 06:33:38,459 INFO  [ckanext.archiver.tasks] API <function wfs_request at 0x7fca26daede8> error: DownloadException("Error with the download: 'ascii' codec can't encode character $
2020-01-26 06:33:38,462 INFO  [ckanext.archiver.tasks] Archival from before: <Archival Broken /dataset/vias-verdes/resource/bd78deb5-5ada-4154-90a5-f473a45ca9b3 75 failures>
2020-01-26 06:33:38,466 INFO  [ckanext.archiver.tasks] First_archival=False Previous_broken=True Failure_count=75
2020-01-26 06:33:38,466 INFO  [ckanext.archiver.tasks] Archival saved: <Archival Broken /dataset/vias-verdes/resource/bd78deb5-5ada-4154-90a5-f473a45ca9b3 76 failures>
2020-01-26 06:33:38,755 DEBUG [ckanext.harvest.model] Harvest tables already exist
2020-01-26 06:33:38,933 INFO  [ckanext.archiver.tasks] Attempting to download resource: http://geoserver.villanuevadelaserena.es/geoserver/wfs/ows?service=WFS&version=1.0.0&request=GetFeat$
2020-01-26 06:33:38,937 INFO  [ckanext.archiver.tasks] GET error: Download error - DownloadException("Error with the download: 'ascii' codec can't encode character u'\\xf3' in position 39:$
2020-01-26 06:33:38,938 INFO  [ckanext.archiver.tasks] API <function wms_1_3_request at 0x7fca26daecf8> error: DownloadException("Error with the download: 'ascii' codec can't encode charac$
2020-01-26 06:33:38,940 INFO  [ckanext.archiver.tasks] API <function wms_1_1_1_request at 0x7fca26daed70> error: DownloadException("Error with the download: 'ascii' codec can't encode char$
2020-01-26 06:33:38,941 INFO  [ckanext.archiver.tasks] API <function wfs_request at 0x7fca26daede8> error: DownloadException("Error with the download: 'ascii' codec can't encode character $
2020-01-26 06:33:38,943 INFO  [ckanext.archiver.tasks] Archival from before: <Archival Broken /dataset/vias-verdes/resource/4fecc22a-896c-49f4-a44c-5ba43605cda3 75 failures>
2020-01-26 06:33:38,947 INFO  [ckanext.archiver.tasks] First_archival=False Previous_broken=True Failure_count=75
2020-01-26 06:33:38,947 INFO  [ckanext.archiver.tasks] Archival saved: <Archival Broken /dataset/vias-verdes/resource/4fecc22a-896c-49f4-a44c-5ba43605cda3 76 failures>
2020-01-26 06:33:39,318 DEBUG [ckanext.harvest.model] Harvest tables already exist
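
For context, this class of error can be reproduced in isolation. The following is a minimal sketch written for Python 3 (CKAN 2.8 runs on Python 2, where the message shows u'\xf3' rather than '\xf3'). The sample string and the helper name `try_ascii` are hypothetical. The log lines above are truncated and do not show which string the archiver was encoding, so this illustrates the error itself and does not identify its cause in ckanext-archiver.

```python
from urllib.parse import quote

# Hypothetical sample text containing "ó" (U+00F3), the character named in the error.
text = u"Informaci\xf3n"


def try_ascii(value):
    """Return the ASCII-encoded bytes, or the error message if encoding fails."""
    try:
        return value.encode("ascii")
    except UnicodeEncodeError as exc:
        return str(exc)


# Encoding with the ascii codec fails because U+00F3 is outside range(128).
ascii_result = try_ascii(text)

# Encoding explicitly as UTF-8 succeeds.
utf8_bytes = text.encode("utf-8")

# If the text is meant to go into a URL, the UTF-8 bytes can be percent-encoded.
quoted = quote(utf8_bytes)

print(ascii_result)
print(utf8_bytes)
print(quoted)
```

Encoding explicitly as UTF-8, or percent-encoding the UTF-8 bytes when the text belongs in a URL, avoids the implicit ASCII step that raises the exception.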