GSA / datagov-ckan-multi

Other
10 stars 6 forks source link

USGS Harvest Sources Not Harvesting #542

Closed thejuliekramer closed 3 years ago

thejuliekramer commented 3 years ago

These harvest sources confirmed as active by USGS staff are not harvesting properly and errors are not occurring in production. I've documented individual errors to debug below.

source error
national-elevation-dataset-ned-1-arc-second-collection Could not harvest WAF link http://localhost: HTTPConnectionPool(host='localhost', port=80)
usgs-national-geologic-map-database Validation Error - name or id missing value
usgs-ned-1-3-arc-second-contours Could not harvest WAF link http://localhost: HTTPConnectionPool(host='localhost', port=80)
national-elevation-dataset-ned-alaska-2-arc-second-collection Validation Error - name or id missing value
usgs-national-watershed-boundary-dataset-wbd-downloadable-data-collection Validation Error - name or id missing value
5-meter-alaska-digital-elevation-models-dems-usgs-national-map Transformation to ISO Failed
alaska-orthorectified-radar-intensity-image-usgs-national-map Transformation to ISO Failed
usgs-national-boundary-dataset-nbd-downloadable-data-collection Transformation to ISO Failed
usgs-land-cover-woodland-downloadable-data-collection Transformation to ISO Failed
fsa-10-1-naip-imagery-collection Validation Error - name or id missing value
usgs-lidar-point-cloud-las-harvest-source 443 timeout when importing harvest source
usgs-ned-original-product-resolution-opr-downloadable-data-collection 443 timeout when importing harvest source
usgs-us-topo-maps 443 timeout when importing harvest source

Acceptance Critera

The following harvest sources are importing properly:

thejuliekramer commented 3 years ago

related to #510

thejuliekramer commented 3 years ago

@hkdctol

Screen Shot 2021-01-05 at 1 33 44 PM

national-elevation-dataset-ned-alaska-2-arc-second-collection has its collection_metadata_url in production as localhost - we need this to be updated - can you ask your contact about this one?

hkdctol commented 3 years ago

@thejuliekramer Have reached out to USGS lead on this issue and copied you on email.

thejuliekramer commented 3 years ago

Updated above with additional harvest sources from the USGS spreadsheet Todo items: 1. Need to update the above in the main progress spreadsheet. 2. Need to check if sources that are failing are failing in production 3. Ask REI team if there is anything we can do about timeouts

hkdctol commented 3 years ago

@thejuliekramer did you see in email the Alaska source - USGS says they fixed the URL

thejuliekramer commented 3 years ago

Moving to QA - and updating the above still not working in the main harvest source progress spreadsheet

hkdctol commented 3 years ago

This ticket can be moved to done, as the only sources left are the ones to be added manually