Open FuhuXia opened 5 months ago
The team is leaning towards #3.
Document the steps to get the full job report programmatically for registered catalog.data.gov users.
Create a token at /user/[YOUR-USER-NAME]/api-tokens.
Get the last job id. Go to https://catalog.data.gov/api/action/harvest_source_show?id=[YOUR-HARVEST-SOURCE] and note the last_job id.
From the command line, with curl and jq installed, this can be done with:
curl -s "https://catalog.data.gov/api/action/harvest_source_show?id=[YOUR-HARVEST-SOURCE]" | jq '.result.status.last_job.id'
Download the JSON report from https://catalog-prod-admin-datagov.app.cloud.gov/api/action/harvest_job_report?id=[LAST-JOB-ID]
From the command line, this can be done with:
curl -H "Authorization: [YOUR-API-TOKEN]" "https://catalog-prod-admin-datagov.app.cloud.gov/api/action/harvest_job_report?id=[LAST-JOB-ID]"
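The two curl calls above could be chained into one small script, so the job id does not have to be copied by hand. This is only a sketch: the function names, the `CKAN_API_TOKEN` environment variable, and the "null" check are assumptions for illustration and are not part of the documented steps. The URLs and the `Authorization` header are taken from the commands above.

```shell
#!/bin/sh
# Sketch only: function names and CKAN_API_TOKEN are hypothetical helpers,
# not something provided by catalog.data.gov or ckanext-harvest.
set -eu

CATALOG_URL="https://catalog.data.gov"
ADMIN_URL="https://catalog-prod-admin-datagov.app.cloud.gov"

# Print the id of the last harvest job for harvest source $1.
# jq -r strips the JSON quotes so the id can be reused in a URL.
last_job_id() {
  curl -s "${CATALOG_URL}/api/action/harvest_source_show?id=$1" \
    | jq -r '.result.status.last_job.id'
}

# Print the JSON report for job id $1, authenticating with token $2.
job_report() {
  curl -s -H "Authorization: $2" \
    "${ADMIN_URL}/api/action/harvest_job_report?id=$1"
}

# Look up the last job of harvest source $1 and print its full report.
full_report() {
  job_id="$(last_job_id "$1")"
  # jq prints "null" when the source has no last_job (assumed failure case).
  if [ -z "$job_id" ] || [ "$job_id" = "null" ]; then
    echo "no last job found for harvest source: $1" >&2
    return 1
  fi
  job_report "$job_id" "${CKAN_API_TOKEN:?set CKAN_API_TOKEN to your API token}"
}
```

After sourcing the file, something like `CKAN_API_TOKEN=[YOUR-API-TOKEN] full_report [YOUR-HARVEST-SOURCE] > report.json` would save the report to a file.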
We have received multiple requests from agencies to access full harvest error reports.
ckanext-harvest only includes the top 20 errors in the report. Agency users complain that 20 is too few when they are fixing large sources with hundreds of errors. There are three ways to accomplish this.