GSA / data.gov

Main repository for the data.gov service
https://data.gov

O+M 2024-04-22 #4705

Closed Jin-Sun-tts closed 4 weeks ago

Jin-Sun-tts commented 5 months ago

As part of the day-to-day operation of Data.gov, there are many Operation and Maintenance (O&M) responsibilities. Instead of having the entire team watch notifications and risk some slipping through the cracks, we have created an O&M Triage role. One person on the team is assigned the Triage role, which rotates each sprint. This is not meant to be a 24/7 responsibility; it covers East Coast business hours only. If you are unavailable, please note in Slack when you will be out and ask someone to take on the role for that time.

Check the O&M Rotation Schedule for future planning.

Acceptance criteria

You are responsible for all O&M responsibilities this week. We've highlighted a few so they're not forgotten. You can copy each checklist into your daily report.

Daily Checklist

Weekly Checklist

Monthly Checklist

Ad-hoc Checklist

Reference

Jin-Sun-tts commented 5 months ago

DOE's /harvest/arm-data-json has been down for two days.

Screenshot 2024-04-22 at 12 59 20 PM

Jin-Sun-tts commented 5 months ago

Tuesday 04/23

https://github.com/GSA/data.gov/


Reran the failed commit job; it succeeded.

Check Catalog Auto Tasks

Check Harvesting Emails

Update: Could not access the DOE site for this harvest source (https://www.archive.arm.gov/data/), but the job finished without error today. It looks like their server goes up and down.

Checked catalog and inventory production; both work fine.

Also checked the Solr leader and followers; all are working normally.

Jin-Sun-tts commented 5 months ago

Tuesday 04/23

https://github.com/GSA/data.gov/

Screenshot 2024-04-25 at 9 45 55 AM

Check Catalog Auto Tasks

Check Harvesting Emails

Harvest Source: NASA Data.json
Organization: nasa-gov
Created: 2024-04-24 16:52:45.861624
Finished: 2024-04-24 16:54:02.582310
    - Error loading json content: not enough values to unpack (expected 2, got 0).
    - ProxyError getting json source: HTTPSConnectionPool(host='data.nasa.gov', port=443): Max retries exceeded with url: /data.json (Caused by ProxyError('Unable to connect to proxy', RemoteDisconnected('Remote end closed connection without response'))).

The error occurred on April 23, 2024, but the job succeeded on April 22, 2024. This issue has occurred intermittently in the past, which suggests a possible resource availability problem.
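For anyone reading these reports later: the first error line ("not enough values to unpack (expected 2, got 0)") is the message Python raises when a two-name assignment receives an empty sequence. It may well be a side effect of the failed download rather than a separate problem, though that is an assumption about the harvester code, not something confirmed here. A minimal sketch of how that message arises, with `fetch_source` as a purely hypothetical stand-in:

```python
def fetch_source():
    """Hypothetical stand-in for a loader that normally returns two values
    but hands back an empty tuple when the download fails."""
    return ()


def describe_unpack_failure():
    """Try the two-name unpack and return the resulting error message."""
    try:
        content, error = fetch_source()  # expects exactly two values
    except ValueError as exc:
        return str(exc)
    return "no error"


print(describe_unpack_failure())
# prints: not enough values to unpack (expected 2, got 0)
```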

Harvest Source: DOI EDI
Organization: doi-gov
Created: 2024-04-24 16:52:45.119525
Finished: 2024-04-24 16:53:46.331441
    - Error loading json content: not enough values to unpack (expected 2, got 0).
    - HTTPError getting json source: 504 Server Error: Gateway Time-out for url: https://datainventory.doi.gov/data.json.

The job run on April 24, 2024, at 5:23 PM was successful. This connection error had not occurred previously.
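Both failures above (the proxy error and the 504) cleared on a later run, so they look transient. One way to check a source by hand before re-queuing a job is to fetch it with a few retries and exponential backoff. The sketch below uses only the Python standard library (`urllib`), which is not what the harvester logs show; `fetch_with_retry` and its parameters are hypothetical names chosen for illustration.

```python
import time
import urllib.error
import urllib.request


def fetch_with_retry(url, attempts=3, base_delay=1.0,
                     opener=urllib.request.urlopen, sleep=time.sleep):
    """Return the response body for url, retrying transient failures.

    Hypothetical helper: waits base_delay, 2 * base_delay, ... between tries.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            with opener(url, timeout=30) as response:
                return response.read()
        except urllib.error.HTTPError as exc:
            # 4xx responses will not fix themselves; only retry 5xx (e.g. 504).
            if exc.code < 500 or attempt == attempts - 1:
                raise
        except (urllib.error.URLError, ConnectionError, TimeoutError):
            # Proxy failures, dropped connections and timeouts land here.
            if attempt == attempts - 1:
                raise
        # Back off before the next attempt: 1x, 2x, 4x ... base_delay.
        sleep(base_delay * 2 ** attempt)
```

Calling `fetch_with_retry("https://datainventory.doi.gov/data.json")` would try up to three times and re-raise the last error if every attempt fails. The `opener` and `sleep` arguments exist only so the function can be exercised without network access.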

Checked catalog and inventory production; both work fine.

Also checked the Solr leader and followers; all are working normally.

FuhuXia commented 5 months ago

As one user pointed out, the harvester /harvest/energy-json appears to be using the wrong URL. The current source, https://www.energy.gov/sites/default/files/2023-04/pdl040323.json, is frozen at the 2023-04 time frame. The dynamic URL https://www.energy.gov/data.json seems to be the correct one; it redirects to the file for the current year and month. @hkdctol
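Other harvest sources may be pinned to dated files in the same way. A rough sketch of a check that flags URLs with a `/YYYY-MM/` path segment and reports how many months old that segment is; `months_behind` is a hypothetical helper, and the assumption that such a segment marks a frozen snapshot comes only from the energy.gov example above.

```python
import re
from datetime import date
from urllib.parse import urlparse

# Matches a /YYYY-MM/ path segment such as "/2023-04/".
_DATED_SEGMENT = re.compile(r"/(\d{4})-(\d{2})/")


def months_behind(url, today=None):
    """Return how many months the first YYYY-MM path segment lags behind
    today, or None when the URL path has no such segment."""
    today = today or date.today()
    match = _DATED_SEGMENT.search(urlparse(url).path)
    if match is None:
        return None
    year, month = int(match.group(1)), int(match.group(2))
    if not 1 <= month <= 12:
        return None  # looks like a date but is not a valid month
    return (today.year - year) * 12 + (today.month - month)
```

With a reference date in April 2024, the pinned energy.gov file comes back as 12 months behind, while https://www.energy.gov/data.json returns `None` because its path carries no date.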