pulibrary / lib_jobs

Enterprise Services batch processing tasks. Rails 7 Ruby 3.1.0

Datasync xref job failing since May 8th #783

Status: Closed (mzelesky closed this issue 4 months ago)

mzelesky commented 4 months ago

Expected behavior

Files are retrieved from the OCLC FTP server weekly.

Actual behavior

Files have not been retrieved since May 8th.

Steps to replicate

Impact of this bug

We are sending hundreds of files to OCLC daily and having them re-processed. This could leave records with multiple OCLC numbers once the files are processed.

Honeybadger link and code snippet

https://app.honeybadger.io/projects/100710/faults/107868248

Net::SSH::HostKeyMismatch: fingerprint SHA256:jzPlRQf9nD6aJEGymaXvLKfP0fq6PFhPSleRbLpM5X0 does not match for "filex-m1.oclc.org,132.174.0.174"
        host_keys.add_host_key(args[:key])
    end
    raise exception
  end
end
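
For anyone checking a host key by hand, the fingerprint in the error above can be compared against what is stored in known_hosts. The sketch below is a minimal illustration, not code from this repo: `known_hosts_fingerprint` is a hypothetical helper, and it assumes an unhashed known_hosts line of the form "host[,ip] key-type base64-blob". OpenSSH's SHA256 fingerprint is the base64-encoded SHA-256 digest of the decoded key blob with the trailing "=" padding removed.

```ruby
require "digest"

# Hypothetical helper: compute the OpenSSH-style SHA256 fingerprint for one
# known_hosts line. Assumes the unhashed layout
# "host[,ip] key-type base64-blob [comment]" with no leading @marker.
def known_hosts_fingerprint(line)
  _hosts, _key_type, blob = line.split(" ", 4)
  raw_key = blob.unpack1("m")                  # decode the base64 key blob
  digest = Digest::SHA256.digest(raw_key)      # 32 raw bytes
  "SHA256:" + [digest].pack("m0").delete("=")  # base64 without "=" padding
end

# Placeholder entry for illustration only; the blob is NOT a real host key.
placeholder_blob = ["placeholder key bytes"].pack("m0")
sample_line = "example.org,192.0.2.10 ssh-ed25519 #{placeholder_blob}"
puts known_hosts_fingerprint(sample_line)
```

If the value printed for the stored entry differs from the fingerprint in the exception, the stored key no longer matches what the server presents, which is the condition Net::SSH reports as a HostKeyMismatch.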

Implementation notes, if any

mzelesky commented 4 months ago

This also affects the newly cataloged job.

maxkadel commented 4 months ago

I have gone onto the server (ssh deploy@lib-jobs-prod2) and deleted that host from the known_hosts file, then ssh'd using the credentials we have on the server and confirmed it's a host we like, so we shouldn't hit that issue again.
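
The manual fix described above (dropping the stale entry from known_hosts) is what `ssh-keygen -R <host>` does on the command line. As a rough Ruby sketch of the same idea, assuming unhashed known_hosts entries and using a hypothetical helper name `forget_hosts` that does not exist in this repo:

```ruby
require "fileutils"

# Hypothetical helper: remove every known_hosts line whose host field names
# one of `hosts`. Returns the number of lines removed and keeps a ".bak"
# copy of the original file. Assumes unhashed entries; hashed entries
# (HashKnownHosts) and "[host]:port" forms are not matched by this sketch.
def forget_hosts(known_hosts_path, hosts)
  lines = File.readlines(known_hosts_path)
  kept = lines.reject do |line|
    # Leave blank lines and comments alone.
    next false if line.strip.empty? || line.start_with?("#")
    host_field = line.split(" ", 2).first
    host_field.split(",").any? { |name| hosts.include?(name) }
  end
  removed = lines.length - kept.length
  if removed.positive?
    FileUtils.cp(known_hosts_path, "#{known_hosts_path}.bak")
    File.write(known_hosts_path, kept.join)
  end
  removed
end
```

Called with the deploy user's known_hosts path (an assumption; the actual location on lib-jobs-prod2 is not stated here) and the two names from the error, `["filex-m1.oclc.org", "132.174.0.174"]`, this would drop the stale entry. The new key still has to be verified and accepted on the next connection, as was done by hand.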

Now it's just a question of recovering from the jobs that are behind / were missed.