Closed by kevinreiss 1 year ago
@mzelesky Is naming the "new" files in this fashion enough for Alma to "know" which files to pick up when it connects to pull these weekly? https://gist.github.com/mzelesky/3e4a0167c00899839fd1d5f79e8bb047#file-datasync_error_processing-rb-L54
Yes, Alma keeps track of which files have been loaded.
Implementation Notes
See gist for details. MARC collection files should be limited to 20,000 records each. If more records than that were processed, we need to split them into multiple files for submission to Alma. https://gist.github.com/mzelesky/4382296a6fa4bfac7c68a4139ccbb60e#file-datasync_xref_processing-rb-L68
See https://github.com/pulibrary/lib_jobs/pull/595 for parallel work on the Exception Files.
Note that the 20,000-record limit can be enforced using the new config value added when ticket #580 was completed.
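For illustration, splitting processed records into files of at most 20,000 could look roughly like the sketch below. This is not the code from the gist or from lib_jobs: the constant `MAX_RECORDS_PER_FILE`, the method `batch_file_names`, and the `_1.xml` naming pattern are all hypothetical, and the real limit would presumably come from the project's config rather than a hard-coded constant.

```ruby
# Hypothetical default; in practice the limit would be read from config.
MAX_RECORDS_PER_FILE = 20_000

# Splits records into batches of at most max_per_file and pairs each batch
# with a numbered file name (the naming pattern is an assumption).
def batch_file_names(records, base_name, max_per_file: MAX_RECORDS_PER_FILE)
  records.each_slice(max_per_file).each_with_index.map do |batch, index|
    ["#{base_name}_#{index + 1}.xml", batch]
  end
end

# Example: 45,000 placeholder records end up in three batches.
batches = batch_file_names((1..45_000).to_a, 'datasync_sample')
batches.each { |name, batch| puts "#{name}: #{batch.size} records" }
```

Using `each_slice` keeps the record order intact and leaves the remainder in a final, smaller file, so no records are dropped when the total is not a multiple of the limit.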