NASA-PDS / harvest

Standalone Harvest client application providing the functionality for capturing and indexing product metadata into the PDS Registry system (https://github.com/nasa-pds/registry).
https://nasa-pds.github.io/registry
Other
4 stars 3 forks source link

harvest.log summary not correct #149

Open plawton-umd opened 9 months ago

plawton-umd commented 9 months ago

Checked for duplicates

Yes - I've already checked

🐛 Describe the bug

When I did ran havest and reviewed the log, I noticed the summary listed 269 files as skipped and 3 files as failed.

🕵️ Expected behavior

I expected correct information.

Reviewing the 269 skipped files - it was actually 150 unique files that were skipped one or multiple times. The skips were because the LIDVID/LID LID already exists in registry database. In a few specific cases reviewed, the file was loaded within minutes before the skip. However, they appear not to be included in the "Loaded_files" count. They are accessible via an OpenSearch query.

Reviewing the 3 failed files - it was 2 unique files. One had 2 different errors "sessions" - (it appears) the 2nd one happened although it had been loaded- although the first error session looks like it was not loaded. Both files are accessible via an OpenSearch query. If that is not how the log should be read, the error message needs to list the file that failed please.

📜 To Reproduce

  1. I do not know how to force these errors.

🖥 Environment Info

🩺 Test Data / Additional context

N/A

🦄 Related requirements

Tightly coupled with

⚙️ Engineering Details

N/A