Closed c-suh closed 1 year ago
@c-suh @jordanpadams: what's the format of the package_id
? Is it provided on the command-line? Is it derived from the data set ID? Can I use a random UUID?
@nutjob4life I looked at the solr doc xml files created by harvest, and the package_id
fields all had the same UUID. If I recall correctly, Jordan said that they are an identifier for that ingestion batch.
@c-suh ah, okay … so if they're per-batch, that suggests I can use an ad hoc UUID or provide a command-line option for the user to specify one.
From tag-up 2023-09-26: the package ID should be generated by the tool, not specified on the command-line
Undoing all changes to
src/main/java/gov/nasa/pds/citool/CITool.java
src/main/java/gov/nasa/pds/citool/CIToolIngester.java
src/main/java/gov/nasa/pds/citool/commandline/options/Flag.java
src/main/java/gov/nasa/pds/citool/ingestor/CatalogDB.java
src/main/java/gov/nasa/pds/citool/ingestor/CatalogVolumeIngester.java
src/main/java/gov/nasa/pds/citool/ingestor/Constants.java
src/main/java/gov/nasa/pds/citool/ingestor/ProductFactory.java
src/main/java/gov/nasa/pds/citool/registry/client/RegistryClientSolr.java
src/main/java/gov/nasa/pds/citool/registry/model/RegistryObject.java
src/main/java/gov/nasa/pds/citool/registry/model/Slots.java
src/main/java/gov/nasa/pds/citool/search/DocGenerator.java
src/main/resources/search-conf/defaults/pds/dataset.xml
src/main/resources/search-conf/defaults/pds/instrument.xml
src/main/resources/search-conf/defaults/pds/instrumenthost.xml
src/main/resources/search-conf/defaults/pds/investigation.xml
src/main/resources/search-conf/defaults/pds/target.xml
And making just one change to src/main/java/gov/nasa/pds/citool/search/DocWriter.java
Also suppressing stack traces from Solr communication in RegistryClientSolr.java
.
Checked for duplicates
Yes - I've already checked
🐛 Describe the bug
When I run
registry-mgr
on the output solr XML, there is an error because each doc does not have a package_id field🕵️ Expected behavior
I expected each doc to have a package_id so that registry-mgr is able to index PDS3 data (similar to what harvest does for PDS4 data)
📜 To Reproduce
./catalog --mode ingest --doc-config $CATALOG_HOME/search-conf/defaults/ --output-dir $REGISTRY_DATA_HOME/pds3/solr-docs/ --report-file $REGISTRY_DATA_HOME/pds3/log/JNOJNC_0024.log --target $REGISTRY_DATA_HOME/pds3/JNOJNC_0024
./registry-mgr <catalog's output-dir>
📚 Version of Software Used
2.1.0-SNAPSHOT