pkiraly / qa-catalogue

QA catalogue – a metadata quality assessment tool for library catalogue records (MARC, PICA)
GNU General Public License v3.0
76 stars 18 forks source link

Failed to read PICA Plain format with PICA_PLAIN #481

Open nichtich opened 1 month ago

nichtich commented 1 month ago

Setting --marcFormat PICA_PLAIN seems to be useful to validate human-readable samples of PICA+ records, but apparently the PICA serialization is not the one described at https://format.gbv.de/pica/plain.

It looks like parsing pica plain is supported but only after additional configuration with "PICA related parameters" -2 <path> and -u <char>. The default should be to parse PICA Plain as defined at https://format.gbv.de/pica/plain

pkiraly commented 3 weeks ago

I think it is not a real issue. The error described in #482 is due to a different issue. Both -2 (--picaIdField) and -u (--picaSubfieldSeparator) has default values: 003@$0 and $. You can check it with

./qa-catalogue --schema PICA --params "--marcFormat PICA_PLAIN" --input src/test/resources/pica \
  --input-dir "." --mask pica-plain.pp --output ~/temp/test-06-14 completeness

@nichtich If you still can reproduce the error, please provide some details.