gbv / Catmandu-PICA

Catmandu modules for working with PICA+ data
https://metacpan.org/release/Catmandu-PICA
Other
4 stars 4 forks source link

Add support for normalized PICA+ format #47

Closed Phu2 closed 7 years ago

Phu2 commented 7 years ago

The GBV provides data in a "normalized" variant of the PICA+ plain format, have a look at the documentation https://verbundwiki.gbv.de/pages/viewpage.action?pageId=40009828 and the attached file test.txt

It would be nice to do a conversion without any replacing of characters upfront, eg. by

catmandu convert PICA --type normalized to PICA --type xml < test.txt

cKlee commented 7 years ago

@Phu2 Can you issue that within https://github.com/gbv/PICA-Data and close this one?

cKlee commented 7 years ago

@Phu2 Seems like this is already covered by PICA --type binary.

catmandu convert PICA --type binary to PICA --type xml < test.txt

seems to work properly.

Phu2 commented 7 years ago

Yeah, it works :) Thanks a lot, @cKlee !

powerriegel commented 6 years ago

Why do you add two \x1D at the end of each record? That won't work with our CBS. We have to remove the duplicate occurrence.

jorol commented 6 years ago

Could you please give an example of your conversion command and data as I couldn't reproduce the problem? I would also recommend to open a new issue.