ufal / ParCzech

ParCzech is a project on compiling Czech parliamentary data into annotated corpora.
https://ufal.mff.cuni.cz/parczech
0 stars 1 forks source link

Release script #205

Open matyaskopp opened 10 months ago

matyaskopp commented 10 months ago

Implement release script in the same way as ParlaMint-UA finalize is implemented: https://github.com/ufal/ParlaMint-UA/blob/f0a64b45832f787295de70c54e01e3befaa9864f/Scripts/ParlaMint-UA-finalize.xsl

Use ParlaMint taxonomies with ParlaMint prefix:

And ParCzech and ParlaMint-CZ with ParCzech prefix:

matyaskopp commented 10 months ago

Script also changes audio paths from: [0-9]{4}ps/audio/<YYYY>/<MM>/<dd>/<YYYYMMdd><HHmm><HHmm>.mp3 to audio/psp/<YYYY>/<MM>/<dd>/<YYYYMMdd><HHmm><HHmm>.mp3

https://github.com/ufal/ParCzech/blob/ea119ef3b9a08c4fdaaf462d93e2f580c01750ae/src/tools/ParCzech-finalize.xsl#L281-L283

it will be stored in this way in separate repository record