clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/
41 stars 52 forks source link

insert new sample data #651

Closed matyaskopp closed 1 year ago

matyaskopp commented 1 year ago

Insert sample data created from release-ready files into the data branch. Source:

/project/corpora/Parla/ParlaMint/ParlaMint-full/Data/Corpora/Sample-ParlaMint-XX
matyaskopp commented 1 year ago

@TomazErjavec, I have inserted new sample data into data branch. Everything validates (I haven't done a manual inspection - I will do it later or not at all...).

ParlaMint-ES-GA data is missing, @adina-v provided (7th April, subj: ParlaMint-ES-GA corpus ) us full data, but it seems that you haven't downloaded and processed them.

TomazErjavec commented 1 year ago

ParlaMint-ES-GA data is missing

Now ready in the usual place.

TomazErjavec commented 1 year ago

Once all the corpora are processed for 3.0, this should be done again. Currently the samples are in (non GitHub directory) /Distro/Master/Sample-ParlaMint-*

Then again, I could just change script that generates the samples to put them in the Data directory, or symlink the Sample-ParlaMint-* to there.

TomazErjavec commented 1 year ago

OK, inserted samples generated from 3.0 full data. This is in the devel branch, sorry, but it works out that way easier in the current set up. If happy, @matyaskopp, pls. merge to main (and data I guess) and close this issue.

TomazErjavec commented 1 year ago

Merged, closing.