clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/
41 stars 52 forks source link

TEITOK - allow cwb indexing on cluster #874

Closed matyaskopp closed 4 months ago

matyaskopp commented 4 months ago

Currently TEITOK builds cwb index directly on teitok machine without parallelization - it is extremelly slow because it does only a one corpus at time without any parallelization due to shared files.