Closed ChristophLeonhardt closed 3 years ago
Similar problem with Ubuntu 18.04:
germaparl_add_p_attribute_stem()
... decoding token stream for p-attribute 'word'
... adjusting encoding
... stemming
... running cwb-encode
Total size: 101013708 tokens (96.3M)
... reading existing registry file
... writing registry file
... calling cwb-makeall
=== Makeall: processing corpus GERMAPARL ===
Registry directory: /home/user/R/x86_64-pc-linux-gnu-library/3.4/GermaParl/extdata/cwb/registry
ATTRIBUTE word
- lexicon OK
- frequencies OK
- token stream OK (COMPRESSED)
- index OK (COMPRESSED)
========================================
Index compression requires the REVCORP component
Warnmeldung:
Ausführung von Kommando ''/home/user/R/x86_64-pc-linux-gnu-library/3.4/cwbtools/extdata/cwb/bin/cwb-compress-rdx' -r /home/user/R/x86_64-pc-linux-gnu-library/3.4/GermaParl/extdata/cwb/registry -P word GERMAPARL' ergab Status 1
When preparing the CRAN release of GermaParl, we thought that germaparl_add_p_attribute_stem()
was not sufficiently generic to be included in the package. We suggest to include the function in the cwbtools package, see the respective issue:
https://github.com/PolMine/cwbtools/issues/14
Might be a cwbtools problem as stemming seems to be performed: