PolMine / GermaParl

GermaParl R Data Package
12 stars 3 forks source link

germaparl_add_p_attribute_stem() doesn't work | p_attribute_encode problem? #6

Closed ChristophLeonhardt closed 3 years ago

ChristophLeonhardt commented 5 years ago

Might be a cwbtools problem as stemming seems to be performed:

> GermaParl::germaparl_add_p_attribute_stem()
... decoding token stream for p-attribute 'word'
... adjusting encoding
... stemming
... running cwb-encode
Total size: 101013708 tokens (96.3M)              
... reading existing registry file
... writing registry file
... calling cwb-makeall
=== Makeall: processing corpus GERMAPARL ===
Registry directory: ~/Library/R/3.5/library/GermaParl/extdata/cwb/registry
ATTRIBUTE word
 - lexicon      OK
 - frequencies  OK
 - token stream OK (COMPRESSED)
 - index        OK (COMPRESSED)
========================================
Index compression requires the REVCORP component
Warning message:
In system2(command = normalizePath(file.path(cwb_get_bindir(), if (.Platform$OS.type ==  :
  running command ''/usr/local/bin/cwb-compress-rdx' -r ~/Library/R/3.5/library/GermaParl/extdata/cwb/registry -P word GERMAPARL' had status 1
Studentenfutter commented 5 years ago

Similar problem with Ubuntu 18.04:

germaparl_add_p_attribute_stem()
... decoding token stream for p-attribute 'word'
... adjusting encoding
... stemming
... running cwb-encode
Total size: 101013708 tokens (96.3M)              
... reading existing registry file
... writing registry file
... calling cwb-makeall
=== Makeall: processing corpus GERMAPARL ===
Registry directory: /home/user/R/x86_64-pc-linux-gnu-library/3.4/GermaParl/extdata/cwb/registry
ATTRIBUTE word
 - lexicon      OK
 - frequencies  OK
 - token stream OK (COMPRESSED)
 - index        OK (COMPRESSED)
========================================
Index compression requires the REVCORP component
Warnmeldung:
Ausführung von Kommando ''/home/user/R/x86_64-pc-linux-gnu-library/3.4/cwbtools/extdata/cwb/bin/cwb-compress-rdx' -r /home/user/R/x86_64-pc-linux-gnu-library/3.4/GermaParl/extdata/cwb/registry -P word GERMAPARL' ergab Status 1 
ablaette commented 3 years ago

When preparing the CRAN release of GermaParl, we thought that germaparl_add_p_attribute_stem() was not sufficiently generic to be included in the package. We suggest to include the function in the cwbtools package, see the respective issue: https://github.com/PolMine/cwbtools/issues/14