Open: feider opened this issue 3 years ago
I think each line is interpreted as a new document. Even with a "." added to the end of each line, there are still not multiple sentences per document.
Can you try joining all the lines into a single line, with a "." between them?
So the line would look like:
wer hat mich guoter uf getan . si ez iemen der mich kan . beidiu lesen und versten . ...
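For a longer text, the joining could be scripted rather than done by hand. The following is only a minimal sketch of that preprocessing step, not part of the tiling tool itself: the function name join_verses and the file names passed on the command line are hypothetical, and it assumes one verse per line in a plain ASCII/UTF-8 text file.

```python
import sys


def join_verses(lines, sep=" . "):
    """Join non-empty lines into one line, with sep between them and a final ' .'."""
    # Strip surrounding whitespace and drop blank lines.
    verses = [line.strip() for line in lines if line.strip()]
    if not verses:
        return ""
    # Separate the verses with " . " and close the last one with " ." as well.
    return sep.join(verses) + " ."


if __name__ == "__main__" and len(sys.argv) == 3:
    # Hypothetical usage: python join_verses.py input.txt output.txt
    with open(sys.argv[1], encoding="utf-8") as src:
        joined = join_verses(src)
    with open(sys.argv[2], "w", encoding="utf-8") as dst:
        dst.write(joined + "\n")
```

The output file then contains the whole text on one line, in the same shape as the example line above.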
Thank you for the suggestion, but that did not help. This way it just reordered the sentences/lines, similar to what happens with each verse on its own line and the -s option.
Hi,
my issue may be related to issue #4.
I call the tiling script with
sh topictiling.sh -tmd ../topicmodel -s -tmn model-final -fp "Wigalois.txt" -fd ../../data/pdf/ascii/ -out results -d
The LDA model (generated with JGibbLDA) and the file seem to be read correctly, and the file is printed out using the -d option. I use -s for simple segmentation, but also tried adding a "." at the end of every line instead. The text is in Middle High German, but converted to ASCII characters. Here is an example:
The full error output is here:
I tried openjdk-15, openjdk-7 and the current Oracle JRE. Is there anything I'm doing wrong, or anything different that I can try?
Kind regards