Problems with Greek text

computationalstylistics / stylo

R package for stylometric analyses

Problems with Greek text #55

Open andreash22 opened 1 year ago

andreash22 commented 1 year ago

I am analyzing (Biblical) Greek text with stylo. Everything works fine with stylo(), but oppose() does not work: even if I set the slice size to a very low value, I get the error message "this text is too short". I also tried load.corpus.and.parse() and then ran oppose() on the resulting stylo corpus (the preparation seemed to work, as far as I could see in R), but then I received the error message "wrong corpus format". Do you have any idea what the problem might be?

locusclassicus commented 2 months ago

Same problem with Ancient Greek; have you found a solution?

andreash22 commented 2 months ago

A colleague who works with Cyrillic letters has written to me explaining how she solved the problem. It is rather complicated; I hope to try it by next week and will then post again.

computationalstylistics commented 2 months ago

Well, the thing is that the oppose() function so far comes with a very simple GUI, which doesn't provide direct access to many settings. The user therefore needs to set these options manually. In the case of Greek/Cyrillic/Arabic, please run the function with an additional parameter:

oppose(corpus.lang = "Other")

locusclassicus commented 2 months ago

Thank you for your reply; corpus.lang = "Other" did not work for me, but splitting.rule = "[ \t\n]+" eventually did the trick.
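
For anyone landing here later, a minimal sketch of what that splitting rule does, using base R only. The sample sentence and the ASCII-only contrast rule are my own illustrations, not stylo's actual defaults; the idea is just that a tokenizer which recognizes no Greek letters would return no words at all, which might explain a "too short" message. The oppose() call at the end only repeats the setting reported above and assumes the usual primary_set and secondary_set folders sit in the working directory.

```r
# Illustrative Greek sample (John 1:1); any UTF-8 text would do.
greek <- "Ἐν ἀρχῇ ἦν ὁ λόγος,\nκαὶ ὁ λόγος ἦν πρὸς τὸν θεόν"

# Split on runs of spaces, tabs and newlines, as in the workaround above.
words <- strsplit(greek, "[ \t\n]+")[[1]]
words <- words[nzchar(words)]        # drop empty strings, if any

# Contrast: a hypothetical rule that accepts only ASCII letters as word
# characters (an assumption for illustration, NOT stylo's real default).
ascii_only <- strsplit(greek, "[^A-Za-z]+")[[1]]
ascii_only <- ascii_only[nzchar(ascii_only)]

length(words)        # number of tokens found by whitespace splitting
length(ascii_only)   # number of tokens found by the ASCII-only rule

# The setting reported to work in this thread; guarded so that the
# sketch still runs where stylo or the corpus folders are missing.
if (interactive() &&
    requireNamespace("stylo", quietly = TRUE) &&
    dir.exists("primary_set") && dir.exists("secondary_set")) {
  stylo::oppose(splitting.rule = "[ \t\n]+")
}
```

Note that whitespace splitting keeps punctuation attached to words (e.g. "λόγος," with the comma), so it may be worth stripping punctuation from the texts beforehand.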