Open orr721 opened 5 years ago
Hey, this looks like a missing bundler issue.
Do you have bundler installed?
What does `gem which bundler` say? If it is empty, please install bundler with `gem i bundler`.
And do you have multiple versions of Ruby installed on your system? If that's the case, I highly recommend using a version manager like rbenv.
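If it helps, here is a small Ruby sketch of the same check, run with the exact `ruby` you use for the demo. It reports which interpreter is active and which bundler versions that interpreter can see, which is handy when several Rubies are installed. The helper name `bundler_report` is made up for this example; the calls it uses (`Gem::Specification.find_all_by_name`, `RbConfig.ruby`) are standard RubyGems and Ruby.

```ruby
require "rubygems"
require "rbconfig"

# Hypothetical helper (not part of any gem): collects the running Ruby's
# version and path plus the bundler versions visible to this interpreter.
def bundler_report
  versions = Gem::Specification.find_all_by_name("bundler").map { |spec| spec.version.to_s }
  {
    ruby_version: RUBY_VERSION,
    ruby_path: RbConfig.ruby,
    bundler_versions: versions,
    bundler_installed: !versions.empty?
  }
end

report = bundler_report
puts "Ruby #{report[:ruby_version]} at #{report[:ruby_path]}"
if report[:bundler_installed]
  puts "bundler found: #{report[:bundler_versions].join(', ')}"
else
  puts "bundler not found for this Ruby; try: gem install bundler"
end
```

If the Ruby path printed here is not the one you expected, the gem was probably installed under a different Ruby than the one running the script.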
yep, that was the issue, `gem install bundler` fixed the error and the demo runs fine. bundler isn't listed as one of the run-time dependencies though.
any thoughts about the tokenizer question? thx.
Bundler is generally part of a standard Ruby setup.
While building this library I evaluated many tokenizers, including pragmatic_tokenizer. As I remember, it could not split some types of sentences the way I wanted, but the tokenizer gem, which I have included, did.
The preprocessor for Smalltext can accept your list of stopwords; I simply have not exposed that config yet. As for other languages, I can surely include an option to have the preprocessor take the language into account. Give me some time.
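To make the request concrete, here is a rough sketch of what an exposed config could look like: a preprocessor that takes a custom stopword list and any tokenizer that responds to `call`. This is an illustration only and not Smalltext's actual API; the class name `SketchPreprocessor`, its keyword arguments, and the regex-based default tokenizer are all assumptions made up for the example.

```ruby
require "set"

# Hypothetical preprocessor for illustration; NOT Smalltext's real class.
class SketchPreprocessor
  # Assumed default: lowercase the text and keep runs of letters and digits.
  DEFAULT_TOKENIZER = ->(text) { text.downcase.scan(/[[:alnum:]]+/) }

  # tokenizer: any object responding to #call(text) and returning an Array of Strings
  # stopwords: words to drop after tokenizing (compared case-insensitively)
  def initialize(tokenizer: DEFAULT_TOKENIZER, stopwords: [])
    @tokenizer = tokenizer
    @stopwords = Set.new(stopwords.map(&:downcase))
  end

  # Tokenize the text, then remove any token found in the stopword set.
  def call(text)
    @tokenizer.call(text).reject { |token| @stopwords.include?(token.downcase) }
  end
end

pre = SketchPreprocessor.new(stopwords: %w[the is a])
p pre.call("The demo is a success") # prints ["demo", "success"]
```

With this shape, a thin wrapper around another tokenizer (pragmatic_tokenizer, for instance) could be passed as `tokenizer:` without the library depending on it directly.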
hi,
wanted to try it out, but I'm not able to run your demo program (I have put it in an sm.rb file):
`gem install smalltext` completed successfully. any ideas what is wrong?
hey, and while I am at it, can you please consider support for using other tokenizers as well? I myself prefer `pragmatic_tokenizer` as it supports many languages, has stopwords, cleanup methods, etc. thank you.