OpenPecha / Botok

🏷 བོད་ཏོག [pʰøtɔk̚] Tibetan word tokenizer in Python
https://botok.readthedocs.io/
Apache License 2.0
58 stars 15 forks source link

test_tokenizer.py fail #3

Closed mikkokotila closed 6 years ago

mikkokotila commented 6 years ago

the function test_split_token() returns:

Loading Trie...
Time: 0.0014331340789794922
---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-8-c086f7958c84> in <module>()
----> 1 test_split_token()

<ipython-input-2-b4146ab2cb9c> in test_split_token()
     49     tok = Tokenizer(trie)
     50     tokens = tok.tokenize(PyBoTextChunks('གཏན་གྱི་བདེ་བའི་རྒྱུ།'))
---> 51     tok_utils = TokenSplit(tokens)
     52     tok_utils.split_affixed_particles()
     53     for t in tokens:

~/dev/mimic3/pybo/BoTokenUtils.py in __init__(self, tokens)
      8     def __init__(self, tokens):
      9         self.tokens = tokens
---> 10         self.matcher = BoMatcher()
     11 
     12     def split_affixed_particles(self):

TypeError: __init__() missing 1 required positional argument: 'query'
drupchen commented 6 years ago

This happens because is it still work in progress. This part should have been in a feature branch instead of the master...