This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
29
stars
5
forks
source link
Python-ucto installation error on Google Colab #17
This morning I tried to install python-ucto on Colab by running:
!pip install python-ucto
Unfortunately, I got this error:
Building wheels for collected packages: python-ucto
error: subprocess-exited-with-error
× Building wheel for python-ucto (pyproject.toml) did not run successfully.
│ exit code: 1
╰─> See above for output.
note: This error originates from a subprocess, and is likely not a problem with pip.
Building wheel for python-ucto (pyproject.toml) ... error
ERROR: Failed building wheel for python-ucto
Failed to build python-ucto
ERROR: Could not build wheels for python-ucto, which is required to install pyproject.toml-based projects
With version 0.6.4 this worked fine, so I'm guessing there is a problem with version 0.6.5. Also, in my own python environment I had no problems installing the new version.
This morning I tried to install python-ucto on Colab by running:
!pip install python-ucto
Unfortunately, I got this error:
With version 0.6.4 this worked fine, so I'm guessing there is a problem with version 0.6.5. Also, in my own python environment I had no problems installing the new version.
Hope this can be fixed! :)