Closed MiniXC closed 3 years ago
As of v0.5.2 this is now supported via a verbose
argument to .split(..)
, but only for the Python bindings. Like this:
(py38) bminixhofer@pop-os:~/Documents$ ipython
Python 3.8.5 (default, Sep 4 2020, 07:30:14)
Type 'copyright', 'credits' or 'license' for more information
IPython 7.18.1 -- An enhanced Interactive Python. Type '?' for help.
In [1]: import nnsplit
In [2]: splitter = nnsplit.NNSplit.load("en")
In [3]: _ = splitter.split(["Hello!"] * 100_000, verbose=True)
100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 100000/100000 [00:00<00:00, 221552.76it/s]
In [4]:
I'm currently using nnsplit on a fairly big dataset. Is it possible to track progress on a long list of inputs?