Closed shivanraptor closed 1 year ago
Due to a lack of information, the cause of the problem cannot be determined at this stage. In the meantime, you might want to try to isolate the problem by trying to see if it works in a vanilla Linux/Python environment or C++ CL tool (spm_train).
If there are no further discussions, we will automatically close this bug on May 1.
I'm trying to train
SentencePiece
with the following code:In the last line, it loads forever, and Jupyter Notebook indicates it's still running (marked as
[*]
in the block), but the kernel activity indicator at the top shows the Kernel is idle. No file is being generated in the directory, and no error has been generated.What could be the reason? Is the
train()
still running? or stuck?The
blogs.zip
comes from hereSample data of
blog_data.txt
: