[x] Have you tried rerunning the command with the --clean flag?
Describe the issue
I encountered an ImportError requiring Chinese tokenization support, which conflicts with numpy and sklearn versions.
I installed MFA via conda, and the version of numpy is 1.26.4.
I tried aligning on a Mandarin corpus, but the terminal prompted that I needed to install dependencies.
ImportError: Please install Chinese tokenization support via pip install spacy-pkuseg dragonmapper hanziconv.
However, spacy-pkuseg requires numpy>=2.0.0. I attempted to run the command provided in the error message, which updated Numpy to 2.0.2. However, if I install Numpy 2.0.2, the dependency installed with MFA is sklearn 1.2.2, and these two packages seem to conflict. I encountered an error:
ValueError: numpy.dtype size changed, may indicate binary incompatibility. Expected 96 from C header, got 88 from PyObject
And I found a solution here: StackOverflow, which suggested downgrading numpy to 1.26.4.
I ultimately found a temporary solution by specifying an older version of spacy-pkuseg:
pip install spacy-pkuseg==0.0.33
This version only need numpy>=1.19.0.
I hope MFA can resolve this dependency issue and update the documentation, as there are no instructions indicating that I need to install these dependencies, but I receive a prompt when executing mfa align ....
For Reproducing your issue
Corpus structure
What language is the corpus in? Mandarin
How many files/speakers? 1 speaker, 1 audio and 1 text. Just for test.
Are you using lab files or TextGrid files for input? No.
Dictionary
Are you using a dictionary from MFA? If so, which one? Yes. mandarin_china_mfa
If it's a custom dictionary, what is the phoneset?
Acoustic model
If you're using an acoustic model, is it one download through MFA? If so, which one? Yes. mandarin_mfa
If it's a model you've trained, what data was it trained on?
Debugging checklist
mfa version
?--clean
flag?Describe the issue I encountered an ImportError requiring Chinese tokenization support, which conflicts with numpy and sklearn versions.
I installed MFA via conda, and the version of numpy is 1.26.4.
I tried aligning on a Mandarin corpus, but the terminal prompted that I needed to install dependencies.
However, spacy-pkuseg requires
numpy>=2.0.0
. I attempted to run the command provided in the error message, which updated Numpy to 2.0.2. However, if I install Numpy 2.0.2, the dependency installed with MFA is sklearn 1.2.2, and these two packages seem to conflict. I encountered an error:And I found a solution here: StackOverflow, which suggested downgrading numpy to 1.26.4.
I ultimately found a temporary solution by specifying an older version of spacy-pkuseg:
This version only need
numpy>=1.19.0
.I hope MFA can resolve this dependency issue and update the documentation, as there are no instructions indicating that I need to install these dependencies, but I receive a prompt when executing
mfa align ...
.For Reproducing your issue
mandarin_china_mfa
mandarin_mfa
To reproduce:
Log file sp1.log
Desktop (please complete the following information):
Additional context