Closed thfrkielikone closed 3 months ago
Hi,
How does one properly build opus-fast-mosestokenizer such that it finds pybind11 and works without LD_LIBRARY_PATH hacks? Can this be fixed upstream? (Issues are disabled from the repo so I am mentioning this here)
I actually wasn't aware that issues were not enabled for our opus-fast-mosestokenizer
fork. I opened them now, if you could move the related discussion there.
Is fastText a relevant dep for the non-extra collection of deps? The git repo has been archived so it seems somewhat dead: https://github.com/facebookresearch/fastText
A reasonable suggestion, I think we can move fasttext to extras. Especially now that also lingua is supported, there's another good option for language detection.
Is cld2 a relevant dep for the non-extra collection of deps? It doesn't look as dead but at least I couldn't get the obvious ways to install it or to compile it to work. (I can ofc ask upstream about how it should be installed or compiled)
Indeed the same as above applies here.
Can eflomal's pypi package be updated? The library itself works perfectly when built from the repo, so this seems to be a strictly packaging-related problem.
PyPI does have the latest version of eflomal. I guess your problem is that it's tagged as a pre-release version (1.0.0b1), while the old version is tagged as 0.1, so the latter is installed if not additional restrictions are set. The packaging update was changing quite a lot of things (see details at here) and I wasn't sure if that would be the final shape of things, so didn't want to go to final 1.0.0 release.
Thanks for taking the time to answer. I detailed the opus-fast-tokenizer issues in the appropriate repo. The eflomal update was merged over a year ago, is there a current reason to keep it in the pre-release state? (I should have looked at the exact version when installing, though.)
Eflomal has now version 2.0.0 in PyPI. (This was a bit confusing, but 1.0.0 was actually tagged before 1.0.0b1, and as the new version is incompatible, I had to increase the major version to 2).
The fasttext and pycld2 libraries have now been changed to optional. @thfrkielikone, can you confirm if everything works now? The changes are in the develop
branch.
eflomal installs:
FROM fedora:40
RUN python3 -m ensurepip
RUN dnf install -y gcc g++ git cmake make
RUN pip3 install eflomal
opus-filter installs
FROM fedora:40
RUN python3 -m ensurepip
RUN dnf install -y gcc g++ git cmake make
RUN git clone https://github.com/Helsinki-NLP/OpusFilter.git
WORKDIR /OpusFilter
RUN git checkout develop
RUN pip3 install .
And both seem to work in practice in my environment. Again, thanks for fixing this.
Hi. I am trying to install opus-filter on fedora 40 in a docker container and have had to do the following workarounds to install it (here's the dockerfile for reproducibility):
patch-opus.patch
:My issues are thus: