Closed nassie256 closed 2 months ago
tokenizer and dictionary beforehand pip install fugashi[unidic-lite] - this installed tokenization library does not implement the full huggingface tokenizer API.
have you considered opening a feature request in this library?
Thank you for your answer. I understand that it is a compatibility issue on the fugashi side. We will consider submitting a feature request to the library.
This issue is closed as it has become clear that it is an issue that needs to be addressed on the model side.
Thanks for managing the issue and your reply! 😃
System Info
OS version: Ubuntu 22.04.3 LTS Model being used: hotchpotch/japanese-reranker-cross-encoder-large-v1 Hardware used (GPUs/CPU/Accelerator): NVIDIA GeForce RTX 3090 The current version being used: Python 3.11.7(pyenv virtualenvs), torch==2.2.1, transformers==4.39.3, sentence-transformers==2.6.1, infinity_emb==0.0.31
Information
Tasks
Reproduction
I am trying to use the reranker model for Japanese, "hotchpotch/japanese-reranker-cross-encoder-large-v1".
This works fine from the Python code. For example, the following code works perfectly
However, when I try to serve it from the CLI, I get the following error:
Expected behavior
I can use it from the CLI.