-
1. 我已搜索相关问题,但无法获得预期的帮助。
**描述错误**
运行默认工作流,弹出如下错误:
相关目录已经有相关的文件但是任然提示找不到。
!!! Exception during processing!!! Unable to load vocabulary from file. Please check that the provided vocabulary is…
-
Hey guys,
thanks so much for releasing all the t51.1 and mt5 weights!
I'm currently working on porting all these models to huggingface's transformers.
Is there anyway to run mesh tensorflow in…
-
Similar to #2689, I'm having an issue where computing the `principal_component` quality metric hangs at 0% on Linux when run as part of a script. Similar to that issue, it seems to require multiple pa…
-
Hello, I am trying to train the monolingual model using the given run.sh script.
I have successfully trained the monolingual model for English using mT5. The evaluation results are:
TSR: 66.98 …
-
### Describe the bug
I installed text generation webui and downloaded the model(TheBloke_Yarn-Mistral-7B-128k-AWQ) and I can't run it. I chose Transofmer as Model loader. I tried installing autoawq b…
-
Hello. It seems that original Grossmend's model was deleted: [https://huggingface.co/Grossmend/rudialogpt3_medium_based_on_gpt2](https://huggingface.co/Grossmend/rudialogpt3_medium_based_on_gpt2)
Do …
-
I just download the docker that you provided (cuda11 version) and I find the version of torch is the cpu version.
I run the code "python app/hydit_app.py" in the container shows:
root@docker-des…
-
### This issue is part of our **Doc Test Sprint**. If you're interested in helping out come [join us on Discord](https://discord.gg/J8bW9u5abB) and talk with other contributors!
Docstring examples …
-
### What happened?
Hello,
I've been experimenting with some Olive passes on a custom model containing a transformer and some extra layers. Using the passes seem to slow down both the throughput and …
-
The regular MBart model (and not MBart-50) for example, has a config property `decoder_start_token_id` that needs to be updated after the model is trimmed. The model pulls this id from the config duri…