-
I follow [BioGPT](https://github.com/microsoft/BioGPT)/[examples](https://github.com/microsoft/BioGPT/tree/main/examples)/[QA-PubMedQA](https://github.com/microsoft/BioGPT/tree/main/examples/QA-PubMed…
-
-
Hi, I want to use this method to preprocess many unlabeled face images. How can I use your pretrained model to classify and label them? Thank you very much!
-
The Duktape shared library (.dll, .so) is missing some functions documented in the API guide.
These functions are very useful for better communication with the embedding program (C program, or whatever…
-
https://aclanthology.org/2020.acl-main.124/
-
Hi Team,
Thanks for developing such a great library.
When using RecBole, I found that evaluation in full sort evaluation mode runs slowly. After reading the code, I found the reason from this li…
-
I want to run the T5 example with the command below, but I get an error. How can I fix it?
export CUDA_VISIBLE_DEVICES=2,3
bash tools/train.sh tools/train_net.py projects/T5/configs/mt5_pretrain.py 2
…
-
**Describe the bug**
When running `merge_lora_weights/merge.py` with TP and PP set to 1 on a fine-tuned minitron checkpoint, I run into the following error:
```sh
raise RuntimeError(f"world_size ({w…
-
### System Info
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points.
- `transformers` version: 4.29.2
- Platform: Linux-4.18.0-477.51.1.el8_8.x86_64-x86_64-with-gl…
-
Great work, thank you to you and your team!
I have some questions:
- When will the smaller model you mentioned in the paper be released and what are typical use cases for the large vs. small model…