-
Hello. I'm looking to fine-tune BERT Multilingual on a specific closed-domain context in one of the languages that already exists in the multilingual language set (Romanian). My task will be to:
1)…
-
We want to prepare an example pipeline using static word embeddings https://github.com/CLARIN-PL/embeddings#available-models
-
In the second step:
```
Second, this project uses pretrained GloVe word embeddings of 100 dimensions, trained on 6B tokens. Download and extract under $DATA/embeddings_glove/.
```
The link to g…
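
For context on the quoted step: the pretrained GloVe text files store one token per line, followed by its vector components separated by spaces. Below is a minimal sketch of reading that format, assuming the 100-dimensional vectors mentioned above. The helper name `load_glove` is hypothetical and not part of the project, and the sample data is a three-dimensional stand-in rather than real embeddings.

```python
import io


def load_glove(handle, dim=100):
    """Parse GloVe text format: one `token v1 v2 ... v_dim` entry per line.

    `handle` is any iterable of text lines (an open file or a StringIO).
    Lines that do not have exactly `dim` components are skipped.
    """
    vectors = {}
    for line in handle:
        parts = line.rstrip("\n").split(" ")
        if len(parts) != dim + 1:
            # Malformed or wrong-dimension line: skip it rather than guess.
            continue
        token, components = parts[0], parts[1:]
        vectors[token] = [float(x) for x in components]
    return vectors


# Tiny in-memory stand-in for a real GloVe file (3 dimensions instead of 100).
sample = io.StringIO("the 0.1 0.2 0.3\ncat 0.4 0.5 0.6\nbroken 1.0\n")
embeddings = load_glove(sample, dim=3)
print(sorted(embeddings))
```

With a real download, the same function could be called on a file opened with `encoding="utf-8"` under the `$DATA/embeddings_glove/` directory named in the quoted instructions, leaving `dim` at its default of 100.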
-
Word models requested:
- [x] UK
- [x] Germany
- [x] France
- [x] Sweden
- [x] Netherlands
- [x] Finland
- [x] Ireland
- [x] Canada
- [ ] European Parliament
- [ ] Denmark -> check with Anne …
-
Model loaded succeed
2022-06-10 00:47:25,759 INFO: 1 batches, avg len: 77.5
Traceback (most recent call last):
File "D:\BaDouAI\SIFRank_zh-master\main.py", line 16, in
keyphrases = SIFRank(…
-
Thanks for sharing the code,
I am using Python 2 and got this error:
I set the default encoding to UTF-8 as follows; otherwise I get another encoding error.
import sys
reload(sys)
sys.setdefaultencoding('utf8')
(deep…
-
Hello,
I love this collection of notebooks! The book is great!
One thing. When using the GloVe embeddings, there seems to be a flaw in the implementation: When creating the embedding matrix, the…
-
### Have you completed your first issue?
- [X] I have completed my first issue
### Guidelines
- [X] I have read the guidelines
- [X] I have the link to my latest merged PR
### Latest Merged PR Lin…
-
When I run this code, it shows that the length of the tokenizer is 128001.
```
from transformers import AutoModelForSequenceClassification, AutoTokenizer
tokenizer = AutoTokenizer.from_pretraine…
-
### Your current environment
```
(vllm-gptq) root@k8s-master01:/workspace/home/lich/QuIP-for-all# pip3 list | grep aphrodite
aphrodite-engine 0.5.3 /workspace/home/lich/aphrodite-eng…