-
[OpenAI Whisper API](https://platform.openai.com/docs/api-reference/audio/verbose-json-object) response has been updated and it no longer gives the "words" key in the response due to which whisper_str…
-
Traceback (most recent call last):
File "inference.py", line 3, in
from preprocess_data import preprocess_batch
File "/data/jquan/codes/paraphraser-master/paraphraser/preprocess_data.py", …
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-4.9.151-015.ali3000.alios7.x86_64-x86_64-with-glibc2.17
- Python version: 3.8.18
- Huggingface_hub version: 0.23.2
- Safetenso…
-
您好, 非常感谢您能够开源代码!
在复现您的代码时, 我注意到在`data_loader.py`文件的第130行, 代码直接导入了保存好的Word2Vec的模型, 如下所示:
```
w2v = pickle.load(open(word_embedding_path, 'rb'))
```
论文中提到FSRU使用了公开的word2vec模型:
> We utilize publ…
-
In the Multilingual word Embeddings section, the Arabic file is not correct. It is only ~417MB and there is no \n at the end of the file like other languages.
-
Hi, I'm trying to reproduce the result reported in your paper. However, when I tried
python supervised.py --src_lang $src --tgt_lang "de $tgt" --src_emb /data/embeddings/$src.emb.txt --tgt_emb "/data…
-
Hi I have a list of arabic text and I want to extract keywords of each list element, for this I'm following the documentation ,
So I started by initiating the keybert model with this model
`from k…
-
project_layer的参数应该与bert.embeddings.word_embeddings_2的参数对应
-
Hello!
I have been really excited about your work! I attempted to use Palu for model compression on the Qwen2 series models, but regardless of the compression rate I set, I seem to encounter signif…
-
- [x] 1. Use Gensim to train word embeddings (why can't we just use pre-trained word embeddings? Maybe we ask Wei if we can just use pretrained word2vec)
- [ ] 2. Take individual job descriptions for…