-
Hi, thanks for the great example on training RoBERTa with long attention.
I followed this example: https://github.com/allenai/longformer/blob/master/scripts/convert_model_to_long.ipynb
Was able to s…
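In case it helps others, here is a minimal sketch of the position-embedding extension step from that notebook as I understand it (my own condensation; `roberta-base` and the 4096-token target length are assumptions, and the swap to Longformer self-attention is not shown):
```python
# Minimal sketch, assuming roberta-base and a 4096-token target length.
# Only the position-embedding copy step is shown here.
import torch
from transformers import RobertaForMaskedLM, RobertaTokenizerFast

model = RobertaForMaskedLM.from_pretrained("roberta-base")
tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base", model_max_length=4096)

max_pos = 4096 + 2  # RoBERTa reserves the first two position ids
old_embed = model.roberta.embeddings.position_embeddings.weight.data
new_embed = old_embed.new_empty(max_pos, old_embed.size(1))

# keep the reserved positions, then tile the 512 learned embeddings to fill the rest
new_embed[:2] = old_embed[:2]
k, step = 2, old_embed.size(0) - 2
while k < max_pos:
    n = min(step, max_pos - k)
    new_embed[k:k + n] = old_embed[2:2 + n]
    k += n

model.roberta.embeddings.position_embeddings.weight.data = new_embed
model.roberta.embeddings.position_ids = torch.arange(max_pos).unsqueeze(0)
model.config.max_position_embeddings = max_pos
```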
-
Hello,
I added a bit of similar code (adding the xlmroberta tokenizer and encoder description) to use xlm-roberta-base from Hugging Face instead of the BERT encoder model. The problem is that when I try to train on xlm…
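For context, this is roughly the kind of change I mean (a minimal sketch using the Hugging Face auto classes, not the repo's actual wiring; the placeholder sentence is just for illustration):
```python
# Minimal sketch of swapping the BERT encoder/tokenizer for xlm-roberta-base.
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
encoder = AutoModel.from_pretrained("xlm-roberta-base")

batch = tokenizer(["a placeholder sentence"], padding=True, return_tensors="pt")
hidden = encoder(**batch).last_hidden_state  # (batch, seq_len, 768)
```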
-
Hi,
It seems from the source code that XLM-RoBERTa is fine-tuned with gradient updates based on the LSTM attention model. However, when I follow the README instructions and train the model on hi…
-
Would you please provide sample code showing, in particular, how to construct a LocalImageGenerator instance?
```
let imageGenerator = LocalImageGenerator(
    queue: queue, configurations: conf…
```
-
![image](https://user-images.githubusercontent.com/45490378/185097604-ad164b3c-8a60-4f49-94a6-dc4268f0a1fb.png)
Why does it need the "--model" parameter when I give a specific config? And what does "que…
-
When using the mengzi-t5-base model from Hugging Face after converting it with the script, I get the following error:
```
RuntimeError: Error(s) in loading state_dict for Model:
size mismatch for embedding.word_embedding.weight: copying a param with shape torch.S…
```
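A small diagnostic I ran on my side (not part of the repo; the checkpoint path is hypothetical) to see the embedding shape stored in the converted checkpoint, since this kind of size mismatch usually means the checkpoint and config disagree on the vocab size:
```python
# My own diagnostic: inspect the converted checkpoint's embedding shape so it
# can be compared with the vocab size the training config expects.
import torch

state = torch.load("mengzi-t5-base-converted.pt", map_location="cpu")  # hypothetical path
print(state["embedding.word_embedding.weight"].shape)  # rows = checkpoint vocab size
```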
-
@JRosenkranz This looks amazing; I learned about this lib at vllm yesterday. I am trying to run `bge-m3` using this custom modeling code for https://github.com/michaelfeil/infinity . I am aware that thi…
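For sanity-checking outputs, a plain dense-embedding baseline could look like this (a sketch on my side, using sentence-transformers rather than infinity's own API):
```python
# Hedged baseline sketch: dense bge-m3 embeddings via sentence-transformers,
# used only to compare against outputs from the custom modeling code.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("BAAI/bge-m3")
emb = model.encode(["what is bge-m3?"], normalize_embeddings=True)
print(emb.shape)  # (1, 1024) dense vectors
```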
-
Hello, my training data is on the order of ~100k examples, and I ran the following two sets of experiments:
1. Fine-tuning the embedding model and the reranker on the same data: the fine-tuned embedding model performs better than the un-finetuned general model, but the fine-tuned reranker is clearly worse than before fine-tuning.
2. Mining hard negatives with the fine-tuned embedding model and then fine-tuning the reranker (see the sketch after this list): it is still worse than before fine-tuning.
In both experiments the reranker converges normally, and the evaluation…
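The hard-negative sampling in experiment 2 is roughly the following (a sketch with toy data; the model path, corpus, and top-k cutoff are placeholders on my side):
```python
# Sketch of hard-negative mining with the fine-tuned embedding model;
# queries/corpus/positives are toy placeholders.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("path/to/finetuned-embedding-model")  # hypothetical path
queries = ["example query"]
corpus = ["relevant passage", "distractor passage a", "distractor passage b"]
positives = {0: {"relevant passage"}}

q_emb = model.encode(queries, normalize_embeddings=True)
d_emb = model.encode(corpus, normalize_embeddings=True)
scores = q_emb @ d_emb.T  # cosine similarity, since embeddings are normalized

top_k = 15
for qi in range(len(queries)):
    ranked = np.argsort(-scores[qi])
    hard_negs = [corpus[di] for di in ranked if corpus[di] not in positives[qi]][:top_k]
    print(hard_negs)
```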
-
I installed the **nlptoolkit** package through pip, but the following line repeatedly gives me an error:
`from nlptoolkit.utils.config import Config`
I tried upgrading pandas and tqdm, as these a…
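In case it matters, this is the quick check I run (my own diagnostic, not from the package docs) to see which `nlptoolkit` distribution is actually installed, since several PyPI projects share similar names and their submodule layouts differ:
```python
# My own diagnostic: confirm which nlptoolkit is installed and where it lives.
import importlib
import importlib.metadata

mod = importlib.import_module("nlptoolkit")
print(mod.__file__)                              # install location
print(importlib.metadata.version("nlptoolkit"))  # installed release
```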
-
Hi,
I am blocked from achieving low-latency responses because of the tokenizer computation for the `stsb-xlm-r-multilingual` model.
Does anyone have an idea how to get a fast tokenizer for `stsb-xlm-r-multilingua…
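For what it's worth, the first thing I would try (assuming a recent `transformers`/`tokenizers` install) is loading the Rust-backed fast tokenizer directly, since the model is XLM-R based:
```python
# Sketch: request the Rust-backed fast tokenizer; prints True if it loaded.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained(
    "sentence-transformers/stsb-xlm-r-multilingual", use_fast=True
)
print(tok.is_fast)
```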