-
Some weights of RobertaModel were not initialized from the model checkpoint at roberta-base and are newly initialized: ['roberta.pooler.dense.bias', 'roberta.pooler.dense.weight']
You should probably…
-
After training on a seperate machine we got some promising results, and we are now looking to move our model into production. However we encounter an issue. Downloading missing files and verifying the…
-
I plan to implement 2 transformer models, and finally choose the one which performs the best on evaluation.
T5 (Text-to-Text Transfer Transformer) is a powerful and flexible Transformer-based languag…
-
Hi there,
When I run finetune-exp2.sh, I get crash at the inference part:-
------
python3 inference.py --cktpath checkpoints/exp2/pflen5_iter5_loss1_1_2_lr0.0001_bsz2_seed128/checkpoint_best.pt
…
-
作者您好!在您的JMRL-DREEAM里,您描述的第一步将model.py更改为model.py.bak是如何操作的,单纯更改命名还是有单独相应的文件呢?
-
### Model description
I'm experimenting Funsd like dataset using layoutLmv3, I'm trying to replace roberta embedding with new bge_large, Is it possible to replace the embedding? will it improve the…
-
**Describe the bug**
For iOS devices with iOS 11.x and under, the `Robot Configuration` tab does not show the robot board.
**To Reproduce**
Steps to reproduce the behavior:
1. Go to `lab.open-ro…
-
Hi, thank for the great work!
When I tried to run your baseline evaluation script with:
```
TASK=SST-2 K=16 SEED=42 BS=8 LR=1e-5 MODEL=roberta-large bash finetune.sh
```
the script will break d…
-
在代码中如下面路径的文件,请问这些文件需要到哪里下载呀,这个没有找到呢
self.dict_path = 'E:/bert_weight_files/roberta/vocab.txt' self.config_path='E:/bert_weight_files/roberta/bert_config_rbt3.json' self.checkpoint_pat…
-
Hi @tridao,
I made an updated version of the HuggingFace RoBERTa implementation using your implementation of flash-attention v2 as a drop-in replacement for the legacy pytorch implementation (https…