freelb Search Results - Githubissues

28 results
for freelb

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

zhuchen03/FreeLB #12

Having issues with training RoBERTa. Loss not decreasing

Hi！Thanks for your great repo. I tried the script in fairseq-RoBERTa/launch/FreeLB/rte-fp32-clip.sh and used the same setting as that in Issue #11 . ``` # run_exp GPU TOTAL_NUM_UPDATES …

GodXuxilie updated 4 years ago
2
zhuchen03/FreeLB #11

Reproducing results from the paper with roberta using fairse…

Hi! Thanks for this repository. I've been trying to reproduce the results from the paper but ran into some problems. I tried the script in `fairseq-RoBERTa/launch/FreeLB/rte-fp32-clip.sh` which I w…

bminixhofer updated 4 years ago
5
zhuchen03/FreeLB #10

FreeLB didn't use the original training samples?

In this [code](https://github.com/zhuchen03/FreeLB/blob/master/huggingface-transformers/examples/run_glue_freelb.py#L229), if adv_init_mag > 0, model will only be trained on adversarial examples? I …

YawYoung updated 4 years ago
2
zhuchen03/FreeLB #8

Does anyone meet the Nan error during the end epochs of trai…

First thanks for your wonderful work. Does anyone meet the Nan error during the training-end epoch? I embedding FreeLB as a plugin format(without handle dropout_mask): freelb.attack() fr…

PantherYan updated 4 years ago
5
zhuchen03/FreeLB #4

Is it still working with update_freq > 1?

In fairseq implementation, the "update_freq" configuration (from the original fairseq code) specifies how often the optimizer updates model parameters. when update_freq > 1, it will accumulate gradien…

hitvoice updated 4 years ago
2
zhuchen03/FreeLB #9

Would you please release the hyper-parameters for FreeLB bas…

There are only 4 tasks' hyper parameters in this [file](https://github.com/zhuchen03/FreeLB/blob/master/huggingface-transformers/launch/run_glue.sh), would you please release others?

FFYYang updated 4 years ago
6
zhuchen03/FreeLB #3

'AlbertForSequenceClassification' object has no attribute 'e…

hi, when i used the example from `huggingface-transformers/examples/run_glue_freelb.py` i met the error as this `'AlbertForSequenceClassification' object has no attribute 'encoder'` it seems the cod…

trueto updated 4 years ago
2
namisan/mt-dnn #126

Clarification of reported number in SMART paper

Hey, Great work on the SMART paper. I have a very quick questions about numbers reported in Table 2 of the paper. 1) Is `RTE` model on the last row of table 2, initialized from `MNLI` checkpoin…

ngoyal2707 updated 4 years ago
3

上一页 1...1 2 3...3 下一页

28 results for freelb

28 results
for freelb