-
Hello,
I am very interested in your research and am currently trying to run some experiments based on it. However, I encountered an issue while running the program from the HuggingFace_EncDec directo…
-
Hi, thank you for sharing such impressive work. I'm new to this topic. I ran into a problem when trying to run the code. I wrote the test code as you suggested, shown as follows:
from transformers import…
-
@patrickvonplaten
I have been trying to achieve a BLEU score of 31.7 (as reported in the blog and paper for the WMT en->de evaluation) using the Hugging Face model **google/bert2bert_L-24_wmt_en_de**, but I…
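For context, a minimal generation sketch for that checkpoint (the special-token arguments follow my reading of the model card; the beam-search settings that would actually be needed to reproduce 31.7 BLEU are not shown and are assumptions on my part):

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

# Special tokens as listed on the checkpoint's model card (assumed, not verified
# against the paper's exact evaluation setup).
tokenizer = AutoTokenizer.from_pretrained(
    "google/bert2bert_L-24_wmt_en_de", pad_token="<pad>", eos_token="</s>", bos_token="<s>"
)
model = AutoModelForSeq2SeqLM.from_pretrained("google/bert2bert_L-24_wmt_en_de")

sentence = "Would you like to grab a coffee with me this week?"
# add_special_tokens=False: the checkpoint was trained without BERT's [CLS]/[SEP].
input_ids = tokenizer(sentence, return_tensors="pt", add_special_tokens=False).input_ids
output_ids = model.generate(input_ids)[0]
print(tokenizer.decode(output_ids, skip_special_tokens=True))
```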
-
Hi,
I would like to know whether the TSDAE procedure is advisable for a token classification task, or is it better to go with MLM?
Can the TSDAE training code also be used with any transformer (en…
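For reference, a minimal sketch of how TSDAE training is typically wired up in sentence-transformers with an arbitrary HF encoder (the checkpoint name and hyperparameters below are placeholders, not recommendations for token classification):

```python
from torch.utils.data import DataLoader
from sentence_transformers import SentenceTransformer, models, datasets, losses

train_sentences = ["..."]  # placeholder: your unlabeled, in-domain sentences

# Any Hugging Face encoder checkpoint should be usable as the word-embedding model.
word_embedding_model = models.Transformer("bert-base-uncased")
pooling_model = models.Pooling(word_embedding_model.get_word_embedding_dimension(), "cls")
model = SentenceTransformer(modules=[word_embedding_model, pooling_model])

# The dataset applies the denoising (token deletion) on the fly.
train_dataset = datasets.DenoisingAutoEncoderDataset(train_sentences)
train_dataloader = DataLoader(train_dataset, batch_size=8, shuffle=True)
train_loss = losses.DenoisingAutoEncoderLoss(
    model, decoder_name_or_path="bert-base-uncased", tie_encoder_decoder=True
)

model.fit(
    train_objectives=[(train_dataloader, train_loss)],
    epochs=1,
    weight_decay=0,
    scheduler="constantlr",
    optimizer_params={"lr": 3e-5},
    show_progress_bar=True,
)
```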
-
I have trained an EncoderDecoderModel from Hugging Face to do an English-German translation task. I tried to overfit a small dataset (100 parallel sentences), and used `model.generate()` then `tokenizer.d…
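For reference, a minimal sketch of the generate-then-decode check (the tokenizer choice, checkpoint path, and generation settings below are assumptions, not the original setup):

```python
import torch
from transformers import BertTokenizerFast, EncoderDecoderModel

tokenizer = BertTokenizerFast.from_pretrained("bert-base-multilingual-cased")  # assumed tokenizer
model = EncoderDecoderModel.from_pretrained("./my-en-de-checkpoint")  # hypothetical path

# EncoderDecoderModel needs these set so generation starts from the right token.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.pad_token_id = tokenizer.pad_token_id

inputs = tokenizer(["I love Berlin."], return_tensors="pt", padding=True)
with torch.no_grad():
    output_ids = model.generate(
        inputs.input_ids,
        attention_mask=inputs.attention_mask,
        max_length=64,
        num_beams=4,
        early_stopping=True,
    )
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True))
```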
-
https://arxiv.org/pdf/1907.12461.pdf
-
Hey guys. I get no benefit from batching (no speedup whatsoever) with Sentence-Transformers.
I would love your opinion on the following situation:
I run inference on **'bert-base-nli-mean-tokens'** mo…
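For reference, a minimal timing sketch of how batching is usually compared (device, sentence list, and batch sizes below are placeholders, not the original setup):

```python
import time
from sentence_transformers import SentenceTransformer

# On CPU, batching often gives little speedup because each matmul already uses all cores;
# on GPU the gap between batch_size=1 and larger batches is usually substantial.
model = SentenceTransformer("bert-base-nli-mean-tokens", device="cuda")
sentences = ["This is an example sentence."] * 2048  # placeholder data

for batch_size in (1, 8, 32, 128):
    start = time.time()
    model.encode(sentences, batch_size=batch_size, show_progress_bar=False)
    print(f"batch_size={batch_size}: {time.time() - start:.2f}s")
```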
-
## Environment info
- `adapter-transformers` version: 3.2.1
- Platform: Linux-6.2.0-27-generic-x86_64-with-glibc2.37
- Python version: 3.10.9
- PyTorch version (GPU?): 1.13.1 (GPU)
## De…
-
Hi,
Thanks for providing and presenting this nice work.
As mentioned in your paper, your attention pattern for modeling long sequences can be plugged into any pretrained transformer model.
I wond…
-
# 🚀 Feature request
This "Good second issue" should revisit some of the problems we were having with FP16 for `T5ForConditionalGeneration`: https://github.com/huggingface/transformers/issues/4586 a…