xlnet Search Results - Githubissues

1000+ results
for xlnet

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

synlp/SRL-MM #2

How much VRAM needed?

@yuanheTian friendly ping How much GPU GBs needed for BERT? for XLnet? I'm talking about training not inference.

LifeIsStrange updated 1 year ago
2
Hanlard/Transformer-based-pretrained-model-for-event-extraction #12

Segmentation fault (core dumped)

python DataLoadAndTrain.py --LOSS_alpha=1 --lr=1e-5 --l2=1e-5 --early_stop=5 --PreTrain_Model="Gpt2" --batch_size=16 2023-03-01 10:52:17.583697: W tensorflow/stream_executor/platform/default/dso_load…

wangbofan updated 1 year ago
1
utterworks/fast-bert #45

Runtime Crashes on Google Colab

I was trying to create Databunch on Google Colab, using the sentiments140 twitter dataset from google colab. But no matter what batch size I use the GPU always crashes. I tried all batch sizes from 2 …

dxganta updated 5 years ago
3
lessw2020/Ranger-Deep-Learning-Optimizer #13

Did you try to fine-tune transformers LM with Ranger?

Recent transformers architectures are very famous in NLP: BERT, GPT-2, RoBERTa, XLNET. Did you try to fine-tune them on some NLP task? If so, what was the best Ranger hyper-parameters and learning rat…

avostryakov updated 1 year ago
4
shenwzh3/DialogXL #6

请问有在中文xlnet上实验过吗？

chinese-xlnet-base

geolvr updated 1 year ago
1
thunlp/OpenAttack #221

An error is always reported when the victim model is xlnet_s…

I want to try to use the existing xlnet_sst as the attacked model. Unfortunately, it keep reporting errors. Can you check and try this example？thanks! ![image](https://user-images.githubusercontent.c…

wei1826676931 updated 3 years ago
2
zihangdai/xlnet #79

Question about `b_target`

https://github.com/zihangdai/xlnet/blob/master/data_utils.py#L316 I wonder why this is `b_begin: b_end + 1`, not `b_begin+1: b_end + 1`. Also, what is mean? https://github.com/zihangdai/xlnet…

graykode updated 5 years ago
2
qingyujean/document-level-classification #2

Help

您好，请问在运行xlnet_hierarchical_attn模型时出现TypeError: linear(): argument 'input' (position 1) must be Tensor, not str怎么解决？

wwhss updated 10 months ago
9
huggingface/swift-coreml-transformers #4

is there any way to convert other pytorch transformers to ml…

Here the model generation shows how to convert gpt2 model specifically to mlmodel. How to apply this to other models like pretrained bert and xlnet? please help.

harold1505 updated 5 years ago
2
microsoft/OmniParser #76

Unrecognized model in weights/icon_caption_florence.

I run the program in pycharm, one error listed below occurs, how to solve it? ValueError: Unrecognized model in weights/icon_caption_florence. Should have a `model_type` key in its config.json, or co…

xiaoscofield updated 3 weeks ago
3

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for xlnet

1000+ results
for xlnet