-
Hi,
It seems that some of the prediction files cut long sentences short. For example, here is a long sentence in Hausa in the test file:
* https://github.com/masakhane-io/masakhane-ner/blob/ma…
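A quick way to spot such truncations is to compare per-sentence token counts between the test file and the prediction file; a minimal sketch, assuming CoNLL-style files (one token per line, blank line between sentences) and hypothetical file names:

```python
def sentence_lengths(path):
    """Count tokens per blank-line-separated sentence in a CoNLL-style file."""
    lengths, count = [], 0
    with open(path, encoding="utf-8") as f:
        for line in f:
            if line.strip():
                count += 1
            elif count:
                lengths.append(count)
                count = 0
    if count:
        lengths.append(count)
    return lengths

# "test.txt" and "pred.txt" are hypothetical paths.
for i, (t, p) in enumerate(zip(sentence_lengths("test.txt"), sentence_lengths("pred.txt"))):
    if p < t:
        print(f"sentence {i}: {t} tokens in test, only {p} in predictions")
```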
-
Hi,
I am trying to generate negations from non-negated sentences.
I used a dataset of simple pairs in the format “I have tea” => “I don’t have tea” to train an XLMR encoder-decoder model using the example p…
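For reference, one plausible layout for such pairs is a tab-separated source/target file; this is purely illustrative, not necessarily the format the example script expects:

```python
# Illustrative negation pairs; the real format depends on the training script.
pairs = [
    ("I have tea", "I don't have tea"),
    ("She likes coffee", "She doesn't like coffee"),
]
with open("train.tsv", "w", encoding="utf-8") as f:
    for src, tgt in pairs:
        f.write(f"{src}\t{tgt}\n")
```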
-
Could you please provide an example of the XNLI task with XLM-RoBERTa?
The current example (https://github.com/pytorch/fairseq/tree/master/examples/xlmr) is quite simple and only covers the single-sentence case.
Thank…
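In the meantime, here is a minimal sketch of sentence-pair inference with the fairseq hub interface; the head name is hypothetical, and the head would of course need to be fine-tuned on XNLI before predict() is meaningful:

```python
import torch

xlmr = torch.hub.load('pytorch/fairseq', 'xlmr.base')
xlmr.eval()

# XNLI is a sentence-pair task; encode() accepts additional sentences and
# joins them with the model's separator tokens.
tokens = xlmr.encode('Der Hund schläft.', 'The dog is sleeping.')

# 'xnli_head' is a hypothetical name; the head is randomly initialized here
# and must be fine-tuned before its predictions mean anything.
xlmr.register_classification_head('xnli_head', num_classes=3)
log_probs = xlmr.predict('xnli_head', tokens)
print(log_probs.argmax(dim=-1))
```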
-
TensorFlow 2.0 RC on Colab does not work. @yuefengz
```python
cluster_resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu='grpc://' + os.environ['COLAB_TPU_ADDR'])
tf.config.experimen…
```
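For reference, the full Colab TPU initialization sequence as documented for the TF 2.x releases of that era looks roughly like this (a sketch, not a verified fix for the RC):

```python
import os
import tensorflow as tf

cluster_resolver = tf.distribute.cluster_resolver.TPUClusterResolver(
    tpu='grpc://' + os.environ['COLAB_TPU_ADDR'])
tf.config.experimental_connect_to_cluster(cluster_resolver)
tf.tpu.experimental.initialize_tpu_system(cluster_resolver)
strategy = tf.distribute.experimental.TPUStrategy(cluster_resolver)
```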
-
I run into this error when training a model with encoder=bert on the Italian UD training set:
```
python -u -m supar.cmds.biaffine_dep train -p=exp/it_isdt.dbmdz-electra-xxl/model \
-c=config.ini -…
```
-
Yep, that's what the [classification heads](https://github.com/pytorch/fairseq/blob/master/fairseq/models/roberta/model.py#L273) do.
_Originally posted by @lematt1991 in https://github.com/pytorch/f…
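For context, a minimal sketch of registering and calling such a head through the fairseq hub interface; the head name and class count are illustrative, and the head is untrained until fine-tuned:

```python
import torch

roberta = torch.hub.load('pytorch/fairseq', 'roberta.base')

# Attach a pooled classification head on top of the encoder.
# 'my_task' and num_classes are illustrative, not a built-in head.
roberta.register_classification_head('my_task', num_classes=3)

tokens = roberta.encode('An example input sentence.')
log_probs = roberta.predict('my_task', tokens)  # random until the head is fine-tuned
```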
-
When training with a batch size of 32 (gradient accumulation steps = 1), training speed is approximately 6 it/s; however, when I increase the gradient accumulation steps to 4 or 8 (equivalent to a batch size of 128 a…
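Part of the discrepancy may be in what an "iteration" counts: in a typical accumulation loop, each optimizer update spans several forward/backward passes, as in this generic PyTorch sketch (all names illustrative, not the trainer in question):

```python
import torch
from torch import nn

model = nn.Linear(10, 2)                     # toy model; the pattern is what matters
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()
accum_steps = 4                              # effective batch = 32 * accum_steps

optimizer.zero_grad()
for step in range(100):
    x, y = torch.randn(32, 10), torch.randint(0, 2, (32,))
    loss = loss_fn(model(x), y) / accum_steps  # average the loss over the window
    loss.backward()                            # gradients accumulate in .grad
    if (step + 1) % accum_steps == 0:
        optimizer.step()                       # one update per accum_steps passes
        optimizer.zero_grad()
```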
-
Hi,
Good evening!
I read in your paper that Unicoder uses an XLM/Transformer structure.
…
-
**Describe the feature and the current behavior/state.**
I've been looking through the hanlp source code and documentation for a way to get the index of a token or an NER span in the original input text. …
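As a workaround (not hanlp's own API), character offsets can often be recovered by scanning the original text, assuming the tokenizer returns tokens verbatim and in order:

```python
def token_offsets(text, tokens):
    """Recover (start, end) character offsets by scanning left to right."""
    offsets, cursor = [], 0
    for tok in tokens:
        start = text.index(tok, cursor)  # raises ValueError if the tokenizer altered the token
        offsets.append((start, start + len(tok)))
        cursor = start + len(tok)
    return offsets

print(token_offsets("Barack Obama visited Paris.", ["Barack", "Obama", "visited", "Paris", "."]))
# [(0, 6), (7, 12), (13, 20), (21, 26), (26, 27)]
```

This breaks as soon as tokenization normalizes or rewrites tokens, which is presumably why a built-in offset API would be valuable.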
-
Hi, I'm interested in your great work and tried to reproduce your results for fine-tuning XLMR (with my own code). I got `92.6` on `Ritter11`, `93.4` on `ARK`, and `95.0` on `TB-v2`. I find that the re…