-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
[2024-06-07 10:17:14,980] [INFO] [real_accelerator.py:191:get_accelerator] Setting ds_accelerator t…
-
I'm interested in your paper, 'Input Combination Strategies for Multi-Source Transformer Decoder'. Would you mind telling me how I can reproduce this work? I want to cite this paper. Thanks.
-
Hi @themanojkumar ,
I was trying to fine-tune the BioGpt model for my QA task. I would like to use a fast tokenizer, so that I could use the offsets_mapping to know from which w…
-
### Question
Hi,
I am working on using embeddings from a pre-trained model which is not published. When I try to import it as a TransformerWordEmbedding, it fails with this error message:
```
…
-
Hi, I just noticed another cool class in OpenCV that I thought deserved its own issue. The class is called "BOWTrainer", i.e. BagOfVisualWordsTrainer. It is a base class for a traine…
-
Running the bash command from the README for Pairwise Knowledge Fusion:
FuseLLM/FuseChat/train/trainer.py", line 121, in compute_loss
if self.args.distill_loss_type == "ce":
loss_lm = cross_entropy…
-
It looks like in the vocab, the preferred method of defining types of (in this case) relations is to create a subclass which then becomes part of the vocabulary. @elf-pavlik has examples: Membership …
-
When I reached the "iterative rank-aware training" step, I got the following output:
`2020-04-16 01:14:58,580 - INFO - allennlp.training.trainer - Beginning training.
2020-04-16 01:14:58,580 - INFO - …
-
The model conversion was not performed at runtime, which produced an error.
-
* The guide gives an example of a training config, but not an exhaustive list of the fields that can be included.
* The `Vocabulary` and `Trainer` fields are general, while for the model and dat…