-
When I try to run this command:
`python src/train.py -mode test_text -text_src data/sum_twitter_sample.txt -test_from bertext_cnndm_transformer.pt`
I get this error:
```
[2020-11-18 17:36:5…
-
## ❓ Questions and Help
Thanks for releasing the mbart models! Referring to [#1758 ](https://github.com/pytorch/fairseq/issues/1758), I reproduced the same results, which are basically close to the r…
-
But the processing is the same as in I.1, and the files should be under 'data/wiki/txt' rather than 'data/wiki'.
The script is confusing.
```
# build the training set for BPE tokenization (50k cod…
-
@pjox and I are working on a model trained with RoBERTa and the BPE tokenizer, in particular [zeldarose](https://github.com/LoicGrobol/zeldarose), which uses slightly different special tokens.
…
-
Hi @thinhlpg, I'm curious how many epochs you fine-tuned the model for to achieve this performance level.
-
## Description
When running distributed training (multi-instance with each instance having a single GPU) with sparse gradients (produced by nega…
-
Hi. Thanks for the open-sourced models! This is a major step toward the democratization of LLMs.
I'm trying to fine-tune `openlm-research/open_llama_7b` using LoRA.
I first tried the cod…
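For context on what LoRA fine-tuning does under the hood, here is a minimal pure-Python sketch of the low-rank update rule. All function names and shapes below are illustrative assumptions for this sketch, not the API of any fine-tuning library: the pretrained weight `W` stays frozen while only the small factors `A` and `B` are trained.

```python
# Minimal sketch of the LoRA update rule (illustrative, not a library API).
# Matrices are plain lists of rows so the example is self-contained.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def lora_forward(x, W, A, B, alpha, r):
    """Compute x @ (W + (alpha / r) * A @ B).

    W (d x k) is the frozen pretrained weight; only the low-rank
    factors A (d x r) and B (r x k) are trained, so the trainable
    parameter count drops from d*k to r*(d + k).
    """
    scale = alpha / r
    delta = [[scale * v for v in row] for row in matmul(A, B)]
    W_eff = [[w + d for w, d in zip(w_row, d_row)]
             for w_row, d_row in zip(W, delta)]
    return matmul(x, W_eff)

# Tiny worked example with rank r = 1 and a 2x2 identity as W:
y = lora_forward(x=[[1, 0]],
                 W=[[1, 0], [0, 1]],
                 A=[[1], [1]],
                 B=[[1, 1]],
                 alpha=1.0, r=1)
# y == [[2.0, 1.0]]: the frozen output [1, 0] plus the low-rank delta [1, 1].
```

With rank `r` much smaller than the hidden size, this is why LoRA makes fine-tuning a 7B model feasible on modest hardware: the optimizer only tracks the `A`/`B` factors.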
-
### Description
Hi, I defined a multitask learning problem by fusing PTB and IMDB to test the mixing of problems with different modality types. But when the training reaches line 444 of multi…
-
Preparing dialog data in /var/lib/tf_seq2seq_chatbot/data
Creating vocabulary /var/lib/tf_seq2seq_chatbot/data/vocab20000.in from data /var/lib/tf_seq2seq_chatbot/data/chat.in
Traceback (most recent…
-
Hi,
I have been trying to run the WMT demo on TPUv2 or TPUv3 VMs, but I keep encountering a `bad_alloc` error before training even starts. It seems that the output also says that no TPU backend is…