-
Hi, I was wondering how exactly the source and target vocabs for the Dual2Seq experiment are retrieved.
Are you using one of your get_vocab scripts?
Do you simply concatenate your sentence-words wi…
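For illustration only (this is not the repo's `get_vocab` script, just a sketch of the usual approach): a vocabulary is commonly built by counting whitespace tokens over the training sentences and applying a frequency cutoff.

```python
from collections import Counter

def build_vocab(sentences, min_count=1):
    # Count whitespace-separated tokens and keep those at or above the cutoff.
    counts = Counter(tok for s in sentences for tok in s.split())
    return sorted(t for t, c in counts.items() if c >= min_count)
```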
-
Hello all,
I'm using my own data to train a Transformer model for machine translation. I'm using the standard pipeline with t2t-datagen and t2t-trainer, and training the model works fine. In som…
-
I would like to ask for your advice on the following two questions.
1. DPO training does not seem to support DeepSpeed ZeRO. After manually integrating `DPOAlignerArguments` with the `FinetunerArguments…
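For context (this is not LMFlow's implementation, and the names are illustrative), the quantity a DPO trainer optimises is a preference loss over policy and reference log-probabilities of the chosen and rejected responses:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_lp, policy_rejected_lp,
             ref_chosen_lp, ref_rejected_lp, beta=0.1):
    # Log-ratio of the policy vs. the frozen reference model.
    chosen_ratio = policy_chosen_lp - ref_chosen_lp
    rejected_ratio = policy_rejected_lp - ref_rejected_lp
    # DPO: maximise log sigmoid of the scaled preference margin.
    return -F.logsigmoid(beta * (chosen_ratio - rejected_ratio)).mean()
```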
-
# Implementing Proximal Policy Optimisation
I've used some of the [PyTorch RFC](https://github.com/pytorch/rfcs/blob/master/README.md) template here for clarity.
**Authors:**
* @salmanmohammadi…
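As background for the RFC, the core of PPO is the clipped surrogate objective; a minimal sketch follows (an illustration of the standard formulation, not the proposed implementation):

```python
import torch

def ppo_clip_loss(new_logprobs, old_logprobs, advantages, clip_eps=0.2):
    # Probability ratio between the updated and behaviour policies.
    ratio = torch.exp(new_logprobs - old_logprobs)
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_eps, 1 + clip_eps) * advantages
    # PPO maximises the minimum of the two surrogates; negate to get a loss.
    return -torch.min(unclipped, clipped).mean()
```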
-
Hi
I want to train an RNNLM. My vocabulary size is 48603 (cutoff=100) or 72294 (cutoff=50). The data is stored in CTF format, which is suitable for sparse data. Training an RNN with a vocabulary size of 4860…
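The question concerns CNTK, but as a framework-agnostic illustration of a large-vocabulary RNNLM, here is a sketch in PyTorch (the sizes are the ones quoted above; nothing here is CNTK-specific):

```python
import torch
import torch.nn as nn

class RNNLM(nn.Module):
    def __init__(self, vocab_size=48603, embed_dim=256, hidden_dim=512):
        super().__init__()
        # sparse=True keeps embedding gradient updates proportional to the
        # tokens actually seen, which matters for large vocabularies.
        self.embed = nn.Embedding(vocab_size, embed_dim, sparse=True)
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens):
        h, _ = self.rnn(self.embed(tokens))
        return self.out(h)  # per-step logits over the vocabulary
```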
-
I am not able to figure out how to set the interactive model config to load `checkpoint_path='gs://scenic-bucket/ul2/ul220b/checkpoint_2650000'` from [here](https://github.com/google-research/google-r…
-
I'd like to see a feature that lets the user switch out the main display of a word with an alternative writing, or at least a way to disambiguate a word that has several alternate writings and meanings, b…
K-410 updated 9 years ago
-
### 🐛 Describe the bug
I'm using pytorch==2.3.0 and peft to train Llama 3 8B. When I run my code, it raises an error like:
```text
torch._amp_foreach_non_finite_check_and_unscale_(
RuntimeError:…
-
I see there is a pretrained model in the repo "https://github.com/rewicks/ersatz-models/tree/main/monolingual/en".
As I cannot find the tokenizer vocabulary, I am not sure how to finetune the existing…
-
When I run:
```shell
thumt-trainer \
  --input corpus.tc.32k.zh.shuf corpus.tc.32k.en.shuf \
  --vocabulary vocab.32k.zh.txt vocab.32k.en.txt \
  --model transformer \
  --validation newsdev2017.tc.32k.zh…
```