Closed puraminy closed 3 years ago
okay I found the solution!
okay I found the solution!
Hi! I am just wondering do you mind sharing the solution you had previously? Thank you!
I can't recall it, but you may check if you use the following code in finetune.py
can work or not
model = t5.models.MtfModel (
tpu=None,
model_dir=model_dir,
model_parallelism=8,
batch_size=batch_size,
sequence_length={"inputs": 128, "targets": 128},
mesh_shape="model:1,batch:1",
mesh_devices=["gpu:0"],
learning_rate_schedule=learning_rate,
save_checkpoints_steps=save_checkpoints_steps,
keep_checkpoint_max=n_checkpoints_to_keep,
iterations_per_loop=100,
)
model.finetune(
mixture_or_task_name=mixture,
pretrained_model_dir=pm_dir,
finetune_steps=n_steps,
split=split,
)
Hello, I don't have access to Google Cloud! and I want to do some experiments using my Cuda GPU. What should I do? Thanks