Closed Saood1810 closed 1 month ago
Hey,
I have no experience with wandb offline on a cluster. Sometimes I use it locally with their docker image but I guess that doesn't help for a cluster. You might have more luck on the wandb support instead of here.
Oh okay Alright!
Hi
I am using a GPU cluster and I training the PQL algorithm. The issue is the GPU cluster does not have access to the Internet, meaning I need to use Wandb in offline mode so I can save the plots locally and then sync them online after training. How would I do given that I can only enter the Experiment and Project name parameters for Wandb when initialising the algorithm? I tried being creative but nothing seems to work.
In the approach I did below, I initialised and in offline mode, and the code works fine (it doesn't try and make a connection to the Internet to log plots online). But now, after running, I have issues with the log file when I sync it online. So i don't think it is getting logged on properly. Any suggestions?
import mo_gymnasium as mo_gym import numpy as np import random from morl_baselines.multi_policy.pareto_q_learning.pql import PQL import os import wandb
SEEDS = [42,43,44]
env = mo_gym.make("deep-sea-treasure-concave-v0") ref_point = np.array([0, -25])
wandb.init(mode="offline",project="Research Project Logs") for seed in SEEDS: