Open GenVr opened 1 year ago
Hi @GenVr! Can you show your training code as well alongside your config? There might be an error in how you passed the training data in. Thanks!
@maxreciprocate Regarding the dataset and train, I use this train() code:
trlx.train(
samples = [(text,output) for text,output in zip(ttv_ds['train']['text'],ttv_ds['train']['output'])],
rewards = labels,
eval_prompts=ttv_ds['validation']['text'][:16],
config = config,
)
Where:
samples = [(string, string), (string, string), ...] # list of tuples (string, string)
labels = [0,1,0,1...] # list of labels 0/1
samples = [string, string, ..] # list of strings
Thanks for your answer!
Hi, I'm trying an ILQL training with a gpt-j network trained with this code. I don't have this problem with the original pre-trained net, nor with a flan-xl.
This is my config:
Thanks.