Fine Tuning Dataset Format

Hi there, I am trying to fine-tune the GPT-J-6B model on my custom dataset, and I need to figure out the correct format for the dataset. So far, when generating tfrecords, I have tried the following formats:

I applied these formats to the whole dataset. The resulting model's outputs suggested that it was unable to recognize the "<|endoftext|>" or "#####" separator token.
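
For reference, here is a minimal sketch of what "applying a separator to the whole dataset" can look like before the tfrecords are generated. The function name `join_with_separator` and the sample documents are hypothetical, made up for illustration; this is not the repo's own preprocessing script, and it does not reproduce the exact formats I tried.

```python
from typing import Iterable, List

# Separator string mentioned above; "#####" can be passed instead.
EOT = "<|endoftext|>"


def join_with_separator(docs: Iterable[str], sep: str = EOT) -> str:
    """Concatenate documents into one text stream, appending `sep` after each.

    Hypothetical helper for illustration only.
    """
    pieces: List[str] = []
    for doc in docs:
        text = doc.strip()
        if not text:
            continue  # skip empty documents so no bare separators are emitted
        if sep in text:
            # A separator inside a document would make boundaries ambiguous.
            raise ValueError("separator already occurs inside a document")
        pieces.append(text + sep)
    return "".join(pieces)


# Made-up sample data, not taken from the real dataset.
sample_docs = ["first example", "second example", "   "]
stream = join_with_separator(sample_docs)
```

Note that this only controls where the separator string appears in the text. Whether the model sees it as a single token depends on the tokenizer used when the tfrecords are built, which this sketch does not check.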

Any information on this would be helpful. Thank you.

kingoflolz / mesh-transformer-jax

Fine Tuning Dataset Format #193