Closed by xiaxiaxiatengxi 5 months ago
Yes, it is available via the Google Drive link provided in the README.
Is _gpt2_bc_workshophistory.pt the SFT model for WebShop? Does that mean I need to load this model to train ArCHer in the RL step? How can I train a new SFT model with a different base LLM, such as Llama 2? Thanks.
In this work, was an SFT model trained beforehand for the WebShop training?