Closed Shiruiruiwang closed 11 months ago
I noticed that you have set sample=64 in the shell file, which controls the training dataset only containing 64 instructions according to this code
train_data["train"] = train_data["train"].shuffle(seed=seed).select(range(sample)) if sample > -1 else train_data["train"].shuffle(seed=seed)
in https://github.com/SAI990323/TALLRec/blob/a10cdbcc13e47c9f86a669c3345825255664dc8b/finetune_rec.py#L212C5-L212C5
I am confused about why only using 64 samples of total around 20k samples (for book dataset)? Would this setting stay consistent for other comparison methods?
Sorry for my careless check in closed issues, now I have acquired the answer from #16 . Thanks for your excellent work!!
I noticed that you have set sample=64 in the shell file, which controls the training dataset only containing 64 instructions according to this code
in https://github.com/SAI990323/TALLRec/blob/a10cdbcc13e47c9f86a669c3345825255664dc8b/finetune_rec.py#L212C5-L212C5
I am confused about why only using 64 samples of total around 20k samples (for book dataset)? Would this setting stay consistent for other comparison methods?