Closed Rookie-Kai closed 6 months ago
Dear Rookie-Kai,
Thanks for your attention to our work, we have reproduced and fixed the reported bug.
This bug is caused by our inconsistent Read and Write implementation of Sampler in the pre-compute process for perplexity statistics. We have fix the bug and update the latest code, which has tested by a small series of cases.
Please pull the latest code and check whether the problem is solved. We apologize for the issues the reported bug leading to, and please feel free to talk to us for any other issues.
PhealenWang, Yulan-GARDEN Team
As no further comments are given, this issue will be set as completed. Please feel free to reopen it if you have any other questions.
Hello, author, thank you very much for your open source, which helps me a lot. But I encountered some problems in using it.
When PPL is turned on, bug always causes the program to break, reminding me that my Input path is a folder. Examples of errors are as follows:
Finally, an error was reported.
ValueError: Instruction "train" corresponds to no data!
FileNotFoundError: Directory /mnt/afs/Data_preprocess/Yulan-GARDEN/output/data/.dedup is neither a
Datasetdirectory nor a
DatasetDictdirectory.
This happens only when PPL is turned on, and the program can run normally when PPL is closed. In addition, I have updated the latest code.