Open marcocst opened 5 months ago
Thank you for your interest in our work! The results in the paper are from the test set.
Thank you!
@marcocst were you able to run inference on the test set?
@sborse3 @marcocst Sorry for not making it clear earlier. The results in the paper are from the test set. But this test set differs from the test part of the original dataset from Huggingface. We partition the dataset as follows:
For small datasets (n_samples < 10K), we divide validation set to half, use one half as test set and one half as validation set. For larger datasets (n_samples > 10K), we divide training set into 1K as validation and the rest as training set, keeping the original validation set as the test set. You can find the specific implementation in the get
function within the SoRA/src/processor.py
file (Lines 87-106).
Please don't hesitate to contact us if you have further questions or need more assistance!
May I ask if the results in the paper are from the eval set or the test set? Please let me know