Open aflah02 opened 2 years ago
You can download the preprocessed Yelp dataset and format your custom dataset following the instructions in the READMEs.
Please feel free to ask if you have any further questions.
@hzhwcmhf Thanks for the instructions!
@hzhwcmhf Can the test files be run without multiple human references as well? The paper mentions Luo et al. (2019) for the Yelp dataset, since they provided multiple references, but there is no such mention for GYAFC. I don't have multiple human references, so I would like to know whether the code already handles single references automatically or whether I need to make the changes manually.
Hi, @aflah02
First, we also use multiple human references for GYAFC. You can find the references here. Multiple references are recommended when evaluating style transfer models since they cover more possible transferred phrases, which leads to more reliable results.
Second, it should be OK if your test files contain only one reference per sample. For example, the test file can be
ever since joes has changed hands it 's just gotten worse and worse .
ever since joes has changed hands it 's gotten better and better .

there is definitely not enough room in that part of the venue .
there is so much room in that part of the venue

...... (NOTE: THE BLANK LINE IS REQUIRED)
(If it does not work, please tell me. I will figure out the problem.)
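If your data is a list of source sentences with one (or more) references each, the layout described above can be produced with a few lines of Python. This is only a sketch based on the format described in this thread (source line, then reference line(s), then a blank line); the function name `format_test_samples` is hypothetical and is not part of the repository.

```python
from typing import Iterable, List


def format_test_samples(sources: Iterable[str], references: Iterable[List[str]]) -> str:
    """Build test-file text: one source line, its reference line(s), then a blank line.

    Hypothetical helper; the layout is assumed from the description in this thread.
    """
    blocks = []
    for src, refs in zip(sources, references):
        if not refs:
            raise ValueError("each sample needs at least one reference")
        lines = [src.strip()] + [r.strip() for r in refs]
        # The trailing empty line marks the end of the sample and is required.
        blocks.append("\n".join(lines) + "\n\n")
    return "".join(blocks)


text = format_test_samples(
    ["there is definitely not enough room in that part of the venue ."],
    [["there is so much room in that part of the venue"]],
)
```

Writing `text` to your test file with `open(path, "w", encoding="utf-8")` should give a file with a single reference per sample.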
Moreover, you can change the format of the input file here, where SentenceDefault indicates a single line, and SessionDefault indicates multiple lines ending with an empty line.
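To check a custom file before running the evaluation, the two field types can be mimicked with a small reader: the first line of each block is treated as the sentence and the remaining lines up to the empty line as the references. This is an illustrative sketch only, not the repository's loader; the function name `parse_samples` and the keys `sent` and `ref` are assumptions.

```python
from typing import Dict, List


def parse_samples(text: str) -> List[Dict[str, object]]:
    """Split text into samples: first line is the sentence, the rest are references.

    Hypothetical reader that mirrors the "one line" + "lines until an empty line"
    layout described above. It is more lenient than a real loader may be: a
    missing final blank line is tolerated here.
    """
    samples: List[Dict[str, object]] = []
    block: List[str] = []
    # The appended "" acts as a sentinel so the last block is always flushed.
    for line in text.splitlines() + [""]:
        line = line.strip()
        if line:
            block.append(line)
        elif block:
            samples.append({"sent": block[0], "ref": block[1:]})
            block = []
    return samples


example = "a b .\nc d .\n\ne f .\ng h .\ni j .\n\n"
parsed = parse_samples(example)
```

Each entry in `parsed` holds one sentence and a list of references, so a sample with an empty `ref` list points to a missing reference line in the file.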
Hey! Great Paper!! Can you share some instructions for formatting a custom dataset as well?