Hi, thanks for your fantastic work! I am trying to understand how to use the ReFL method to train a new reward model with a custom dataset. Specifically, I am unsure how to structure my dataset and integrate it into the current codebase. The current code reads image files in a way that is not immediately clear to me.
The current code reads image files in a way that seems to rely on specific identifiers rather than absolute file paths. This makes it unclear how to apply my own dataset. Thanks if you can give a guidance on how to ensure my dataset is correctly loaded and processed for training.
Hi, thanks for your fantastic work! I am trying to understand how to use the ReFL method to train a new reward model with a custom dataset. Specifically, I am unsure how to structure my dataset and integrate it into the current codebase. The current code reads image files in a way that is not immediately clear to me.
The current code reads image files in a way that seems to rely on specific identifiers rather than absolute file paths. This makes it unclear how to apply my own dataset. Thanks if you can give a guidance on how to ensure my dataset is correctly loaded and processed for training.
Thanks a lot!