Open zhaoyi11 opened 3 months ago
@zhaoyi11 I was going to email you all and ask you if you would be down to integrate your labeling pipeline in the repo 😂 So to answer your question, yes! I would be more than happy to help you, please feel free to submit a PR! Love the paper btw!
Thanks @kevinzakka! Great, I will prepare the PR asap.
Hello,
Thanks for the great work! We recently released a paper named RP1M (https://arxiv.org/abs/2408.11048, cc @clthegoat), which includes a reward term based on optimal transport that enables the agent to play MIDI files without human fingering annotations. Would it be possible to integrate the method into this repo, so that people can conveniently use RoboPianist to play more songs beyond the PIG dataset?
Here are some comparison results from the paper as well as a short plan for the modification of the code. Please let me know your thoughts.
Results:
Modifications: I plan to change these lines https://github.com/google-research/robopianist/blob/d9cde23e46cb30ebb8eeebb375a9c52191238a30/robopianist/suite/tasks/piano_with_shadow_hands.py#L135-L136 as:
where `_compute_ot_reward` is defined as:
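For readers who want a feel for the idea, here is a rough sketch only. It is not the code proposed for the PR or the implementation from the paper. It assumes the reward is built from the cheapest one-to-one matching between the keys that should be pressed and the available fingertips. The names `ot_fingering_distance` and `ot_reward`, the `scale` parameter, and the exponential shaping are all hypothetical choices made for illustration. The matching is brute-forced with the standard library to keep the example self-contained; a real implementation would more likely use a proper solver such as `scipy.optimize.linear_sum_assignment`.

```python
import itertools
import math


def ot_fingering_distance(fingertips, keys):
    """Cheapest one-to-one matching of target keys to fingertips.

    fingertips: list of (x, y, z) fingertip positions.
    keys: list of (x, y, z) positions of the keys that should be pressed.
    Returns (total_distance, assignment), where assignment[i] is the index
    of the fingertip matched to keys[i].
    """
    if not keys:
        return 0.0, ()
    if len(keys) > len(fingertips):
        raise ValueError("more keys to press than fingertips available")

    best_cost, best_assignment = math.inf, ()
    # Brute force over every way to give each key its own fingertip.
    # Fine for a handful of keys; use a real assignment solver otherwise.
    for perm in itertools.permutations(range(len(fingertips)), len(keys)):
        cost = sum(math.dist(keys[i], fingertips[j]) for i, j in enumerate(perm))
        if cost < best_cost:
            best_cost, best_assignment = cost, perm
    return best_cost, best_assignment


def ot_reward(fingertips, keys, scale=0.05):
    """Map the matching cost to (0, 1]. The exp shaping is an assumption."""
    cost, _ = ot_fingering_distance(fingertips, keys)
    return math.exp(-cost / scale)


if __name__ == "__main__":
    tips = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
    targets = [(2.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
    print(ot_fingering_distance(tips, targets))
    print(ot_reward(tips, targets))
```

The point of the matching step is that no per-note finger label is needed: at every timestep the reward pulls whichever fingertips are currently cheapest to move toward the active keys, and it is highest when each active key already has a fingertip on it.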