Add sections explaining how to choose the target accuracy, eval frequency, and number of submission runs, along with things to keep in mind when generating RCPs for the first time during reference development.
Use learnings from previously added benchmarks to help make similar decisions in the future.