Closed DavidKorczynski closed 1 week ago
Generated using [far-reach-low-coverage,jvm-public-candidates,easy-params-far-reach]
and max 6 targets per oracle
/gcbrun exp -n dk-test-jvm3234 -m vertex_ai_gemini-1-5 -b jvm-small
Experiment looks good
/gcbrun exp -n dk-test-infra5205 -m vertex_ai_gemini-1-5 -b jvm-small -i
Small experiment is looking great https://llm-exp.oss-fuzz.com/Result-reports/ofg-pr/2024-07-11-445-dk-test-infra5205-jvm-small/index.html, let's do a full run
/gcbrun exp -n dk-test-infra5209 -m vertex_ai_gemini-1-5 -b jvm-all -i
Changes the JVM benchmarks to be split into three buckets: 1) jvm-all: all java projects in oss-fuzz 2) jvm-medium: a smaller set of ~20 projects, which is useful for testing semantic changes in prompts without needing to check all projects. 3) jvm-small: set of three projects that can be used to test changes in infrastructure to ensure no infra regressions.