-
Hi there,
Thank you for the excellent work and for publishing the code base.
I am attempting to reproduce the retrieval performance of BGE-base as shown in Table 3 but have encountered some issu…
-
-
Page 26:
> OGC®Observations and Measurements Standard (O&M).
It's called _Observations, Measurements, and Samples (OMS)_ now: https://www.ogc.org/standard/om/
Also, in the sentence following, i…
-
### Duplicates
- [X] I have searched the existing issues
### Summary 💡
Evaluate against the SWE-bench benchmark:
https://github.com/princeton-nlp/SWE-bench
### Examples 🌈
_No response_
### Moti…
-
Hello - very interesting paper!
I noticed that your JSONL file is incomplete. Would you be able to add the RepoUnderstander results and logs from your experiments? I am especially interested in se…
-
It would be interesting to see if/how `blar` performs against the SWE-Bench benchmarks:
- https://www.swebench.com/
- https://github.com/princeton-nlp/SWE-bench
- > [ICLR 2024] SWE-Bench: Can L…
-
Let's make @swe-yc happy and put it in our "backlog":
_Freedom is just one example. I understand not much ppl use freedom and maybe little people know why i like it: apart from looks, and many opti…
-
It takes a lot of time to run it now. Thanks!
-
# Background
[SWE-bench](https://www.swebench.com/) has "assisted" and "unassisted" scores. Assisted means you are told what files to modify. Devin is presumed to have the SOTA score of 14% unassis…
-
Once we have daily sensor readings, we will also want daily SWE predictions. This will involve taking the best model found during #3 and deploying it to production.