-
Hello Authors,
Thank you for your incredible work and the comprehensive experiments presented in the paper.
I have a question regarding the implementation of attacks. Specifically, some attacks,…
-
The DEM scoring section is fairly outdated by now. Most tests just list the applicable DEM penalties on their own scoresheet. The percentages mentioned are not really applied there either.
-
## Problem
The Census only keeps track of all available peers, and their liveness is checked once per hour.
When peers are requested for a given content id, census filters out all peers whose ra…
-
### Motivation
Encourages nodes to stay up and penalizes those that are consistently down.
### Description
The idea is to utilize the health indexer to implement an **uptime score system** where…
-
While we have a working version of Pathfinder (#794, #832) now, its scoring of results is not meaningful. Lab Slack convo:
Jackson:
> [...] the context in which the scores are generated is comple…
-
### Enhancement Description
DRA supports the concept of "under specifying" a request. This gives the scheduler more flexibility to satisfy a request, increasing the likelihood of success in environ…
-
Instances that score `0` are a great place to look when figuring out what is going wrong with a solver, or when looking for ways to make a solver better.
For example, `0`-scoring examples will often s…
-
Items to draft/develop:
- changelog DB table/model. Possibility to use triggers, or some logic in a Python script with Django models.
- changelog file to be generated from DB at release and placed in…
-
**Purpose**
The purpose of this addition is to trial coppercore's state machine class by rewriting the scoring subsystem to use it.
**Project Scope**
- [ ] Rewrite Scori…
-
This issue tracks currently known problems in our scoring of SWE-bench. As well as false positives and false negatives, there are three types of failures. Cases where only our implementation has the f…