Closed MaxiBoether closed 3 months ago
( % to main) ( % to main)
Attention: Patch coverage is 55.55556%
with 20 lines
in your changes missing coverage. Please review.
Project coverage is 82.37%. Comparing base (
cb0be37
) to head (6017c37
).
Files | Patch % | Lines |
---|---|---|
...n/evaluator/internal/dataset/evaluation_dataset.py | 50.00% | 14 Missing :warning: |
.../trainer_server/internal/dataset/online_dataset.py | 64.70% | 6 Missing :warning: |
:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.
Can you piggyback deleting this log https://github.com/eth-easl/modyn/blob/55fef7c88b66a61d375d1555db46682157f3bbf5/modyn/supervisor/internal/pipeline_executor/pipeline_executor.py#L480 in this PR? This log is basically wrong because len(s.triggers)
is not the number of triggers we this time are processing.
Before, we had recovery logic based on reply indices. However, while working on storage, I realized those responses come non deterministically from multiple threads. Hence, we cannot rely on the ordering. We need to keep track of the sample IDs we already yielded. I changed the logic to just keep a list which is cheap to append to, and only convert to a set / hash table as soon as we failed once and we actually need to do many
in
checks.