penalty Search Results - Githubissues

1000+ results
for penalty

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ValveSoftware/Dota2-Gameplay #21525

Lobby wont load and gives me quo dodge penalty

### Description Start a turbo matchmaking, click accept, it makes the sound like im entering the lobby, and nothing happens. It looks like i can rejoin, but it just makes that dunk "lobby join" sound…

steffhall3000 updated 1 week ago
1
HULKs/hulk #546

Penalty Shootout Striker

Our shootout performance at the robocup 2023 competition ... leaves room for improvement. The focus here is the defending capabilities of the robot, especially jumping in the right direction at the …

oleflb updated 1 month ago
2
PyVRP/PyVRP #193

Penalty updates

We currently update penalties as follows: auto const diff = params.targetFeasible - feasPct; if (-0.05 < diff && diff < 0.05) // allow some margins on the difference return pen…

N-Wouda updated 2 months ago
9
OpenRLHF/OpenRLHF #236

adding length penalty to reward

Hi Team, While using the PPO pipeline we observe at times spikes in response length and were curious if any techniques related to length penalty is available or explored

karthik-nexusflow updated 1 day ago
2
ggerganov/llama.cpp #8971

Bug: uncached prompt is not used for penalty

### What happened? Sometimes the part of the initial prompt that should be considered for the penalties is ignored. Only the newly generated tokens are used for calculating penalty. For now I can ass…

z80maniac updated 5 days ago
2
hats-finance/Proof-Of-Humanity-V2-0xef0709445d394a22704850c772a28a863bb780b0 #103

Malicious Vouchers Can Dodge Penalties by Manipulating Chall…

**Github username:** -- **Twitter username:** -- **Submission hash (on-chain):** 0xe2f5045df3b2ba5395f8efc92f548f7ce92cca2348ac930c0418087cabfa8709 **Severity:** medium **Description:** **Descriptio…

hats-bug-reporter[bot] updated 1 week ago
3
piazzatron/anki-smart-notes #13

[Feature Request] Setting for KI-Model: Temperature, Frequen…

ChatGPT is providing settings to have more control over the execution of the promt. Maybe it is possible to make these settings available per promt- With these settings, you can control: 1. **Tem…

hienstorfer updated 6 days ago
5
whomwah/rqrcode_core #37

Mask pattern penalty rules wrong? Or old?

Hello! Thank you for this library and for your effort maintaining it for so long! In using this library, I have recently concluded that three of the four penalty rules for selecting a mask patt…

swifthand updated 4 days ago
3
haotian-liu/LLaVA #836

repetition penalty

### Question when i try to use repetition_penalty to avoid repeat answer, i met this error "cuda error:device-side assert triggered". After my debug, i found that the input_ids include -200 which is…

SuXuping updated 3 months ago
6
huggingface/trl #2012

`OnPolicyConfig`: Change `non_eos_penalty` to be more clearl…

### Feature request The `OnPolicyConfig` has a flag: `non_eos_penalty: bool = False`, which is described as: `"""whether to penalize responses that do not contain stop_token_id"""` I interpreted…

RylanSchaeffer updated 1 day ago
5

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for penalty

1000+ results
for penalty