reward Search Results - Githubissues

1000+ results
for reward

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

frozn/TipTac #356

Reward level for keystones always shows '-'

### Describe the bug ![image](https://github.com/user-attachments/assets/6675452a-826e-4aa1-9e00-c218065d1279) As can be seen in the screenshot, the `RL` value shows only `-`. I believe this is due …

jwidauer updated 2 days ago
1
dres-dev/DRES #496

Correct Video Reward for KIS

For KIS tasks it would be good to reward finding the correct video, because often a video contains similar or even equal scenes (for example in news videos, where there is a preview of a scene). Such…

klschoef updated 1 week ago
4
paritytech/polkadot-sdk #5894

[pallet-staking] Auto payout validator reward

## Context Validator payouts are lazy, and paged. Meaning for each era, and page of nominators (see [MaxExposurePageSize](https://paritytech.github.io/polkadot-sdk/master/pallet_staking/trait.Config.…

Ank4n updated 1 day ago
1
crowbartools/Firebot #2689

[Feature Request] Edit Reward Cooldown through Update Channe…

**Describe the solution you'd like** Currently, whether or not an effect has a cooldown and the duration of that cooldown are only manageable in the UI, it'd be nice to allow this to be updated via e…

Oceanity updated 1 week ago
1
NVIDIA/NeMo-Aligner #230

reward-bench for Reward Model

After train RM（step1-step3） with steerLM，I'll get reward model(.nemo), is it as the final reward model? Nemotron-4-340B technical report show the perfermance of reward model based on reward-bench …

lss11005 updated 1 month ago
1
MarkusBordihn/BOs-Daily-Rewards #34

Reward Screen

I'm trying to use the "OVERVIEW" rewards screen but I'm not able to, it stays on "COMPACT" even though I change it to both "DEFAULT" and "OVERVIEW", and I'm also not able to remove the list of special…

skannnnnnnnnnnn updated 2 weeks ago
1
luwo9/bomberman_rl #46

Reward shaping for kills

While not clear yet, it is likely that killing opponents or laying bombs next to them will rarely happen during normal training. In this case one might need to make use of suitably shaped rewards (tha…

luwo9 updated 3 weeks ago
3
luwo9/bomberman_rl #14

Reward shaping

It may be worth to collect some ideas here of what to reward: obvious should be: -coins collected (+) -opponents killed (+) -winning the game (+) -getting killed by a bomb(-) maybe also to thi…

RuneRost updated 1 month ago
3
ggerganov/llama.cpp #9203

Feature Request: Support for large reward-type models (Nemot…

### Prerequisites - [X] I am running the latest code. Mention the version if possible as well. - [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…

tblattner updated 1 week ago
1
vegaprotocol/specs #2317

Reward anti-whaling

Currently rewards suffer do not work as intended when applied to a range of markets, such as all markets with the same settlement assst. Users on long tail markets receive almost no rewards compare…

barnabee updated 1 week ago
1

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for reward

1000+ results
for reward