reward-design Search Results

1000+ results
for reward-design

huggingface/trl #2156

[CGPO] Add support for Constrained Generative Policy Optimiz…

### Method description Constrained Generative Policy Optimization was introduced by Meta in a recent paper (https://arxiv.org/pdf/2409.20370). It seems to outperform PPO and DPO and is specifically…

gaetanlop updated 1 month ago · 2 comments

Flex-NFT-Marketplace/Flex-Marketplace-Contract #122

[Experimental] [POC] ERC-5173 NFT Future Rewards (nFR)

Create and work inside folder `experimental/erc_5173_future_rewards`: nFR allows owners to benefit from future price appreciation even after selling their tokens, without the need for market predic…

0xandee updated 5 days ago · 15 comments

polkadot-fellows/RFCs #131

New consensus mechanism

# Proposal: Proof-of-Consciousness Consensus Mechanism **Summary:** This proposal introduces a new consensus mechanism called Proof-of-Consciousness (PoC). Under PoC, only users who actively par…

AminMemariani updated 6 days ago · 9 comments

chromiecraft/chromiecraft #7500

[Sunwell Plateau] Plans/Patterns missing from Trash loot

### What client do you play on? enUS ### Faction Both ### Content Phase: 70 ### Current Behaviour [Design: Amulet of Flowing Life](https://wowgaming.altervista.org/aowow/?item=35202) ID:35202 …

amed80 updated 21 hours ago · 1 comment

Dooders/Experiments #18

Feature: Implement Hybrid Attack Decision-Making System with…

Develop a hybrid attack decision-making system for agents that combines a Q-learning neural network (QNN) and rule-based constraints. This system will allow agents to dynamically decide between aggres…

csmangum updated 3 days ago · 1 comment

GoodDollar/GoodCollective #231

Create new GoodCollective

## Business Description An interface to allow anyone to create a GoodCollective pool smart contract without interacting with an explorer. The user will be able to complete the following steps: 1) …

decentralauren updated 1 week ago · 3 comments

stacks-network/stacks-core #5292

Fix typos in the codebase

Care should be taken to not fix typos in DB column names or similar. desierialized -> deserialized • contruct -> construct • DkgPublicshares -> DkgPublicShares • signaure -> signature • atomicb…

hstove updated 1 month ago · 2 comments

codex-storage/codex-contracts-eth #154

Additional reward for host repairing slot

As described in [the design doc](https://github.com/codex-storage/codex-research/blob/41c4b4409d2092d0a5475aca0f28995034e58d14/design/marketplace.md#repairs), the node that repairs a slot should be a…

AuHau updated 3 months ago · 5 comments

Edouard360/Halite-Python-RL #17

Change state and reward design

Different shapes for looking around the square might be possible. * Maybe use a mask ? In tensorflow or in python * For now the game state is 27 number, and it is **hard-coded**, which is bad for …

Edouard360 updated 7 years ago · 2 comments

GarimaSingh0109/WasteManagment #359

[Feature] Waste Management through Reinforcement Learning te…

### Description The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency. Environm…

Panchadip-128 updated 1 week ago · 3 comments
