-
In PPOv2's `trainer.train()`, under the `# 4. compute rewards` step, `sequence_lengths_p1` is used when computing the rewards index.
`actual_end = torch.where(sequence_lengths_p1 < rewards.size(1), sequence_lengths…
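For context, the `torch.where` line above clamps each end index to the last valid column of the rewards tensor. A minimal pure-Python sketch of that semantics (the function name and the fallback to `num_cols - 1` are illustrative assumptions, not the actual TRL code):

```python
def actual_end_indices(sequence_lengths_p1, num_cols):
    """Clamp each sequence end index (sequence length + 1) to the last
    valid column index.

    Illustrative sketch of the torch.where pattern quoted above; the
    name and the num_cols - 1 fallback are assumptions, not TRL source.
    """
    return [s if s < num_cols else num_cols - 1 for s in sequence_lengths_p1]
```

For example, with 5 reward columns, an out-of-range index of 6 would be clamped to 4 while an in-range index of 3 is kept as-is.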
-
Hey Kevin,
I hope you are doing well. I noticed a small bug where the step function returns only `obs, reward, done, info` instead of `obs, reward, terminated, truncated, info`. I came across th…
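For reference, the newer Gymnasium-style API splits the old `done` flag into `terminated` and `truncated`. A minimal shim illustrating the two signatures (assuming a legacy `done` maps to `terminated`, since the four-tuple carries no truncation information):

```python
def legacy_to_new_step(old_result):
    """Convert a legacy (obs, reward, done, info) step result into the
    five-tuple (obs, reward, terminated, truncated, info).

    Assumption: a legacy `done` is treated as termination; truncation
    (e.g. a time limit) cannot be recovered, so it is reported as False.
    """
    obs, reward, done, info = old_result
    return obs, reward, done, False, info
```

This kind of wrapper is only a stopgap; fixing the step function to return the five-tuple directly is the cleaner change.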
-
We aim to integrate global toast notifications into our project to indicate various actions. We will use the default Bootstrap toast in three modes: **success** (green), **error** (red), and **info** …
-
Got the following error:
"""
(albert㉿aimmore)-[~/Desktop/lpu_presentation/Supplement/Slither Myth]
└─$ slither ./3.sol
'solc --version' running
Traceback (most recent call last):
Fi…
-
Research and design a reward system based on the role dopamine plays in behavior. Basically, this would be a system of reinforcement learning where a Dooder is motivated to take action based on antici…
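One common way to model dopamine computationally is as a reward-prediction error, as in temporal-difference learning: the agent anticipates a reward, and the "dopamine" signal is the gap between what it got and what it expected. A minimal sketch along those lines (class and method names are hypothetical, not from the Dooder codebase):

```python
class AnticipationLearner:
    """Toy reward-prediction-error learner (hypothetical sketch).

    Models "dopamine" as delta = reward - anticipated_reward, and nudges
    the anticipation toward observed rewards at learning rate `lr`.
    """

    def __init__(self, lr=0.1):
        self.expected = {}  # action -> anticipated reward
        self.lr = lr

    def motivation(self, action):
        # Anticipated reward drives how motivated the agent is to act.
        return self.expected.get(action, 0.0)

    def update(self, action, reward):
        # Prediction error: the dopamine-like teaching signal.
        delta = reward - self.motivation(action)
        self.expected[action] = self.motivation(action) + self.lr * delta
        return delta
```

Actions whose rewards keep exceeding expectations produce positive deltas (rising motivation), while disappointing actions produce negative deltas and lose appeal.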
-
Hi
I've installed the latest Bagisto and when I try to install the "bagisto-reward-points" package with this command
`composer require bagisto/bagisto-reward-points`
it returns the below error:
…
-
So, the rewards you used are mostly OK, but I have noticed that the AI spends a lot of time in the Pokédex, and since the Pokédex has a lot of lines and they are different enough to trigger the explorati…
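One simple mitigation, sketched below, is to gate the novelty/exploration bonus off while a menu screen such as the Pokédex is open, so scrolling menu text never pays out (the function name and `in_menu` flag are hypothetical, not from your training code):

```python
def exploration_reward(screen_hash, seen, in_menu):
    """Novelty bonus for unseen screens, suppressed inside menus.

    Hypothetical sketch: `screen_hash` is any hashable summary of the
    current frame, `seen` is the set of hashes already rewarded, and
    `in_menu` flags menu screens (e.g. the Pokédex) where scrolling
    text should not count as exploration.
    """
    if in_menu:
        return 0.0  # menu lines are novel but not real exploration
    if screen_hash in seen:
        return 0.0
    seen.add(screen_hash)
    return 1.0
```

An alternative is to keep the bonus but down-weight it in menus, which preserves some incentive to check the Pokédex without making it a reward farm.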
-
### Question
Hello everyone,
I’m encountering some issues while running reinforcement learning experiments in IsaacLab with large agent counts, specifically when using more than 107 agents. Here’s…
-
After updating ComfyUI, the node fails; updating the node does not help, and it still cannot be used.
-
Hi,
Is there any plan to support the new Rewarded interstitial?
https://developers.google.com/admob/android/rewarded-interstitial