-
Tracking updates of www.redditgifts.com
-
1. Create observations per unit (obs_wrappers.py)
2. Create actions per unit and calculate log probs (policies.py)
3. Store new actions (buffers.py)
4. Create action masks per unit (sb3_action_mas…
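The per-unit pipeline above (observations in, masked actions and log probs out) can be sketched in plain Python. This is an illustrative sketch only: the function names and the 3-action space are hypothetical, not the repo's actual API in obs_wrappers.py/policies.py.

```python
import math

def masked_log_probs(logits, mask):
    # Steps 2 and 4 combined: illegal actions get logit -inf, then a
    # numerically stable log-softmax yields per-action log probabilities.
    masked = [l if m else float("-inf") for l, m in zip(logits, mask)]
    mx = max(masked)
    z = sum(math.exp(l - mx) for l in masked if l > float("-inf"))
    return [l - mx - math.log(z) if m else float("-inf")
            for l, m in zip(masked, mask)]

def per_unit_log_probs(unit_logits, unit_masks):
    # Step 1 produces one observation (and hence one logit vector) per
    # unit; here each unit's logits are masked independently (step 4).
    return {u: masked_log_probs(unit_logits[u], unit_masks[u])
            for u in unit_logits}
```

Masking before the softmax (rather than after) keeps the remaining legal actions a proper probability distribution, which is what the log probs stored in the buffer (step 3) must reflect.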
-
Hi,
Using your implementation of PPO, I can train a policy on the gym CartPole-v1 environment to consistently reach the maximum possible reward of 500 in about a minute, on my CPU without any GPU acceler…
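For context, the core of any PPO implementation is the clipped surrogate objective. The sketch below is not this repo's code; it is a minimal standalone version, with the usual default clip range of 0.2 as an assumed value.

```python
import math

def ppo_clip_objective(log_prob_new, log_prob_old, advantage, clip_eps=0.2):
    # Probability ratio r = pi_new(a|s) / pi_old(a|s), from log probabilities
    ratio = math.exp(log_prob_new - log_prob_old)
    clipped = min(max(ratio, 1.0 - clip_eps), 1.0 + clip_eps)
    # PPO maximizes the pessimistic minimum of the unclipped and clipped terms,
    # which bounds how far a single update can move the policy.
    return min(ratio * advantage, clipped * advantage)
```

With identical old and new log probs the objective is just the advantage; when the ratio drifts past 1 + clip_eps on a positive advantage, the clipped term caps the incentive.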
-
![image](https://github.com/facebookresearch/Pearl/assets/16304232/e80e2d84-8889-421d-8e6e-3536df8ce62e)
While using Pearl, VRAM consumption keeps increasing continuously. Is there any w…
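Without Pearl's internals in front of us, a common cause of steadily growing memory in RL training loops is an unbounded replay buffer, or stored tensors that keep the autograd graph alive. The capping idea can be sketched in plain Python (the class name is hypothetical):

```python
from collections import deque

class BoundedReplayBuffer:
    """Fixed-capacity buffer: the oldest transitions are evicted
    automatically, so memory stays bounded regardless of training length."""

    def __init__(self, capacity):
        self._data = deque(maxlen=capacity)

    def push(self, transition):
        # In a PyTorch setting you would also .detach() (and often .cpu())
        # tensors here, so a stored transition does not pin the whole
        # computation graph in GPU memory.
        self._data.append(transition)

    def __len__(self):
        return len(self._data)
```

If Pearl's buffer is already bounded, the other usual suspect is accumulating per-step tensors (losses, metrics) without detaching them.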
-
## 🚀 Feature
Implement more actor-critic RL algorithms, such as A2C, ACER, and TRPO.
### Motivation
The RL section in this project has very few popular algorithms, especially…
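To make the request concrete, the simplest of the three, A2C, reduces per step to a one-step TD advantage and two losses. This is a generic sketch of the algorithm, not code from this project:

```python
def a2c_losses(log_prob, value, reward, next_value, done, gamma=0.99):
    # One-step TD target: the critic bootstraps the return from next_value
    target = reward + gamma * next_value * (0.0 if done else 1.0)
    advantage = target - value
    actor_loss = -log_prob * advantage   # policy gradient weighted by advantage
    critic_loss = advantage ** 2         # squared TD error for the value head
    return actor_loss, critic_loss
```

ACER adds off-policy replay with truncated importance sampling on top of this, and TRPO replaces the plain gradient step with a KL-constrained update, so A2C is the natural first addition.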
-
### 🐛 Bug
I have noticed that whenever an evaluation run is executed, the mean episode length in the subsequent training log becomes greater than my configured episode horizon. So I guess the agent doesn't …
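One hypothesis consistent with the symptom (this is a sketch, not the library's actual logging code): if episode length is measured as the number of steps between `done` flags, evaluation steps taken in the same environment without a reset, and without emitting `done`, inflate the next logged episode.

```python
def mean_episode_length(step_dones):
    # Episode length = number of steps between consecutive `done` flags.
    lengths, count = [], 0
    for done in step_dones:
        count += 1
        if done:
            lengths.append(count)
            count = 0
    return sum(lengths) / len(lengths)

# Training alone: episodes end exactly at a horizon of 100 steps.
train_only = ([False] * 99 + [True]) * 2

# If 30 evaluation steps slip in without a reset and without a `done`,
# the next logged episode appears 130 steps long, exceeding the horizon.
eval_then_train = [False] * 30 + [False] * 99 + [True]
```

Checking whether the environment (or the step counter) is reset between evaluation and training would confirm or rule this out.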
-
This is tracking the next piece of https://github.com/dotnet/aspnetcore/issues/27576. **(Pointing out that this issue has an additional 20 reactions.)**
We want to provide the ability for circuits to b…
-
```
torch==2.1.0
transformers==4.35.0
peft==0.7.1
```
Based on https://huggingface.co/docs/transformers/v4.36.1/en/peft I used to be able to train a multi-adapter model interactively …
-
I don't see any example of the Actor-Critic method (reinforcement learning). Does SciSharp/Keras.NET support this? If yes, an example would be very helpful.
Thanks!
-
Compared to the Linux kernel currently used by heads, SeaBIOS has much smaller source code and binaries, which means a significantly smaller attack surface and less space consumed in CBFS, and yet S…