rl-agents Search Results

1000+ results
for rl-agents

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/rl #1486

[BUG] Unsqueezing the error in ddpg

https://github.com/pytorch/rl/blob/3595c45eeb0c3e45c41bfff99ecbc973a7d1704f/torchrl/objectives/ddpg.py#L272 this line is not compatible with multi-agent, as it unsuquezees a dimension before the ag…

matteobettini updated 1 year ago
2
hill-a/stable-baselines #552

[question] Custom output activation function for actor (A2C,…

I really like `stable_baselines` and would like to use it for a custom environment with continuous actions. To match the specific needs of the environment, I need to apply a custom activation function…

stefanbschneider updated 10 months ago
8
renatolfc/sched-rl-gym #2

Unable to leverage real-world .swf trace from PWA

Although I tried to replace the trace generator with one that can load directly load .swf files, the simulation step is extremely slow and unable to be used to train RL agents.

LANNDS18 updated 1 year ago
4
google-deepmind/open_spiel #953

A question about laser_tag

Is laser_tag not available for this game? When I run example --game=laser_tag, the following message appears: Creating game.. Starting new game... Initial state: State: ....... ....... ..*.*.…

GaoZiHong updated 10 months ago
5
Unity-Technologies/ml-agents #5936

Extending Imitation Learning model catalog with SQIL

**Is your feature request related to a problem? Please describe.** A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] BC algorithm can fall victim to covari…

stkovacevic94 updated 1 year ago
3
LucasAlegre/sumo-rl #163

Understanding plot.py and the experiments

Hi Lucas, First, I really like the Python library you and your team have put together. I currently use this library for my PhD research concerning collaborative multi-agent algorithms. My quer…

rohitrajgopalan updated 1 year ago
4
leap-hand/LEAP_Hand_Sim #4

Problem generating my own cache

When I try to run the mentioned code to generate a cache of stable grasps for different cube sizes: ``` for cube_scale in 0.9 0.95 1.0 1.05 1.1 do bash scripts/gen_grasp.sh $cube_scale custom_gr…

wredsen updated 1 year ago
1
sotetsuk/pgx #1039

Tak environment

# Tak Tak is an abstract strategy board game similar to Go, except pieces can stack on top of each other into the third dimension. There is an active community that plays this game, makes bots and …

ViliamVadocz updated 1 year ago
4
Unity-Technologies/ml-agents #5564

ML-agents python low-level API

Hello, ML agents developer! I'm very happy to use your training tools,but I have some problems in using them at present. I'm trying to use [Python-API.md](https://github.com/Unity-Technologies/ml-a…

X-DDDDD updated 1 year ago
6
upb-lea/ElectricGrid.jl #98

Benchmarking performance

Some specification on performance benchmarks: - run the benchmarks on CPU and GPU - Profiling to find bottleneck (if any) - Metrics for comparison: timesteps / unit time, updates / unit time - vary …

VikasChidananda updated 1 year ago
2

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for rl-agents

1000+ results
for rl-agents