reinforcement-learning-environment Search Results

1000+ results
for reinforcement-learning-environment

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

aws-neuron/aws-neuron-sdk #781

Error Finetuning Huggingface model - NEFF Not Found

Here's a strange error I keep getting and I haven't seen that anyone has posted it yet. I'm running the Causal Language Model finetuning [script found on the optimum-neuron examples page ](https://git…

greenguy33 updated 1 month ago
8
isaac-sim/IsaacLab #168

[Question] How to randomize the position of fixed joints?

In the reinforcement learning process, there will be some fixed joints whose positions are uncertain, in order to achieve the migration from simulation to real environment, I would like to be able to …

2361098148 updated 6 months ago
4
google-deepmind/mujoco #1679

Issue with Camera Visibility of Dynamically Merged XML Model…

Dear MuJoCo Support Team, I am currently encountering a problem with model visualization in a reinforcement learning setup using MuJoCo. To facilitate training in randomized environments, I dynamic…

kankannali updated 5 months ago
1
uchicago-computation-workshop/Spring2020 #1

04/09: Bainbridge

Comment below with questions or thoughts about the reading for this week's [workshop](https://github.com/uchicago-computation-workshop/Spring2020/tree/master/04-09_Bainbridge). Please make your com…

shevajia updated 4 years ago
77
opendilab/LightZero #224

how to well model a grid env when it changes frequently?

Suppose there is a game, a grid 10 by 10 ,each position was placed a piece of gold with a randomly positive value , and an agent do the mining job on this grid. The rule is when the agent digs a posit…

valkryhx updated 3 months ago
9
PKU-Alignment/omnisafe #223

[Question] why the form of IPO algorithm is not the same as …

### Required prerequisites - [X] I have read the documentation . - [X] I have searched the [Issue Tracker](https://github.com/OmniSafeAI/omnisafe/issues) and [Discussions](https://github.com/OmniSafe…

stvsd1314 updated 3 months ago
8
WEC-Sim/WEC-Sim #1276

WEC-Sim Example RM3

**Note: italicized text below is include as an example and should be updated before submission. If you feel any section is not applicable to your request, please replace with 'N/A' rather than delete …

donprofaghatise updated 3 months ago
8
RealVNF/distributed-drl-coordination #5

Questions and ideas about moving to hierarchical multi-agent…

hi @stefanbschneider Recently, I have finally completed the task of migrating `d-drl-coordination SB3` to the `rllib` version. After adding the curiosity module, I found that a similar success rate …

burnCalories updated 2 months ago
9
nsidn98/InforMARL #19

Have some questions about the scene?

What is the size unit of the map in the current scene design? What is the distance that the agent moves each time, and what is its unit? Is the current map continuous or grid-designed?

Yu-zx updated 3 months ago
28
leela-zero/leela-zero #1311

Facebook open sources elf opengo

https://research.fb.com/facebook-open-sources-elf-opengo/ ELF OpenGo ELF OpenGo is a reimplementation of AlphaGoZero / AlphaZero. It was trained on 2,000 GPUs over a two week period, and has achie…

kityanhem updated 6 years ago
416

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for reinforcement-learning-environment

1000+ results
for reinforcement-learning-environment