offline-rl Search Results

1000+ results
for offline-rl

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

tianheyu927/mopo #4

Why MOPO is given access to terminal function in rollout gen…

Hi, thanks for sharing your great work. However, I am confused about the rollout generation process. As I see in the code, the agent can access to a pre-defined terminal function to cut down the un…

IcarusWizard updated 3 years ago
2
CityofToronto/bdit_data-sources #654

Check Bluetooth readers are working.

To look at locations ```sql SELECT detector_name, street1, street2, bluetooth_id, wifi_id, detector_name_old, project, date_start, date_end, latitude, longitude, "PX", index1, loc FROM blueto…

radumas updated 10 months ago
4
Farama-Foundation/D4RL #136

Training Mujoco-Gym Tasks with Image Observations

Hi @justinjfu @aviralkumar2907, I want to train Mujoco-Gym Continuous Control Tasks with Image Observations. I figured out that `image_envs` branch support Image Observations. But when I did `gym.ma…

nileshop22 updated 9 months ago
1
facebookresearch/ReAgent #394

Handle datasets with no terminating states

Imagine that we use ReAgent to train a personalization policy, and the workflow is as follows: 1. We collect a number of user interaction histories (episodes) and train a DQN model in offline (Batch…

ikatsov updated 3 years ago
3
yg-smile/RL_VVC_dataset #2

2 questions

Sorry, I have two new problems through learning your program. 1. How do you define offline and online. 2. Why does (offline training online -------- training training -------- converting data) nee…

zhangdahua1 updated 2 years ago
8
Bellman-devs/bellman #1

add a development plan

link to in contribution guidelines, what is in scope etc

hstojic updated 3 years ago
1
backgom2357/Recommender_system_via_deep_RL #6

The problem of model training

Dear author, I am reproducing this code recently. I have some questions to ask you: 1. Why is the model training "MAX_EPISODE_NUM" set to 8000 ? Is the result better the more times you train? Will i…

nayujia updated 2 years ago
1
Zhendong-Wang/Diffusion-Policies-for-Offline-RL #12

question about the effect of timesteps of diffusion model ov…

![image](https://github.com/Zhendong-Wang/Diffusion-Policies-for-Offline-RL/assets/87383739/771568ad-84af-4db4-8e21-c5c2fda8701c) the above is the description of the effect of timesteps of diffusion …

thu-yao-01-luo updated 1 year ago
1
ManifoldRG/Manifold-KB #10

AF Survey - "Towards A Unified Agent with Foundation Models"

bfaught3 updated 8 months ago
2
google-research/planet #60

Train models on custom dataset

I am interested in training these models on a completely different dataset (i.e. not using DM Control or MuJoCo). I have recorded simulation data from Isaac Sim and I would like to train the models in…

adamconkey updated 3 years ago
1

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for offline-rl

1000+ results
for offline-rl