in-context-reinforcement-learning Search Results

752 results for in-context-reinforcement-learning

Best match


irthomasthomas/undecidability #904

[2210.03629] ReAct: Synergizing Reasoning and Acting in Lang…

- [ ] [[2210.03629] ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/abs/2210.03629) # [ReAct: Synergizing Reasoning and Acting in Language Models](https://arxiv.org/ab…

ShellLM updated 3 months ago
1
ros-simulation/gazebo_ros_pkgs #1268

[gazebo_ros]: 'Step control' of ROS-Gazebo for ROS-based Rei…

**Description** This is a feature request for supporting step control of Gazebo, via ROS. Gazebo independently offers step control via the gazebo::msgs::WorldControl message, but that functionality i…

alikureishy updated 2 years ago
13
liuyuemaicha/Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow #5

expected int32 got list containing tensors of type '_message…

at python grl_train.py Prepare Chitchat data in ./grl_data/ Tokenizing data in ./grl_data/chitchat.train.answer Tokenizing data in ./grl_data/chitchat.train.query Tokenizing data in ./grl_data/…

SeekPoint updated 5 years ago
8
JuDFTteam/best-of-atomistic-machine-learning #301

Clean up categorization I

- [x] Add category - [x] Update category: **Category details:** Currently, the distribution of projects among categories is very uneven. ``` Active learning 4 projects Bio…

Irratzo updated 3 months ago
1
JuliaApproximation/DomainSets.jl #115

clamp() and clamp!()

An important operation for reinforcement learning contexts is to clamp an input to the nearest point within a set. Does anyone have objections to implementing `Base.clamp` and `Base.clamp!` for some s…

zsunberg updated 2 years ago
3
DLR-RM/rl-baselines3-zoo #321

[Feature Request] Support Stochastic Weight Averaging (SWA) …

### 🚀 Feature Stochastic Weight Averaging (SWA) is a recently proposed technique that can potentially help improve training stability in DRL. There is now a new implementation in `torchcontrib`. Quoting/p…

pchalasani updated 2 years ago
2
crewAIInc/crewAI #1577

[BUG]Not able to use long term memory with Azure Open AI

### Description When using memory=True for a crew that uses Azure Open AI, there is an error creating long term memory. ### Steps to Reproduce ``` import os from chromadb.utils.embedding_…

talrejanikhil updated 2 weeks ago
1
facebookresearch/BenchMARL #76

Request for Example of AEC API Usage with Agent Masking in P…

I've been exploring the BenchMARL library and am impressed with its capabilities and design—great work! I am currently interested in implementing a multi-agent reinforcement learning scenario using…

wmn7 updated 7 months ago
6
derailed/k9s #2648

Remove angry emojis

### Is your feature request related to a problem? Please describe. Using angry emojis in response to invalid command inputs can inadvertently create a negative atmosphere, particularly for th…

teocns updated 7 months ago
1
openai/baselines #411

Robotics env. Failure to record video

When I press the "V" key to record a video of the "FetchReach-v1" environment, I get an error. I use Debian 9, Nvidia 390.59, CUDA 9, Python 3.6, TensorFlow 1.8. Traceback (most recent call last): Fil…

Baichenjia updated 6 years ago
1
