in-context-reinforcement-learning Search Results

752 results
for in-context-reinforcement-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

Stable-Baselines-Team/stable-baselines3-contrib #202

[Feature Request] Hybrid PPO

### 🚀 Feature Hello, in accordance with DLR-RM/stable-baselines3#1624, @SimRey and I would like to implement **Hybrid PPO** in this library. [This](https://arxiv.org/pdf/1903.01344.pdf) is the pa…

AlexPasqua updated 1 month ago
3
apache/lucene #1701

Extended spell checker with phrase support and adaptive user…

Extensive javadocs available in patch, but I also try to keep it compiled here: http://ginandtonique.org/\~kalle/javadocs/didyoumean/org/apache/lucene/search/didyoumean/package-summary.html#package_de…

asfimport updated 2 months ago
19
jupyter-book/jupyter-book #1912

Footnote does not show up with Utterances.

### Describe the bug Hi, I wish this is not a duplicated issue and I am sorry for my poor English in advance. **context** I have used Utterances as commenting service on my Jupyter Book and I jus…

HiddenBeginner updated 1 year ago
1
liuyuemaicha/Deep-Reinforcement-Learning-for-Dialogue-Generation-in-tensorflow #8

failed to run

Prepare Chitchat data in ./grl_data/ Reading development and training data (limit: 0). b_set length: 0 b_set length: 6 b_set length: 2 b_set length: 0 Creating st_model model with fresh para…

sherrytong updated 4 years ago
2
JuliaDynamics/Agents.jl #648

Multi-Agent RL

**Is your feature request related to a problem? Please describe.** First off, I would like to thank you for building and maintaining an amazing project! One feature, I would be interested in adding/c…

mplemay updated 3 weeks ago
16
openai/mujoco-py #683

distutils.errors.CompileError

**Describe the bug** I am trying to install mujoco on my windows10 laptop. But it report the error as follow. **To Reproduce** I have installed other gym environments like the CartPole, Bipedal…

DijieDeng updated 2 months ago
5
irthomasthomas/undecidability #706

LMOps/README.md at main · microsoft/LMOps

- [ ] [LMOps/README.md at main · microsoft/LMOps](https://github.com/microsoft/LMOps/blob/main/README.md?plain=1) # LMOps/README.md at main · microsoft/LMOps ## LMOps LMOps is a research initiati…

irthomasthomas updated 8 months ago
1
Thinking-with-Deep-Learning-Spring-2024/Readings-Responses #17

Week 9. May. 17: Reinforcement Learning - Orienting

Post your questions here about: “Reinforcement Learning” and “Deep Reinforcement Learning”, Thinking with Deep Learning, Chapters 15 & 16

JunsolKim updated 6 months ago
22
microsoft/DeepSpeedExamples #922

Actor loss nan and Resizing model embedding

The model I use is GPT-2 124M. When resizing model embeddings during the training of STF and RW, I often encounter issues where the generated answers consist entirely of zeros. This causes both the lo…

ouyanmei updated 3 months ago
1
meta-introspector/llama.cpp #3

llm implant

Designing a dynamic neural network implant for large language models involves implementing a system that can adapt and learn dynamically. Here's a high-level approach: ### Dynamic Neural Network Im…

jmikedupont2 updated 11 months ago
3

上一页 1...1 2 3 4 5 6 7...76 下一页

752 results for in-context-reinforcement-learning

752 results
for in-context-reinforcement-learning