in-context-reinforcement-learning Search Results

740 results for in-context-reinforcement-learning


HumanCompatibleAI/imitation #808

Trained reward function outputs constant zeros using MCE alg…

## Bug description Hi, I'm currently adapting the Inverse Reinforcement Learning algorithm to analyze the behavior of mice in our lab studies. For this, I have used the Maximum Causal Entropy (MCE) a…

spearsheep updated 10 months ago
1 comment

openjournals/joss-reviews #5052

[REVIEW]: Scarlet: Scalable Anytime Algorithms for Learning …

**Submitting author:** @ritamraha (Ritam Raha) **Repository:** https://github.com/rajarshi008/Scarlet **Branch with paper.md** (empty if default branch): main **Version:** v1.0.0 **Editor:** @adi3 **R…

editorialbot updated 9 months ago
79 comments

microsoft/autogen #108

Infinite Loops

Hello, https://github.com/JayZeeDesign/microsoft-autogen-experiments/blob/main/content_agent.py I got a few instances of infinite loops running the above with gpt4. - An agent, if I recall co…

adriangalilea updated 1 year ago
16 comments

pytorch/rl #861

[Feature Request] Examples Suggestion

## Motivation This is somewhat related to https://github.com/pytorch/rl/issues/849. I realize TorchRL is quite new and I expect the documentation and examples will improve over time, but I believe th…

smorad updated 8 months ago
35 comments

ManifoldRG/Manifold-KB #23

AF Survey - Reflexion: Language Agents with Verbal Reinforce…

Paper title: Reflexion: Language Agents with Verbal Reinforcement Learning ([link to paper](https://arxiv.org/pdf/2303.11366.pdf)) Estimated time to complete the review: by 09/22/23 If you are new t…

pranavguru updated 1 year ago
2 comments

NorbertZheng/read-papers #133

Sik-Ho Tang | Review: Representation Learning with Contrasti…

Sik-Ho Tang. [Review: Representation Learning with Contrastive Predictive Coding (CPC/CPCv1)](https://sh-tsang.medium.com/review-representation-learning-with-contrastive-predictive-coding-cpc-cpcv1-8e…

NorbertZheng updated 1 year ago
12 comments

DLR-RM/stable-baselines3 #1464

[Feature Request] AlphaZero development

### 🚀 Feature Include AlphaZero in the library of available RL algorithm possibly with maskable actions option. ### Motivation I am a beginner in Reinforcement Learning, but I get some interesting …

fede72bari updated 1 year ago
3 comments

Lightning-AI/utilities #130

Fabric support for TensorDict

### Description & Motivation [TensorDict](https://pytorch.org/rl/tensordict/) is a dictionary-like class that inherits properties from tensors, such as indexing, shape operations, casting to device…

belerico updated 1 year ago
4 comments

mosaicml/llm-foundry #343

How did you train the MPT-7b-chat model?

Hi everyone, thanks a lot for the great library! Could someone please explain a bit more on how the chat model has been trained? More specifically, I am interested in how the input/output data has …

eldarkurtic updated 1 year ago
9 comments

Farama-Foundation/Gymnasium #701

[Bug Report] AttributeError: module 'jax.numpy' has no attri…

### Describe the bug Import Gymnasium will result in error: ```python Traceback (most recent call last): ... File "*", line 5, in import gymnasium as gym File "*/venv/lib/python3.10/…

BillHuang2001 updated 1 year ago
1 comment
