-
### 🚀 Feature
Hi, since attention mechanisms and transformer architectures are everywhere these days, I think it would be good to have support for them in SB3 as well. I have found this implementation:
ht…
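For what it's worth, attention can already be plugged in through SB3's custom features extractor API. Below is a minimal sketch under my own assumptions (the class name `AttentionExtractor` and the sequence-shaped `(seq_len, token_dim)` observation are illustrative, not the linked implementation):

```python
from gymnasium import spaces  # `gym.spaces` on SB3 < 2.0
import torch
import torch.nn as nn
from stable_baselines3.common.torch_layers import BaseFeaturesExtractor


class AttentionExtractor(BaseFeaturesExtractor):
    """Treats a Box observation of shape (seq_len, token_dim) as a token sequence."""

    def __init__(self, observation_space: spaces.Box, features_dim: int = 64):
        super().__init__(observation_space, features_dim)
        _, token_dim = observation_space.shape
        self.embed = nn.Linear(token_dim, features_dim)
        self.attn = nn.MultiheadAttention(features_dim, num_heads=4, batch_first=True)
        self.norm = nn.LayerNorm(features_dim)

    def forward(self, observations: torch.Tensor) -> torch.Tensor:
        x = self.embed(observations)      # (batch, seq_len, features_dim)
        attn_out, _ = self.attn(x, x, x)  # self-attention across the sequence
        x = self.norm(x + attn_out)       # residual connection + layer norm
        return x.mean(dim=1)              # pool to a fixed-size feature vector


# Usage (assuming `env` exposes a Box observation of shape (seq_len, token_dim)):
# model = PPO("MlpPolicy", env, policy_kwargs=dict(
#     features_extractor_class=AttentionExtractor,
#     features_extractor_kwargs=dict(features_dim=64)))
```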
-
Hi @BinLiang-NLP! In this [line](https://github.com/HITSZ-HLT/JointCL/blob/2b8a3e13c32cc81fcb2936451ff8a6f87382bc47/run_semeval.py#L529), the test set is used as the validation set to determine whi…
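For illustration, here is the generic pattern being suggested (toy data and model, not JointCL's code): checkpoint selection is driven by a held-out dev set, and the test set is evaluated exactly once at the end.

```python
import copy
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset, random_split

torch.manual_seed(0)
data = TensorDataset(torch.randn(600, 16), torch.randint(0, 3, (600,)))  # toy stand-in
train_set, dev_set, test_set = random_split(data, [400, 100, 100])

model = nn.Linear(16, 3)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)
loss_fn = nn.CrossEntropyLoss()

def accuracy(split):
    correct = total = 0
    with torch.no_grad():
        for xb, yb in DataLoader(split, batch_size=100):
            correct += (model(xb).argmax(dim=1) == yb).sum().item()
            total += len(yb)
    return correct / total

best_dev_acc, best_state = -1.0, None
for epoch in range(20):
    for xb, yb in DataLoader(train_set, batch_size=32, shuffle=True):
        optimizer.zero_grad()
        loss_fn(model(xb), yb).backward()
        optimizer.step()
    dev_acc = accuracy(dev_set)  # model selection uses the dev set only
    if dev_acc > best_dev_acc:
        best_dev_acc, best_state = dev_acc, copy.deepcopy(model.state_dict())

model.load_state_dict(best_state)
print("test accuracy:", accuracy(test_set))  # test set touched once, after selection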
-
Hi,
I cannot use the setting with Stable-Baselines because of the TensorFlow version issue. I tried the PPO from Stable-Baselines3 instead. However, both the reward and the EWMA reward are super un…
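For context, a minimal SB3 PPO setup is sketched below. The environment name is only a placeholder, since the original environment is not shown, and wrapping with `VecNormalize` is a commonly suggested stabilizer for noisy rewards rather than something from the original post.

```python
from stable_baselines3 import PPO
from stable_baselines3.common.env_util import make_vec_env
from stable_baselines3.common.vec_env import VecNormalize

# "Pendulum-v1" is a placeholder for the actual environment.
env = make_vec_env("Pendulum-v1", n_envs=4, seed=0)
env = VecNormalize(env, norm_obs=True, norm_reward=True)  # assumption: helps with unstable rewards

model = PPO("MlpPolicy", env, verbose=1, seed=0)
model.learn(total_timesteps=100_000)

# Disable normalization updates when evaluating the trained agent.
env.training = False
env.norm_reward = False
```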
-
### ❓ Question
Hello,
I saw the previous post at https://github.com/DLR-RM/stable-baselines3/issues/543 with the corresponding paper and Google Colab notebooks. These helped for sure, thank y…
-
Run Stock_NeurIPS2018_2_Train.ipynb in FinRL/examples.
Colab reports an error:
![image](https://github.com/AI4Finance-Foundation/FinRL/assets/112242664/3f45e6a3-c9c3-49c9-81ea-98ae1519cb66)
t…
-
## Steps to Reproduce
1. Run `flutter create crash_test`.
2. Update the ios/Runner/Info.plist file as follows:
```
<key>CFBundleDevelopmentRegion</key>
<string>$(DEVELOPMENT_LANGUAGE)</string>
<key>CFBundl…
```
-
In the AnimalAI environment, when you want to take a step, the action has to be passed as `action.item()`. This may be an issue with AnimalAI, or it could be due to the way Stable Baselines stores the mo…
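A short sketch of the pattern described above (the AnimalAI specifics are assumptions; a standard Gymnasium env with discrete actions is used as a placeholder):

```python
# SB3's predict() returns a NumPy array, so an env that expects a plain Python
# scalar needs an explicit conversion before env.step().
import gymnasium as gym  # SB3 >= 2.0; older versions use gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")  # placeholder for the AnimalAI environment
model = PPO("MlpPolicy", env).learn(1_000)

obs, _ = env.reset()
action, _ = model.predict(obs, deterministic=True)
print(type(action))  # numpy.ndarray, not a plain Python int
obs, reward, terminated, truncated, info = env.step(action.item())  # scalar passed to the env
```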
-
### 🚀 Feature
According to this [paper](https://arxiv.org/pdf/2006.05990), recomputing the advantage can be helpful for PPO performance.
The function is provided by the `tianshou` library.
http…
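For reference, here is a rough sketch of what recomputing the advantage means (pure NumPy, not tianshou's or SB3's actual code): GAE is re-run with the current value estimates at the start of each optimization epoch, instead of once per rollout.

```python
import numpy as np

def compute_gae(rewards, values, dones, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one rollout.
    `values` has one extra entry: the value of the state after the last step."""
    advantages = np.zeros_like(rewards)
    last_gae = 0.0
    for t in reversed(range(len(rewards))):
        not_done = 1.0 - dones[t]
        delta = rewards[t] + gamma * values[t + 1] * not_done - values[t]
        last_gae = delta + gamma * lam * not_done * last_gae
        advantages[t] = last_gae
    return advantages

# Inside a PPO-style update loop (pseudocode around the NumPy core):
# for epoch in range(n_epochs):
#     values = value_net(observations_including_last)  # fresh value estimates
#     advantages = compute_gae(rewards, values, dones)  # recomputed each epoch
#     returns = advantages + values[:-1]
#     ...run the clipped policy / value losses on minibatches...
```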
-
### 🚀 Feature
Hello Stable Baselines community,
I am currently working on state representation learning in Robotics, using an observation that consists of a latent vector obtained by encoding simu…
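One way such a setup is often wired into SB3 is sketched below; this is only an assumption about the pipeline, and the linear encoder is a placeholder for a real, pretrained SRL model.

```python
import gymnasium as gym
import numpy as np
import torch
import torch.nn as nn


class LatentObservationWrapper(gym.ObservationWrapper):
    """Encodes raw observations with a frozen encoder so the policy only sees the latent vector."""

    def __init__(self, env: gym.Env, encoder: nn.Module, latent_dim: int):
        super().__init__(env)
        self.encoder = encoder.eval()
        self.observation_space = gym.spaces.Box(
            low=-np.inf, high=np.inf, shape=(latent_dim,), dtype=np.float32
        )

    def observation(self, obs):
        with torch.no_grad():
            x = torch.as_tensor(obs, dtype=torch.float32).flatten().unsqueeze(0)
            return self.encoder(x).squeeze(0).numpy()


# Placeholder encoder: projects the flattened raw observation to 32 latent dims.
base_env = gym.make("CartPole-v1")  # stand-in for the simulated robotics env
obs_dim = int(np.prod(base_env.observation_space.shape))
encoder = nn.Linear(obs_dim, 32)
env = LatentObservationWrapper(base_env, encoder, latent_dim=32)
# The wrapped env can then be passed to e.g. PPO("MlpPolicy", env).
```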
-
What environment is needed to run the code in the scripts? For example, gym and SB3?