-
not sure why this comment exists, I cannot delete it
-
Hi,
I am currently dealing with "agents/tf_agents/bandits/" . I am wondering where or if the classic Contextual Bandit off-policy evaluation procedures are present in Tensorflow.I mean exactly the…
-
Hello,
in Tutorial 8: Networks, there's a faulty link to Keras' network.py in the section Defining Networks. I am not sure how this bug came about but my guess is that Keras removed this API/file/w…
-
```
RainbowAgent(sess=tensorflow.compat.v1.Session(),
num_actions=2,
stack_size=4,
observation…
-
Hi, thanks for open sourcing this work! I tried running:
```
./ibc/ibc/configs/particle/run_mlp_ebm_langevin_best.sh 2
```
And got this error
```
File "ibc/ibc/train_eval.py", line 397, in mai…
-
Hello,
I believe the tutorial 1_dqn_tutorial.ipynb has an unnecessary import of dynamic_step_driver. The module is not used at all, so the scripts runs just fine when commented out. Furthermore, I …
-
Currently I have a huge dilemma:
- backport all my code to TF 1, in order to use Stable Baselines and my code in one project
- or use something less mature than Stable Baselines (eg TF Agents) only …
-
## Title: Semi-Automated Warehouse
### Submitter(s):
Samuele Burattini (University of Bologna)
### Description:
A manufacturing facility has its last production step aimed at packaging bat…
-
Here is a minimal example:
```python
from tf_agents.replay_buffers import tf_uniform_replay_buffer
import tensorflow as tf
batch_size = 8
counter = 0
data_spec = {'observations': tf.Tensor…
djl11 updated
4 years ago
-
I try to implement getting the second order gradient by tf_agent.
The reason why I do second order gradient is came from meta-learning algorithm [MAML](https://arxiv.org/abs/1703.03400).
First I c…