issues
search
sfujim
/
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
MIT License
599
stars
139
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Sampling from Replay Buffer
#17
Yigit-Kuyu
opened
2 months ago
0
A problem
#16
guest-oo
opened
6 months ago
0
Some questions about the experiments for demonstrating extrapolation error.
#15
awecefil
opened
11 months ago
0
Can the original BCQ paper be used with a discrete action space?
#14
VinalAsodia
closed
1 year ago
1
Does discrete BCQ use vae?
#13
zichuan-liu
closed
2 years ago
1
bug?
#12
tangbotony
closed
2 years ago
1
Performance of DDPG and BCQ
#11
SZH1230456
closed
1 year ago
1
Curious about figure 1.f for the true value and the estimation
#10
ReinholdM
closed
1 year ago
1
Whether the "done" condition was used incorrectly in the discrete action branch?
#9
XuJing1022
closed
3 years ago
0
Did you test your discrete-BCQ code before?
#8
wadx2019
closed
3 years ago
0
Training BCQ while evaluating policy with environment?
#7
HYDesmondLiu
closed
3 years ago
3
A potential bug in discrete BCQ FC network
#6
Trinkle23897
closed
3 years ago
1
Update README.md
#5
ReinaKousaka
closed
4 years ago
1
Discrete Environment other than Atari
#4
CaralHsi
closed
4 years ago
1
Minor changes for python 3.6
#3
pathway
opened
5 years ago
0
Can you please post software packages configuration?
#2
quanvuong
closed
5 years ago
6
Buffer size
#1
jack1442
closed
5 years ago
1